TY - GEN
T1 - Management of metadata in linguistic fieldwork
T2 - 4th International Conference on Language Resources and Evaluation, LREC 2004
AU - Hughes, Baden
AU - Penton, David
AU - Bird, Steven
AU - Bow, Catherine
AU - Wigglesworth, Gillian
AU - McConvell, Patrick
AU - Simpson, Jane
PY - 2004
Y1 - 2004
N2 - Many linguistic research projects collect large amounts of multimodal data in digital formats. Despite the plethora of data collection applications available, it is often difficult for researchers to identify and integrate applications which enable the management of collections of multimodal data in addition to facilitating the actual collection process itself. In research projects that involve substantial data analysis, data management becomes a critical issue. Whilst best practice recommendations in regard to data formats themselves are propagated through projects such as EMELD, HRELP and DOBES, there is little corresponding information available regarding best practice for field metadata management beyond the provision of standards by entities such as OLAC and IMDI. These general problems are further exacerbated in the context of multiple researchers in geographically-disparate or connectivity-challenged locations. We describe the design of a solution for a group of researchers collecting data on child language acquisition in Australian indigenous communities. We describe the context, identify pertinent issues, outline the mechanics of a solution, and finally report the implementation. In doing so, we provide an alternative model and an open source software application suite which aims to be sufficiently general that other research groups may consider adopting some or all of the infrastructure.
AB - Many linguistic research projects collect large amounts of multimodal data in digital formats. Despite the plethora of data collection applications available, it is often difficult for researchers to identify and integrate applications which enable the management of collections of multimodal data in addition to facilitating the actual collection process itself. In research projects that involve substantial data analysis, data management becomes a critical issue. Whilst best practice recommendations in regard to data formats themselves are propagated through projects such as EMELD, HRELP and DOBES, there is little corresponding information available regarding best practice for field metadata management beyond the provision of standards by entities such as OLAC and IMDI. These general problems are further exacerbated in the context of multiple researchers in geographically-disparate or connectivity-challenged locations. We describe the design of a solution for a group of researchers collecting data on child language acquisition in Australian indigenous communities. We describe the context, identify pertinent issues, outline the mechanics of a solution, and finally report the implementation. In doing so, we provide an alternative model and an open source software application suite which aims to be sufficiently general that other research groups may consider adopting some or all of the infrastructure.
UR - http://www.scopus.com/inward/record.url?scp=85037128949&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:85037128949
T3 - Proceedings of the 4th International Conference on Language Resources and Evaluation, LREC 2004
SP - 193
EP - 196
BT - Proceedings of the 4th International Conference on Language Resources and Evaluation, LREC 2004
A2 - Xavier, Maria Francisca
A2 - Costa, Rute
A2 - Ferreira, Fatima
A2 - Lino, Maria Teresa
A2 - Silva, Raquel
PB - European Language Resources Association (ELRA)
Y2 - 26 May 2004 through 28 May 2004
ER -