Challenges and Opportunities in Sociolinguistic Data and Metadata Sharing

被引:4
|
作者
Cieri, Christopher [1 ]
机构
[1] Univ Penn, Linguist Data Consortium, Philadelphia, PA 19104 USA
来源
LANGUAGE AND LINGUISTICS COMPASS | 2014年 / 8卷 / 11期
关键词
D O I
10.1111/lnc3.12112
中图分类号
H [语言、文字];
学科分类号
05 ;
摘要
Advances in computing technology coupled with recent focus on big data in the social sciences have provided the motivation and some of the infrastructure necessary for sociolinguists to share data among themselves and with researchers in related fields such as human language technologies (HLT). Collaboration among sociolinguists offers the promise to extend current knowledge beyond the community studies that have dominated the field for the past 50 years and focus more on regional and national patterns of variation and change and what they indicate about linguistic theory. Collaboration with HLT developers while relatively new and still uncommon has led to advances both in sociolinguistic methodology and in technologies suited to sociolinguistic research. Before the field can make full use of these advances, however, sociolinguists must confront a number of challenges. Studies that were developed with the intent of describing a single speech community presumably need not ensure, and in many cases have not ensured, consistency with prior work. Given this practice, attempts to compare phenomena across studies must addressmismatches at the levels of data elicitation and selection, coding practice, and the definition of underlying concepts. Adding to the confusion wrought by methodological differences, speech communities differ in ways that the field worker cannot always predict so that different and sometimes unique linguistic and non-linguistic features are found to vary with linguistic structure. This paper underscores the motivation for data sharing by identifying some limitations of comparisons based only on published papers and reviewing advances fueled by data sharing among linguists and between linguists and technology developers. It also documents some of the challenges that hinder data sharing by reviewing work that has build upon available corpora. Finally, it summarizes efforts outside of sociolinguistics that have proposed frameworks for sharing and comparing metadata and categories setting the stage for the papers that follow in these special issues.
引用
收藏
页码:472 / 485
页数:14
相关论文
共 50 条
  • [1] Data Sharing: Convert Challenges into Opportunities
    Figueiredo, Ana Sofia
    [J]. FRONTIERS IN PUBLIC HEALTH, 2017, 5
  • [2] SHARING BIOMECHANICAL DATA: CHALLENGES AND OPPORTUNITIES
    Hunt, M. A.
    [J]. OSTEOARTHRITIS AND CARTILAGE, 2020, 28 : S18 - S18
  • [3] Challenges and opportunities in sharing microbiome data and analyses
    Curtis Huttenhower
    Robert D. Finn
    Alice Carolyn McHardy
    [J]. Nature Microbiology, 2023, 8 : 1960 - 1970
  • [4] THE FUTURE OF GROUND DATA SHARING CHALLENGES AND OPPORTUNITIES
    不详
    [J]. AUSTRALIAN GEOMECHANICS JOURNAL, 2019, 54 (03): : 10 - 11
  • [5] Challenges and opportunities in sharing microbiome data and analyses
    Huttenhower, Curtis
    Finn, Robert D.
    Mchardy, Alice Carolyn
    [J]. NATURE MICROBIOLOGY, 2023, 8 (11) : 1960 - 1970
  • [6] The challenges and opportunities of mental health data sharing in the UK
    Ford, Tamsin
    Mansfield, Karen L
    Markham, Sarah
    McManus, Sally
    John, Ann
    O'Reilly, Dermot
    Newlove-Delgado, Tamsin
    Iveson, Matthew H
    Fazel, Mina
    Das Munshi, Jayati
    Dutta, Rina
    Leavy, Gerard
    Downs, Johnny
    Foley, Tom
    Russell, Abigail
    Maguire, Aideen
    Moon, Graham
    Kirkham, Elizabeth J
    Finning, Katie
    Russell, Ginny
    Moore, Anna
    Jones, Peter B
    Shenow, Sarah
    [J]. The Lancet Digital Health, 2021, 3 (06):
  • [7] Opportunities and challenges in sharing and reusing genomic interval data
    Xue, Bingjie
    Khoroshevskyi, Oleksandr
    Gomez, R. Ariel
    Sheffield, Nathan C.
    [J]. FRONTIERS IN GENETICS, 2023, 14
  • [8] Privacy challenges and research opportunities for genomic data sharing
    Bonomi, Luca
    Huang, Yingxiang
    Ohno-Machado, Lucila
    [J]. NATURE GENETICS, 2020, 52 (07) : 646 - 654
  • [9] The challenges and opportunities of mental health data sharing in the UK
    Ford, Tamsin
    Mansfield, Karen L.
    Markham, Sarah
    McManus, Sally
    John, Ann
    O'Reilly, Dermot
    Newlove-Delgado, Tamsin
    Iveson, Matthew H.
    Fazel, Mina
    Munshi, Jayati Das
    Dutta, Rina
    Leavy, Gerard
    Downs, Johnny
    Foley, Tom
    Russell, Abigail
    Maguire, Aideen
    Moon, Graham
    Kirkham, Elizabeth J.
    Finning, Katie
    Russell, Ginny
    Moore, Anna
    Jones, Peter B.
    Shenow, Sarah
    [J]. LANCET DIGITAL HEALTH, 2021, 3 (06): : E333 - E336
  • [10] Privacy challenges and research opportunities for genomic data sharing
    Luca Bonomi
    Yingxiang Huang
    Lucila Ohno-Machado
    [J]. Nature Genetics, 2020, 52 : 646 - 654