Technical Infrastructure at Linguistic Data Consortium: Software and Hardware Resources for Linguistic Data Creation

被引:0
|
作者
Maeda, Kazuaki [1 ]
Lee, Haejoong [1 ]
Grimes, Stephen [1 ]
Wright, Jonathan [1 ]
Parker, Robert [1 ]
Lee, David [1 ]
Mazzucchi, Andrea [1 ]
机构
[1] Univ Penn, Linguist Data Consortium, Philadelphia, PA 19104 USA
关键词
D O I
暂无
中图分类号
H [语言、文字];
学科分类号
05 ;
摘要
Linguistic Data Consortium (LDC) at the University of Pennsylvania has participated as a data provider in a variety of government-sponsored programs that support development of Human Language Technologies. As the number of projects increases, the quantity and variety of the data LDC produces have increased dramatically in recent years. In this paper, we describe the technical infrastructure, both hardware and software, that LDC has built to support these complex, large-scale linguistic data creation efforts at LDC. As it would not be possible to cover all aspects of LDC's technical infrastructure in one paper, this paper focuses on recent development. We also report on our plans for making our custom-built software resources available to the community as open source software, and introduce an initiative to collaborate with software developers outside LDC. We hope that our approaches and software resources will be useful to the community members who take on similar challenges.
引用
收藏
页数:6
相关论文
共 50 条
  • [21] Linguistic data as complex items
    Mereu, L
    LINGUISTIC REVIEW, 2004, 21 (3-4): : 211 - 233
  • [22] Linguistic summaries of process data
    Wilbik, Anna
    Dijkman, Remco M.
    2015 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE 2015), 2015,
  • [23] AN APPROACH TO THE LINGUISTIC SUMMARY OF DATA
    YAGER, RR
    FORD, KM
    CANAS, AJ
    LECTURE NOTES IN COMPUTER SCIENCE, 1991, 521 : 456 - 468
  • [24] STATISTICAL ESTIMATION WITH LINGUISTIC DATA
    KRUSE, R
    INFORMATION SCIENCES, 1984, 33 (03) : 197 - 207
  • [25] BAYESIAN ANALYSIS OF LINGUISTIC DATA
    POWERS, JE
    STATISTICAL METHODS IN LINGUISTICS, 1975, : 32 - 50
  • [26] Pipelined processing of linguistic data
    Stachowicz, M.S.
    Grantner, J.
    Kinndy, L.L.
    Advances in Modelling and Analysis B: Signals, Information, Data, Patterns, 1992, 23 (04): : 1 - 4
  • [27] LINGUISTIC SUMMARY OF FUZZY DATA
    DICESARE, F
    SAHNOUN, Z
    BONISSONE, PP
    INFORMATION SCIENCES, 1990, 52 (02) : 141 - 152
  • [28] On a linguistic description of dependencies in data
    Batyrshin, I
    Wagenknecht, M
    NEURAL NETWORKS AND SOFT COMPUTING, 2003, : 286 - 291
  • [29] THE EUROPEAN LINGUISTIC DIVERSITY - POLITICAL AND ECONOMIC DATA OF LINGUISTIC PLANNING
    JUCQUOIS, G
    LINGUISTIQUE, 1991, 27 (01): : 29 - 58
  • [30] Automated Creation of Mappings Between Data Specifications Through Linguistic and Structural Techniques
    Kalwar, Safia
    Rossi, Matteo
    Sadeghi, Mersedeh
    IEEE ACCESS, 2023, 11 : 30324 - 30339