Flexible multi-layer spoken dialogue corpora

被引:8
|
作者
Sauer, Simon [1 ]
Ludeling, Anke [1 ]
机构
[1] Humboldt Univ, Inst Deutsch Sprache & Linguist, Unter Linden 6, D-10099 Berlin, Germany
关键词
spoken corpora; multi-layer architecture; standoff; annotation; annotation tools; NITE XML TOOLKIT;
D O I
10.1075/ijcl.21.3.06sau
中图分类号
H0 [语言学];
学科分类号
030303 ; 0501 ; 050102 ;
摘要
This paper describes the construction of deeply annotated spoken dialogue corpora. To ensure a maximum of flexibility - in the degree of normalization, the types and formats of annotations, the possibilities for modifying and extending the corpus, or the use for research questions not originally anticipated - we propose a flexible multi-layer standoff architecture. We also take a closer look at the interoperability of tools and formats compatible with such an architecture. Free access to the corpus data through corpus queries, visualizations, and downloads - including documentation, metadata, and the original recordings - enables transparency, verifiability, and reproducibility of every step of interpretation throughout corpus construction and of any research findings obtained from this data.
引用
收藏
页码:419 / 438
页数:20
相关论文
共 50 条
  • [1] COMPUTATIONAL TOOLS AND SPOKEN CORPORA DESIGN: AN ONGOING DIALOGUE
    Vazquez Rozas, Victoria
    Barcala, Mario
    CAPLLETRA, 2020, (69): : 221 - 240
  • [2] Coextrusion: Flexible production of multi-layer pipes
    Stieglitz, H
    KUNSTSTOFFE-PLAST EUROPE, 2004, 94 (04): : 82 - 84
  • [3] Valence-Arousal Prediction of Chinese Words with Multi-layer Corpora
    Zhang, Xinrui
    Lin, Piyuan
    Chen, Siyuan
    Cen, Hongjie
    Wang, Jundong
    Huang, Qiangjia
    Xu, Yuhong
    Tang, Jiecong
    Huang, Peijie
    PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP), 2016, : 304 - 307
  • [4] Creating Spoken Dialogue Characters from Corpora without Annotations
    Gandhe, Sudeep
    Traum, David
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 2696 - 2699
  • [5] Reliability analysis for the design of a multi-layer flexible board
    Pan, FX
    Vatanporast, R
    55TH ELECTRONIC COMPONENTS & TECHNOLOGY CONFERENCE, VOLS 1 AND 2, 2005 PROCEEDINGS, 2005, : 1299 - 1304
  • [6] A flexible multi-layer metamaterial for filter and biosensor at THz
    Lan, L. J.
    Jin, B. B.
    Wu, J. B.
    Kang, L.
    Xu, W. W.
    Chen, J.
    Wu, P. H.
    2014 39TH INTERNATIONAL CONFERENCE ON INFRARED, MILLIMETER, AND TERAHERTZ WAVES (IRMMW-THZ), 2014,
  • [7] Augmenting variation of system utterances using corpora in spoken dialogue systems
    Higashinaka, R
    Prasad, R
    Walker, M
    2005 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING (ASRU), 2005, : 262 - 267
  • [8] PDTSC 2.0-Spoken Corpus with Rich Multi-layer Structural Annotation
    Mikulova, Marie
    Mirovsky, Jiri
    Nedoluzhko, Anja
    Pajas, Peter
    Stepanek, Jan
    Hajic, Jan
    TEXT, SPEECH, AND DIALOGUE, TSD 2017, 2017, 10415 : 129 - 137
  • [9] Multi-Layer Planar Terahertz Electric Metamaterials on Flexible Substrates
    Azad, Abul K.
    Chen, Hou-Tong
    Akhadov, Elshan
    Weisse-Bernstein, Nina R.
    Taylor, Antoinette J.
    O'Hara, John F.
    2008 CONFERENCE ON LASERS AND ELECTRO-OPTICS & QUANTUM ELECTRONICS AND LASER SCIENCE CONFERENCE, VOLS 1-9, 2008, : 1339 - 1340
  • [10] EXPERIMENTAL AND NUMERICAL STUDY OF A MULTI-LAYER FLEXIBLE PIPE DEPRESSURIZATION
    Lambert, Anais
    Felix-Henry, Antoine
    Gilbert, Philippe
    Gainville, Martin
    PROCEEDINGS OF THE ASME 31ST INTERNATIONAL CONFERENCE ON OCEAN, OFFSHORE AND ARTIC ENGINEERING, VOL 3, 2012, : 105 - 115