Multi-level annotation in SpeeCon Polish Speech Database

Authors
Marasek, K
Gubrynowicz, R
Affiliations
[1] Polish Japanese Inst Informat Technol, PL-02008 Warsaw, Poland
[2] Polish Acad Sci, Inst Fundamental Technol Res, PL-00049 Warsaw, Poland
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The SpeeCon Polish Speech Database was collected within the framework of the SpeeCon project, partially sponsored by the EC (IST-1999-10003). The database contains two sets of data, comprising 550 recording sessions with adults and 50 sessions with children, respectively. The adult speakers were recorded in various environments: offices, living rooms, cars and public places. The recordings contain free spontaneous speech passages, elicited spontaneous speech, phonetically compact words and sentences, general-purpose words and phrases, and application-specific words and utterances. One of the most important problems in constructing the database is defining the basis for a multi-level transcription composed of several tiers. These tiers can be grouped into three classes: linguistic, symbolic and physical representation. Orthographic transcription is applied to the sentence, phrase and word tiers; symbolic transcription related to grammar and articulation is applied to the part-of-speech, phoneme and syllable tiers; and mnemonics are used to describe some characteristics of the measurable physical data. The paper presents the rules applied to text, speech and noise transcriptions, and remarks on pronunciation varieties found in the database. The final part of the paper discusses the problem of creating the lexicon, an alphabetically ordered list of the distinct lexical items occurring in the recorded corpus. The Polish lexicon has been built up by various methods, including hand annotation and generation by rule followed by manual checking.
Pages: 58-67
Page count: 10