Linguistic Resources for Reconstructing Spontaneous Speech Text

被引:0
|
作者
Fitzgerald, Erin [1 ]
Jelinek, Frederick [1 ]
机构
[1] Johns Hopkins Univ, Ctr Language & Speech Proc, Baltimore, MD 21218 USA
关键词
D O I
暂无
中图分类号
H0 [语言学];
学科分类号
030303 ; 0501 ; 050102 ;
摘要
The output of a speech recognition system is not always ideal for subsequent downstream processing, in part because speakers themselves often make mistakes. A system would accomplish speech reconstruction of its spontaneous speech input if its output were to represent, in flawless, fluent, and content-preserving English, the message that the speaker intended to convey. These cleaner speech transcripts would allow for more accurate language processing as needed for NLP tasks such as machine translation and conversation summarization, which often rely on grammatical input. Recognizing that supervised statistical methods to identify and transform ill-formed areas of the transcript will require richly labeled resources, we have built the Spontaneous Speech Reconstruction corpus. This small corpus of reconstructed and aligned conversational telephone speech transcriptions for the Fisher conversational telephone speech corpus (Strassel and Walker, 2004) was annotated on several levels including string transformations and predicate-argument structure, and will be shared with the linguistic research community.
引用
收藏
页码:3449 / 3452
页数:4
相关论文
共 50 条
  • [1] Linguistic Resources Construction: Towards Disfluency Processing in Spontaneous Tunisian Dialect Speech
    Boughariou, Emna
    Bahou, Younes
    Bleguith, Lamia Hadrich
    [J]. TEXT, SPEECH, AND DIALOGUE (TSD 2019), 2019, 11697 : 316 - 328
  • [2] Linguistic patterns in spontaneous speech
    Lin, Phoebe M. S.
    [J]. CHINESE LANGUAGE AND DISCOURSE, 2012, 3 (02) : 278 - 283
  • [3] LINGUISTIC RULES FOR TEXT TO SPEECH SYNTHESIS
    UMEDA, N
    [J]. PROCEEDINGS OF THE IEEE, 1976, 64 (04) : 443 - 451
  • [4] Linguistic resources for meeting speech recognition
    Glenn, ML
    Strassel, S
    [J]. MACHINE LEARNING FOR MULTIMODAL INTERACTION, 2005, 3869 : 390 - 401
  • [5] Free Linguistic and Speech Resources for Tibetan
    Li, Guanyu
    Yu, Hongzhi
    Zheng, Thomas Fang
    Yan, Jinghao
    Xu, Shipeng
    [J]. 2017 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC 2017), 2017, : 733 - 736
  • [6] Speech-to-text and speech-to-speech summarization of spontaneous speech
    Furui, S
    Kikuchi, T
    Shinnaka, Y
    Hori, C
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2004, 12 (04): : 401 - 408
  • [7] Adaptive Text to Speech for Spontaneous Style
    Yan, Yuzi
    Tan, Xu
    Li, Bohan
    Zhang, Guangyan
    Qin, Tao
    Zhao, Sheng
    Shen, Yuan
    Zhang, Wei-Qiang
    Liu, Tie-Yan
    [J]. INTERSPEECH 2021, 2021, : 4668 - 4672
  • [8] LINGUISTIC-ARGUMENTATIVE RESOURCES IN ADVERTISING SPEECH
    de Oliveira, Esther Gomes
    de Azevedo, Melissa Carolina Herrero Rezende
    Nascimento, Suzete Silva
    [J]. LINGUAS & LETRAS, 2008, 9 (16): : 119 - 135
  • [9] Linguistic features of stuttering during spontaneous speech
    Warner, Haley J.
    Shroff, Ravi
    Zuanazzi, Arianna
    Arenas, Richard M.
    Jackson, Eric S.
    [J]. JOURNAL OF FLUENCY DISORDERS, 2023, 78
  • [10] The interplay of linguistic structure and breathing in German spontaneous speech
    Rochet-Capellan, Amelie
    Fuchs, Susanne
    [J]. 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 2013 - 2017