Analysis of spontaneous Japanese in a multi-language telephone-speech corpus

Cited by: 7
Authors
Arai, Takayuki [1]
Warner, Natasha [2]
Greenberg, Steven [3, 4]
Affiliations
[1] Sophia Univ, Dept Elect & Elect Engn, Chiyoda Ku, 7-1 Kioi Cho, Tokyo 1028554, Japan
[2] Univ Arizona, Dept Linguist, Tucson, AZ 85721 USA
[3] Silicon Speech, Santa Venetia, CA 94903 USA
[4] Tech Univ Denmark, Ctr Appl Hearing Res, DK-2800 Lyngby, Denmark
Keywords
Phonetic analysis; Spontaneous Japanese; Speech corpora; Frequency of occurrence; Duration;
DOI
10.1250/ast.28.46
Chinese Library Classification (CLC) number
O42 [Acoustics];
Discipline classification code
070206 ; 082403 ;
Abstract
An analysis of pronunciation variation in the Japanese component of the Oregon Graduate Institute Multi-Language Telephone Speech (OGI-TS) Corpus is presented. The analysis covers reduction and deletion, as well as the frequencies of occurrence and durations of both vowels and consonants in the corpus. The corpus contains 90 calls, each uttered by a unique adult speaker. Filled pauses, hesitations, and other instances of interruption in the speech stream were also transcribed. Devoicing of non-high vowels is more common in this corpus than would be anticipated on the basis of the published literature. In Japanese, the main difference between careful and spontaneous speech is in the proportion of vowel devoicing and deletion. Variations in the pronunciation of consonants in Japanese include the glottal fricative, nasalization of vowels before nasals, and other forms of consonant reduction.
Pages: 46-48
Page count: 3