Analysis of spontaneous Japanese in a multi-language telephone-speech corpus

Cited by: 7

Authors:

Arai, Takayuki [1]

Warner, Natasha [2]

Greenberg, Steven [3, 4]

Affiliations:

[1] Sophia Univ, Dept Elect & Elect Engn, Chiyoda Ku, 7-1 Kioi Cho, Tokyo 1028554, Japan

[2] Univ Arizona, Dept Linguist, Tucson, AZ 85721 USA

[3] Silicon Speech, Santa Venetia, CA 94903 USA

[4] Tech Univ Denmark, Ctr Appl Hearing Res, DK-2800 Lyngby, Denmark

Source:

ACOUSTICAL SCIENCE AND TECHNOLOGY | 2007 / Vol. 28 / No. 01

Keywords:

Phonetic analysis; Spontaneous Japanese; Speech corpora; Frequency of occurrence; Duration;

DOI:

10.1250/ast.28.46

Chinese Library Classification (CLC) number:

O42 [Acoustics];

Discipline classification code:

070206 ; 082403 ;

Abstract:

An analysis of pronunciation variation in the Japanese component of the Oregon Graduate Institute Multi-Language Telephone Speech (OGI-TS) Corpus is presented. The analysis covers reduction and deletion, as well as the frequency of occurrence and duration of both vowels and consonants in the corpus. The corpus contains 90 calls, each made by a unique adult speaker. Filled pauses, hesitations, and other instances of interruption in the speech stream were also transcribed. Devoicing of non-high vowels is more common in this corpus than would be anticipated on the basis of the published literature. In Japanese, the main difference between careful and spontaneous speech is in the proportion of vowel devoicing and deletion. Variations in the pronunciation of consonants in Japanese include the glottal fricative, nasalization of vowels before nasals, and other forms of consonant reduction.


Pages: 46-48

Page count: 3
