Recent progress in corpus-based spontaneous speech recognition

被引：19

作者：

Furui, S ^{[1
]}

机构：

[1] Tokyo Inst Technol, Tokyo 1528552, Japan

来源：

IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS | 2005年 / E88D卷 / 03期

关键词：

spontaneous speech recognition; corpus; model adaptation; indexing; summarization;

D O I：

10.1093/ietisy/e88-d.3.366

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper overviews recent progress in the development of corpus-based spontaneous speech recognition technology. Although speech is in almost any situation spontaneous, recognition of spontaneous speech is an area which has only recently emerged in the field of automatic speech recognition. Broadening the application of speech recognition depends crucially on raising recognition performance for spontaneous speech. For this purpose, it is necessary to build large spontaneous speech corpora for constructing acoustic and language models. This paper focuses on various achievements of a Japanese 5-year national project "Spontaneous Speech: Corpus and Processing Technology" that has recently been completed. Because of various spontaneous-speech specific phenomena, such as filled pauses, repairs, hesitations, repetitions and disfluencies, recognition of spontaneous speech requires various new techniques. These new techniques include flexible acoustic modeling, sentence boundary detection, pronunciation modeling, acoustic as well as language model adaptation, and automatic summarization. Particularly automatic summarization including indexing, a process which extracts important and reliable parts of the automatic transcription, is expected to play an important role in building various speech archives, speech-based information retrieval systems, and human-computer dialogue systems.

引用

页码：366 / 375

页数：10

共 50 条

[1] Recent Progress of Mandrain Spontaneous Speech Recognition on Mandrain Conversation Dialogue Corpus
Deng, Yu-Chih
Wang, Yih-Ru
Chen, Sin-Horng
Chiang, Chen-Yu
[J]. 2019 22ND CONFERENCE OF THE ORIENTAL COCOSDA INTERNATIONAL COMMITTEE FOR THE CO-ORDINATION AND STANDARDISATION OF SPEECH DATABASES AND ASSESSMENT TECHNIQUES (O-COCOSDA), 2019, : 84 - 89
[2] Corpus-based study of repair cues in spontaneous speech
Nakatani, C.H.
Hirschberg, J.
[J]. Journal of the Acoustical Society of America, 1994, 95 (03):
[3] A CORPUS-BASED STUDY OF REPAIR CUES IN SPONTANEOUS SPEECH
NAKATANI, CH
HIRSCHBERG, J
[J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1994, 95 (03): : 1603 - 1616
[4] Recent progress in spontaneous speech recognition and understanding
Furui, S
[J]. PROCEEDINGS OF THE 2002 IEEE WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING, 2002, : 253 - 258
[5] FAST SEGMENT SEARCH FOR CORPUS-BASED SPEECH ENHANCEMENT BASED ON SPEECH RECOGNITION TECHNOLOGY
Ogawa, Atsunori
Kinoshita, Keisuke
Hori, Takaaki
Nakatani, Tomohiro
Nakamura, Atsushi
[J]. 2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
[6] On the use of prosodic labelling in corpus-based linguistic studies of spontaneous speech
Braga, D
Freitas, D
Teixeira, JP
Marques, A
[J]. TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2003, 2807 : 388 - 393
[7] Prosodic Patterns of Estonian words: a Corpus-Based Description Using Spontaneous Speech
Nemoto, Rena
Adda-Decker, Martine
[J]. HUMAN LANGUAGE TECHNOLOGIES: THE BALTIC PERSPECTIVE, 2012, 247 : 286 - +
[8] Recent and Applied Corpus-based Studies
不详
[J]. INTERNATIONAL JOURNAL OF ENGLISH STUDIES, 2009, 9 (03): : VII - IX
[9] Special section on corpus-based speech technologies
Shikano, K
Tokuda, K
Matsui, T
Shinoda, K
[J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2005, E88D (03) : 365 - 365
[10] Corpus-based methods in language and speech processing
Bruce, R
[J]. COMPUTATIONAL LINGUISTICS, 1998, 24 (02) : 317 - 318

← 1 2 3 4 5 →