Japanese speech databases for robust speech recognition

被引:0
|
作者
Nakamura, A
Matsunaga, S
Shimizu, T
Tonomura, M
Sagisaka, Y
机构
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Ar ATR, a next-generation speech translation system is under development towards natural trans-language communication. To cope with the various requirements to speech recognition technology for the new system, further research efforts should emphasize the robustness for large vocabulary, speaking variations often found in fast spontaneous speech and speaker variances. These are key problems to be solved not only for speech translation bur also far the general use of speech recognition in real environments In this paper, three large speech databases are designed to cope with these problems in speech recognition acid the current status of data collection is reported.
引用
收藏
页码:2199 / 2202
页数:4
相关论文
共 50 条
  • [1] Building Robust Emotion Recognition System on Heterogeneous Speech Databases
    Yoon, Won-Jung
    Park, Kyu-Sik
    [J]. IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2011, 57 (02) : 747 - 750
  • [2] Building Robust Emotion Recognition System on Heterogeneous Speech Databases
    Yoon, Won-Jung
    Park, Kyu-Sik
    [J]. IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS (ICCE 2011), 2011, : 825 - 826
  • [3] A robust speech analysis in speech recognition
    Miyanaga, Y
    Gozen, S
    Ohtsuki, N
    [J]. 2000 5TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS I-III, 2000, : 706 - 709
  • [4] Speech Databases, Speech Features, and Classifiers in Speech Emotion Recognition: A Review
    Mohmad Dar, G.H.
    Delhibabu, Radhakrishnan
    [J]. IEEE Access, 2024, 12 : 151122 - 151152
  • [5] Speech parameters for the robust emotional speech recognition
    Kim, Weon-Goo
    [J]. Journal of Institute of Control, Robotics and Systems, 2010, 16 (12) : 1137 - 1142
  • [6] Robust speech detector for speech recognition applications
    Liang, WQ
    Chen, YN
    Shan, YX
    Liu, J
    Liu, RS
    [J]. 2002 IEEE REGION 10 CONFERENCE ON COMPUTERS, COMMUNICATIONS, CONTROL AND POWER ENGINEERING, VOLS I-III, PROCEEDINGS, 2002, : 453 - 456
  • [7] Histogram equalization of speech representation for robust speech recognition
    de la Torre, A
    Peinado, AM
    Segura, JC
    Pérez-Córdoba, JL
    Benítez, MC
    Rubio, AJ
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2005, 13 (03): : 355 - 366
  • [8] Normalization of the Speech Modulation Spectra for Robust Speech Recognition
    Xiao, Xiong
    Chng, Eng Siong
    Li, Haizhou
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2008, 16 (08): : 1662 - 1674
  • [9] SPEECH RECOGNITION FOR JAPANESE PHRASES
    KAMIYA, S
    KIYAMA, J
    HAKARIDANI, M
    TANAKA, A
    [J]. SHARP TECHNICAL JOURNAL, 1991, (49): : 23 - 26
  • [10] Robust distributed speech recognition using speech enhancement
    Flynn, Ronan
    Jones, Edward
    [J]. IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2008, 54 (03) : 1267 - 1273