Updated MINDS Report on Speech Recognition and Understanding, Part 2

被引:28
|
作者
Baker, Janet M. [1 ]
Deng, Li [2 ,3 ]
Khudanpur, Sanjeev [4 ]
Lee, Chin-Hui [5 ,6 ]
Glass, James R. [7 ]
Morgan, Nelson [8 ,9 ]
O'Shaughnessy, Douglas [10 ]
机构
[1] Saras Inst, W Newton, MA USA
[2] Univ Washington, Seattle, WA 98195 USA
[3] Microsoft Res, Redmond, WA USA
[4] Johns Hopkins Univ, GWC Whiting Sch Engn, Baltimore, MD USA
[5] Georgia Inst Technol, Sch ECE, Atlanta, GA 30332 USA
[6] Bell Labs, Murray Hill, NJ 07974 USA
[7] MIT, Comp Sci & Artificial Intelligence Lab, Cambridge, MA 02139 USA
[8] Univ Calif Berkeley, ICSI, Res Lab, Berkeley, CA 94720 USA
[9] Univ Calif Berkeley, Dept Elect Engn & Comp Sci, Berkeley, CA 94720 USA
[10] Univ Quebec, INRS EMT, Ste Foy, PQ G1V 2M3, Canada
关键词
BRAIN ACTIVITY; CONSTRAINTS; MODELS; WORDS;
D O I
10.1109/MSP.2009.932707
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The second part of the updated version of "MINDS 2006-2007 Report of the Speech Understanding Working Group" is presented which came from two workshops entitled "Meeting of the MINDS: Future Directions for Human Language Technology". The specific topics being discussed include: the fundamental science of human speech perception and production; transcription to meaning extraction; understanding the cortical speech/language processing; the heterogeneous knowledge sources for automatic speech recognition; the information-bearing elements of the speech signal; the novel computational architectures for knowledge-rich speech recognition; the adaptation and self-learning in speech recognition systems; the robustness and context-awareness in acoustic models for speech recognition; the speaker's acoustic environment and the speech acquisition channel; the speaker characteristics and style; the language characteristics; robust speech recognition in everyday environments; and finally, the novel search procedures for knowledge-rich speech recognition.
引用
收藏
页码:78 / 85
页数:8
相关论文
共 50 条
  • [1] Research Developments and Directions in Speech Recognition and Understanding, Part 1
    Baker, Janet M.
    Deng, Li
    Glass, James
    Khudanpur, Sanjeev
    Lee, Chin-Hui
    Morgan, Nelson
    O'Shaughnessy, Douglas
    IEEE SIGNAL PROCESSING MAGAZINE, 2009, 26 (03) : 75 - 80
  • [2] SPEECH RECOGNITION AND UNDERSTANDING
    VINTSYUK, TK
    CYBERNETICS, 1982, 18 (05): : 657 - 669
  • [3] The Effect of Part-of-speech on Mandarin Speech Recognition
    Gong, Caixia
    Li, Xiangang
    Wu, Xihong
    2013 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2013,
  • [4] Continuous Chinese speech recognition and understanding
    Gu, JH
    Liu, JM
    Shen, XQ
    ICICS - PROCEEDINGS OF 1997 INTERNATIONAL CONFERENCE ON INFORMATION, COMMUNICATIONS AND SIGNAL PROCESSING, VOLS 1-3: THEME: TRENDS IN INFORMATION SYSTEMS ENGINEERING AND WIRELESS MULTIMEDIA COMMUNICATIONS, 1997, : 989 - 992
  • [5] Toward robust speech recognition and understanding
    Furui, S
    TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2003, 2807 : 2 - 11
  • [6] Toward Robust Speech Recognition and Understanding
    Sadaoki Furui
    Journal of VLSI signal processing systems for signal, image and video technology, 2005, 41 : 245 - 254
  • [7] Toward robust speech recognition and understanding
    Furui, S
    JOURNAL OF VLSI SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2005, 41 (03): : 245 - 254
  • [8] Recent progress in spontaneous speech recognition and understanding
    Furui, S
    PROCEEDINGS OF THE 2002 IEEE WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING, 2002, : 253 - 258
  • [9] Prosody modeling for automatic speech recognition and understanding
    Shriberg, E
    Stolcke, A
    MATHEMATICAL FOUNDATIONS OF SPEECH AND LANGUAGE PROCESSING, 2004, 138 : 105 - 114
  • [10] Study on Chinese speech recognition and understanding system
    Qiu, Wei
    Xu, Bingzheng
    Zhong, Wenqing
    Huanan Ligong Daxue Xuebao/Journal of South China University of Technology, 1996, 24 (04):