UNSUPERVISED VOCABULARY SELECTION FOR REAL-TIME SPEECH RECOGNITION OF LECTURES

被引:0
|
作者
Maergner, Paul [1 ]
Waibel, Alex [1 ]
Lane, Ian [1 ]
机构
[1] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
关键词
Vocabulary selection; automatic speech recognition; language model adaptation;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this work, we propose a novel method for vocabulary selection to automatically adapt automatic speech recognition systems to the diverse topics that occur in educational and scientific lectures. Utilizing materials that are available before the lecture begins, such as lecture slides, our proposed framework iteratively searches for related documents on the web and generates a lecture-specific vocabulary based on the resulting documents. In this paper, we propose a novel method for vocabulary selection where we first collect documents similar to an initial seed document and then rank the resulting vocabulary based on a score which is calculated using a combination of word features. This is a critical component for adaptation that has typically been overlooked in prior works. On the interACT German-English simultaneous lecture translation system our proposed approach significantly improved vocabulary coverage, reducing the out-of-vocabulary rate, on average by 57.0% and up to 84.9%, compared to a lecture-independent baseline. Furthermore, our approach reduced the word error rate, by 12.5% on average and up to 25.3%, compared to a lecture-independent baseline.
引用
收藏
页码:4417 / 4420
页数:4
相关论文
共 50 条
  • [1] AN UNSUPERVISED VOCABULARY SELECTION TECHNIQUE FOR CHINESE AUTOMATIC SPEECH RECOGNITION
    Zhang, Yike
    Zhang, Pengyuan
    Li, Ta
    Yan, Yonghong
    [J]. 2016 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2016), 2016, : 420 - 425
  • [2] REAL-TIME SPEECH RECOGNITION
    CAELEN, J
    CASTAN, S
    PERENNOU, G
    [J]. AUTOMATISME, 1972, 17 (03): : 87 - &
  • [3] The Recognition of Whispered Speech in Real-Time
    Hendrickson, Kristi
    Ernest, Danielle
    [J]. EAR AND HEARING, 2022, 43 (02): : 554 - 562
  • [4] INTEGRATED-CIRCUITS FOR A REAL-TIME LARGE-VOCABULARY CONTINUOUS SPEECH RECOGNITION SYSTEM
    STOLZLE, A
    NARAYANASWAMY, S
    MURVEIT, H
    RABAEY, JM
    BRODERSEN, RW
    [J]. IEEE JOURNAL OF SOLID-STATE CIRCUITS, 1991, 26 (01) : 2 - 11
  • [5] A FLEXIBLE ARCHITECTURE FOR REAL-TIME SPEECH RECOGNITION
    MORENO, F
    ALEXANDRES, S
    MENESES, J
    [J]. MICROPROCESSING AND MICROPROGRAMMING, 1993, 37 (1-5): : 69 - 72
  • [6] Real-time recognition of broadcast radio speech
    Cook, GD
    Christie, JD
    Clarkson, PR
    Hochberg, MM
    Logan, BT
    Robinson, AJ
    Seymour, CW
    [J]. 1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 141 - 144
  • [7] A VLSI WORD-PROCESSING SUBSYSTEM FOR A REAL-TIME LARGE VOCABULARY CONTINUOUS SPEECH RECOGNITION SYSTEM
    STOLZLE, A
    NARAYANASWAMY, S
    KORNEGAY, K
    RABAEY, J
    BRODERSEN, RW
    [J]. PROCEEDINGS OF THE IEEE 1989 CUSTOM INTEGRATED CIRCUITS CONFERENCE, 1989, : 611 - 615
  • [8] A VLSI GRAMMAR PROCESSING SUBSYSTEM FOR A REAL-TIME LARGE-VOCABULARY CONTINUOUS SPEECH RECOGNITION SYSTEM
    CHEN, DC
    YU, R
    RABAEY, J
    BRODERSEN, RW
    [J]. IEEE JOURNAL OF SOLID-STATE CIRCUITS, 1991, 26 (03) : 443 - 448
  • [9] A large Czech vocabulary recognition system for real-time applications
    Nouza, J
    [J]. TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2000, 1902 : 217 - 222
  • [10] LARGE VOCABULARY ISOLATED WORD RECOGNITION - A REAL-TIME IMPLEMENTATION
    VICENZI, C
    FAVARETO, C
    SCIARRA, D
    CAROSSINO, A
    COLLA, AM
    SCAGLIOLA, C
    PEDRAZZI, P
    [J]. IEE PROCEEDINGS-I COMMUNICATIONS SPEECH AND VISION, 1989, 136 (02): : 127 - 132