A CONTEXT-AWARE SPEECH RECOGNITION AND UNDERSTANDING SYSTEM FOR AIR TRAFFIC CONTROL DOMAIN

Authors
Oualil, Youssef [1]
Klakow, Dietrich [1]
Szaszak, Gyoergy [1]
Srinivasamurthy, Ajay [3]
Helmke, Hartmut [2]
Motlicek, Petr [3]
Affiliations
[1] Saarland Univ UdS, Spoken Language Syst Grp LSV, Saarbrucken, Germany
[2] German Aerosp Ctr DLR, Inst Flight Guidance, Braunschweig, Germany
[3] Idiap Res Inst, Martigny, Switzerland
Funding
EU Horizon 2020
Keywords
Automatic speech recognition; context-aware systems; air traffic control; spoken language understanding
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Automatic Speech Recognition and Understanding (ASRU) systems can generally use temporal and situational context information to improve their performance on a given task. This is typically done by rescoring the ASR hypotheses or by dynamically adapting the ASR models. In some domains, however, such as Air Traffic Control (ATC), this context information can be small in size, partial, and available only as abstract concepts (e.g. airline codes), which are difficult to map onto the full set of possible spoken sentences needed for rescoring or adaptation. This paper presents a multi-modal ASRU system that dynamically integrates partial temporal and situational ATC context information to improve its performance. This is done either by 1) extracting word sequences that carry relevant ATC information from the ASR N-best lists and then performing a context-based rescoring on the extracted ATC segments, or 2) partially adapting the language model. Experiments conducted on 4 hours of test data from Prague and Vienna approach (arrivals) showed a relative reduction in the ATC command error rate of 30% to 50%.
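As a rough illustration of the first strategy named in the abstract (context-based rescoring of N-best hypotheses), the sketch below boosts any hypothesis that contains a callsign expected from the current traffic situation. It is a simplified toy under stated assumptions, not the authors' implementation: the function names, the additive `bonus`, the log-score values, and the example callsigns are all hypothetical, and the paper's step of first extracting ATC segments is reduced here to a plain phrase match.

```python
def contains_phrase(words, phrase):
    """Return True if `phrase` (a list of words) occurs contiguously in `words`."""
    n = len(phrase)
    return any(words[i:i + n] == phrase for i in range(len(words) - n + 1))


def rescore_nbest(nbest, context_callsigns, bonus=2.0):
    """Re-rank (text, score) hypotheses, highest score first.

    A hypothesis receives `bonus` (hypothetical additive boost) if it
    contains any callsign present in the situational context.
    """
    rescored = []
    for text, score in nbest:
        words = text.lower().split()
        matched = any(
            contains_phrase(words, callsign.lower().split())
            for callsign in context_callsigns
        )
        rescored.append((text, score + bonus if matched else score))
    return sorted(rescored, key=lambda pair: pair[1], reverse=True)


# Hypothetical N-best list: the recognizer's top hypothesis has a wrong callsign digit.
nbest = [
    ("lufthansa three two nine descend flight level eight zero", -10.0),
    ("lufthansa three two one descend flight level eight zero", -10.5),
]
# Hypothetical context: callsigns of aircraft currently in the sector.
context = ["lufthansa three two one", "austrian four five six"]

best_text, best_score = rescore_nbest(nbest, context)[0]
print(best_text)
```

With these toy values, the second hypothesis is lifted from -10.5 to -8.5 and overtakes the original top hypothesis, which is the kind of correction the context information is meant to enable.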
Pages: 404-408
Number of pages: 5