Detecting English Speech in the Air Traffic Control Voice Communication

被引:2
|
作者
Szoke, Igor [1 ]
Kesiraju, Santosh [1 ]
Novotny, Ondrej [1 ]
Kocour, Martin [1 ]
Vesely, Karel [1 ]
Cernocky, Jan [1 ]
机构
[1] Brno Univ Technol, Fac Informat Technol, Speech FIT, Brno, Czech Republic
来源
关键词
speech recognition; language detection; x-vector extractor; acoustic model; air-traffic communication; data collection; text embeddings; Bayesian methods;
D O I
10.21437/Interspeech.2021-1033
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
Developing in-cockpit voice enabled applications require a real-world dataset with labels and annotations. We launched a community platform for collecting the Air-Traffic Control (ATC) speech, world-wide in the ATCO(2) project. Filtering out non-English speech is one of the main components in the data processing pipeline. The proposed English Language Detection (ELD) system is based on the embeddings from Bayesian subspace multinomial model. It is trained on the word confusion network from an ASR system. It is robust, easy to train, and light weighted. We achieved 0.0439 equal-error-rate (EER), a 50% relative reduction as compared to the state-of-the-art acoustic ELD system based on x-vectors, in the in-domain scenario. Further, we achieved an EER of 0.1352, a 33% relative reduction as compared to the acoustic ELD, in the unseen language (out-of-domain) condition. We plan to publish the evaluation dataset from the ATCO(2) project.
引用
收藏
页码:3286 / 3290
页数:5
相关论文
共 50 条
  • [31] A Unified Framework for Multilingual Speech Recognition in Air Traffic Control Systems
    Lin, Yi
    Guo, Dongyue
    Zhang, Jianwei
    Chen, Zhengmao
    Yang, Bo
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 32 (08) : 3608 - 3620
  • [32] ADVANCED SPEECH TECHNOLOGY APPLIED TO PROBLEMS OF AIR-TRAFFIC CONTROL
    GRADY, MW
    [J]. IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 1975, 11 (04) : 685 - 685
  • [33] Improved air-traffic control voice-communications systems for UK
    不详
    [J]. AIRCRAFT ENGINEERING AND AEROSPACE TECHNOLOGY, 2005, 77 (06): : 504 - 504
  • [34] VSCS - A NEW VOICE-SWITCHING SYSTEM FOR AIR-TRAFFIC-CONTROL
    BERNARD, RJ
    [J]. JOURNAL OF TELECOMMUNICATION NETWORKS, 1986, 4 (01): : 20 - 29
  • [35] Modelling of a Speech-to-Text Recognition System for Air Traffic Control and NATO Air Command
    Zietsman, Grant
    Malekian, Reza
    [J]. JOURNAL OF INTERNET TECHNOLOGY, 2022, 23 (07): : 1527 - 1539
  • [36] Detecting fatigue from voice using speech recognition
    Greeley, H. P.
    Friets, E.
    Wilson, J. P.
    Raghavan, S.
    Picone, J.
    Berg, J.
    [J]. 2006 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY, VOLS 1 AND 2, 2006, : 567 - 571
  • [37] DETECTING SIGNAL CORRUPTIONS IN VOICE RECORDINGS FOR SPEECH THERAPY
    Nylen, Helmer
    Chatterjee, Saikat
    Ternstrom, Sten
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 386 - 390
  • [38] VOICE AND SPEECH FOR EFFECTIVE COMMUNICATION - HICKS,HG
    DEW, D
    [J]. QUARTERLY JOURNAL OF SPEECH, 1963, 49 (04) : 461 - 461
  • [39] VOICE AND SPEECH FOR EFFECTIVE COMMUNICATION - HICKS,HG
    PELANDA, K
    [J]. SPEECH TEACHER, 1964, 13 (04): : 330 - 331
  • [40] Performance Evaluation of Communication System Proposed for Oceanic Air Traffic Control
    Ho Dac Tu
    Park, Jingyu
    Shimamoto, Shigeru
    Kitaori, Jun
    [J]. 2010 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC 2010), 2010,