Detecting English Speech in the Air Traffic Control Voice Communication

被引:2
|
作者
Szoke, Igor [1 ]
Kesiraju, Santosh [1 ]
Novotny, Ondrej [1 ]
Kocour, Martin [1 ]
Vesely, Karel [1 ]
Cernocky, Jan [1 ]
机构
[1] Brno Univ Technol, Fac Informat Technol, Speech FIT, Brno, Czech Republic
来源
关键词
speech recognition; language detection; x-vector extractor; acoustic model; air-traffic communication; data collection; text embeddings; Bayesian methods;
D O I
10.21437/Interspeech.2021-1033
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
Developing in-cockpit voice enabled applications require a real-world dataset with labels and annotations. We launched a community platform for collecting the Air-Traffic Control (ATC) speech, world-wide in the ATCO(2) project. Filtering out non-English speech is one of the main components in the data processing pipeline. The proposed English Language Detection (ELD) system is based on the embeddings from Bayesian subspace multinomial model. It is trained on the word confusion network from an ASR system. It is robust, easy to train, and light weighted. We achieved 0.0439 equal-error-rate (EER), a 50% relative reduction as compared to the state-of-the-art acoustic ELD system based on x-vectors, in the in-domain scenario. Further, we achieved an EER of 0.1352, a 33% relative reduction as compared to the acoustic ELD, in the unseen language (out-of-domain) condition. We plan to publish the evaluation dataset from the ATCO(2) project.
引用
收藏
页码:3286 / 3290
页数:5
相关论文
共 50 条
  • [1] Digital voice communication systems for UK air traffic control operations
    不详
    [J]. AIRCRAFT ENGINEERING AND AEROSPACE TECHNOLOGY, 2008, 80 (04): : 462 - 462
  • [2] Improving pilot/air traffic control voice communication in general aviation
    Prinzo, OV
    Morrow, DG
    [J]. INTERNATIONAL JOURNAL OF AVIATION PSYCHOLOGY, 2002, 12 (04): : 341 - 357
  • [3] STOCHASTIC PROPERTIES OF TOTAL AIR-TRAFFIC CONTROL VOICE COMMUNICATION TIME
    DUNLAY, WJ
    [J]. TRANSPORTATION RESEARCH, 1975, 9 (05): : 275 - 278
  • [4] Using Speech Analysis in Voice Communication A new approach to improve Air Traffic Management Security
    Rusko, Milan
    Finke, Michael
    [J]. 2016 7TH IEEE INTERNATIONAL CONFERENCE ON COGNITIVE INFOCOMMUNICATIONS (COGINFOCOM), 2016, : 181 - 186
  • [5] Air traffic control communication (ATCC) speech corpora and their use for ASR and TTS development
    Luboš Šmídl
    Jan Švec
    Daniel Tihelka
    Jindřich Matoušek
    Jan Romportl
    Pavel Ircing
    [J]. Language Resources and Evaluation, 2019, 53 : 449 - 464
  • [6] Air traffic control communication (ATCC) speech corpora and their use for ASR and TTS development
    Smidl, Lubos
    Svec, Jan
    Tihelka, Daniel
    Matousek, Jindrich
    Romportl, Jan
    Ircing, Pavel
    [J]. LANGUAGE RESOURCES AND EVALUATION, 2019, 53 (03) : 449 - 464
  • [7] A speech interface for air traffic control terminals
    Ferreiros, J.
    Pardo, J. M.
    de Cordoba, R.
    Macias-Guarasa, J.
    Montero, J. M.
    Fernandez, F.
    Sama, V.
    d'Haro, L. F.
    Gonzalez, G.
    [J]. AEROSPACE SCIENCE AND TECHNOLOGY, 2012, 21 (01) : 7 - 15
  • [8] Speech, Voice, and Communication
    Johnson, Julia A.
    [J]. NONMOTOR PARKINSON'S: THE HIDDEN FACE - MANAGEMENT AND THE HIDDEN FACE OF RELATED DISORDERS, 2017, 134 : 1189 - 1205
  • [9] Satellite Based Voice Communication for Air Traffic Management and Airline Operation
    Eier, Dieter
    Kampichler, Wolfgang
    [J]. 2011 IEEE/AIAA 30TH DIGITAL AVIONICS SYSTEMS CONFERENCE (DASC), 2011,
  • [10] Automatic Speech Recognition for Air Traffic Control Communications
    Badrinath, Sandeep
    Balakrishnan, Hamsa
    [J]. TRANSPORTATION RESEARCH RECORD, 2022, 2676 (01) : 798 - 810