ASDF: A Differential Testing Framework for Automatic Speech Recognition Systems

被引:1
|
作者
Yuen, Daniel Hao Xian [1 ]
Pang, Andrew Yong Chen [1 ]
Yang, Zhou [2 ]
Chong, Chun Yong [1 ]
Lim, Mei Kuan [1 ]
Lo, David [2 ]
机构
[1] Monash Univ, Sch Informat Technol, Subang Jaya, Malaysia
[2] Singapore Management Univ, Sch Comp & Informat Syst, Singapore, Singapore
关键词
D O I
10.1109/ICST57152.2023.00050
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Recent years have witnessed wider adoption of Automated Speech Recognition (ASR) techniques in various domains. Consequently, evaluating and enhancing the quality of ASR systems is of great importance. This paper proposes ASDF, an Automated Speech Recognition Differential Testing Framework to test ASR systems. ASDF extends an existing ASR testing tool, the CrossASR++, which synthesizes test cases from a text corpus. However, CrossASR++ fails to make use of the text corpus efficiently and provides limited information on how the failed test cases can improve ASR systems. To address these limitations, our tool incorporates two novel features: (1) a text transformation module to boost the number of generated test cases and uncover more errors in ASR systems, and (2) a phonetic analysis module to identify phonemes that the ASR systems tend to transcribe incorrectly. ASDF generates more high-quality test cases by applying various text transformation methods (e.g., changing tense) to the input text in a failed test case. By doing so, ASDF can utilize a small text corpus to generate a large number of audio test cases, something which CrossASR++ is not capable of. In addition, ASDF implements more metrics to evaluate the performance of ASR systems from multiple perspectives. ASDF performs phonetic analysis on the identified failed test cases to identify the phonemes that ASR systems tend to transcribe incorrectly, providing useful information for developers to improve ASR systems. The demonstration video of our tool is made online at https://www.youtube.com/watch?v=DzVwfc3h9As. The implementation is available at https://github.com/danielyuenhx/ asdf-differential-testing.
引用
收藏
页码:461 / 463
页数:3
相关论文
共 50 条
  • [1] Can Differential Testing Improve Automatic Speech Recognition Systems?
    Asyrofi, Muhammad Hilmi
    Yang, Zhou
    Shi, Jieke
    Quan, Chu Wei
    Lo, David
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON SOFTWARE MAINTENANCE AND EVOLUTION (ICSME 2021), 2021, : 674 - 678
  • [2] CrossASR plus plus : A Modular Differential Testing Framework for Automatic Speech Recognition
    Asyrofi, Muhammad Hilmi
    Yang, Zhou
    Lo, David
    [J]. PROCEEDINGS OF THE 29TH ACM JOINT MEETING ON EUROPEAN SOFTWARE ENGINEERING CONFERENCE AND SYMPOSIUM ON THE FOUNDATIONS OF SOFTWARE ENGINEERING (ESEC/FSE '21), 2021, : 1575 - 1579
  • [3] Automatic testing of speech recognition
    Francart, Tom
    Moonen, Marc
    Wouters, Jan
    [J]. INTERNATIONAL JOURNAL OF AUDIOLOGY, 2009, 48 (02) : 80 - 90
  • [4] CrossASR: Efficient Differential Testing of Automatic Speech Recognition via Text-To-Speech
    Asyrofi, Muhammad Hilmi
    Thung, Ferdian
    Lo, David
    Jiang, Lingxiao
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON SOFTWARE MAINTENANCE AND EVOLUTION (ICSME 2020), 2020, : 640 - 650
  • [5] Automatic speech recognition systems
    Catariov, A
    [J]. Information Technologies 2004, 2004, 5822 : 83 - 93
  • [6] A Dimensionality Reduction Framework for Automatic Speech Recognition
    ElMoudden, Ismail
    ElBernoussi, Souad
    Benyacoub, Badreddine
    [J]. INNOVATION MANAGEMENT AND SUSTAINABLE ECONOMIC COMPETITIVE ADVANTAGE: FROM REGIONAL DEVELOPMENT TO GLOBAL GROWTH, VOLS I - VI, 2015, 2015, : 2602 - 2608
  • [7] SPEECH DISFLUENCIES MODELING IN AUTOMATIC SPEECH RECOGNITION SYSTEMS
    Vasilisa, Verkhodanova O.
    Alexey, Karpov A.
    [J]. TOMSK STATE UNIVERSITY JOURNAL, 2012, (363): : 10 - +
  • [8] General hybrid framework for uncertainty-decoding-based automatic speech recognition systems
    Abdelaziz, Ahmed Hussen
    Kolossa, Dorothea
    [J]. SPEECH COMMUNICATION, 2016, 79 : 1 - 13
  • [9] A Joint Training Framework for Robust Automatic Speech Recognition
    Wang, Zhong-Qiu
    Wang, DeLiang
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2016, 24 (04) : 796 - 806
  • [10] APPLICATION OF SPEECH RECOGNITION TO AUTOMATIC INTELLIGIBILITY TESTING PROCEDURES
    TEACHER, CF
    RICHARDS, JR
    HEWITT, H
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1970, 48 (01): : 131 - &