Nonlinear waveform distortion: Assessment and detection of clipping on speech data and systems

被引:5
|
作者
Hansen, John H. L. [1 ]
Stauffer, Allen [1 ]
Xia, Wei [1 ]
机构
[1] Univ Texas Dallas, Erik Jonsson Sch Engn, Ctr Robust Speech Syst CRSS, Richardson, TX 75083 USA
关键词
Audio clipping; Speech quality assessment; Non-linear distortion; Speaker recognition;
D O I
10.1016/j.specom.2021.07.007
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Speech, speaker, and language systems have traditionally relied on carefully collected speech material for training acoustic models. There is an enormous amount of freely accessible audio content. A major challenge, however, is that such data is not professionally recorded, and therefore may contain a wide diversity of background noise, nonlinear distortions, or other unknown environmental or technology-based contamination or mismatch. There is a crucial need for automatic analysis to screen such unknown datasets before acoustic model development training, or to perform input audio purity screening prior to classification. In this study, we propose a waveform based clipping detection algorithm for naturalistic audio streams and examine the impact of clipping at different severities on speech quality measurements and automatic speaker recognition systems. We use the TIMIT and NIST SRE08 corpora as case studies. The results show, as expected, that clipping introduces a nonlinear distortion into clean speech data, which reduces speech quality and performance for speaker recognition. We also investigate what degree of clipping can be present to sustain effective speech system performance. The proposed detection system, which will be released, could contribute to massive new audio collections for speech and language technology development (e.g. Google Audioset (Gemmeke et al., 2017), CRSS-UTDallas Apollo Fearless-Steps (Yu et al., 2014) (19,000 h naturalistic audio from NASA Apollo missions)).
引用
收藏
页码:20 / 31
页数:12
相关论文
共 50 条
  • [1] Performance Evaluation of Optical OFDM Systems with Nonlinear Clipping Distortion
    Chen, Liang
    Krongold, Brian
    Evans, Jamie
    2009 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, VOLS 1-8, 2009, : 2536 - 2540
  • [2] Review of Waveform Distortion Interactions Assessment in Railway Power Systems
    Salles, Rafael S.
    Ronnberg, Sarah K.
    ENERGIES, 2023, 16 (14)
  • [3] Adaptive Prony method for waveform distortion detection in power systems
    Bracale, A.
    Caramia, P.
    Carpinelli, G.
    INTERNATIONAL JOURNAL OF ELECTRICAL POWER & ENERGY SYSTEMS, 2007, 29 (05) : 371 - 379
  • [4] Clipping distortion in DMT ADSL systems
    Gross, R.
    Veeneman, D.
    Electronics Letters, 1993, 29 (24): : 2080 - 2081
  • [5] CLIPPING DISTORTION IN DMT ADSL SYSTEMS
    GROSS, R
    VEENEMAN, D
    ELECTRONICS LETTERS, 1993, 29 (24) : 2080 - 2081
  • [6] Research on Recovery of Clipping and HPA Nonlinear Distortion Based on Compressive Sensing in OFDM Systems
    Yang L.
    Song K.
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2018, 46 (05): : 1078 - 1083
  • [7] DETECTION OF CLIPPING IN CODED SPEECH SIGNALS
    Eaton, James
    Naylor, Patrick A.
    2013 PROCEEDINGS OF THE 21ST EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2013,
  • [8] Speaker Recognition with Nonlinear Distortion: Clipping Analysis and Impact
    Xia, Wei
    Hanson, John H. L.
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 746 - 750
  • [9] On models of clipping distortion for lightwave CATV systems
    Ho, KP
    Kahn, JM
    IEEE PHOTONICS TECHNOLOGY LETTERS, 1996, 8 (01) : 125 - 126
  • [10] On Iterative Compensation of Clipping Distortion in OFDM Systems
    Liang, Shansuo
    Tong, Jun
    Ping, Li
    IEEE WIRELESS COMMUNICATIONS LETTERS, 2019, 8 (02) : 436 - 439