Existence detection and embedding rate estimation of blended speech in covert speech communications

被引:0
|
作者
Li, Lijuan [1 ]
Gao, Yong [1 ]
机构
[1] Sichuan Univ, Coll Elect & Informat Engn, Chengdu 610064, Sichuan, Peoples R China
来源
SPRINGERPLUS | 2016年 / 5卷
关键词
Blended speech; Covert speech communication; Embedding rate estimation; Existence detection; Odd-even difference (OED); AUDIO STEGANALYSIS;
D O I
10.1186/s40064-016-2691-6
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Covert speech communications may be used by terrorists to commit crimes through Internet. Steganalysis aims to detect secret information in covert communications to prevent crimes. Herein, based on the average zero crossing rate of the odd-even difference (AZCR-OED), a steganalysis algorithm for blended speech is proposed; it can detect the existence and estimate the embedding rate of blended speech. First, the odd-even difference (OED) of the speech signal is calculated and divided into frames. The average zero crossing rate (ZCR) is calculated for each OED frame, and the minimum average ZCR and AZCR-OED of the entire speech signal are extracted as features. Then, a support vector machine classifier is used to determine whether the speech signal is blended. Finally, a voice activity detection algorithm is applied to determine the hidden location of the secret speech and estimate the embedding rate. The results demonstrate that without attack, the detection accuracy can reach 80 % or more when the embedding rate is greater than 10 %, and the estimated embedding rate is similar to the real value. And when some attacks occur, it can also reach relatively high detection accuracy. The algorithm has high performance in terms of accuracy, effectiveness and robustness.
引用
收藏
页数:15
相关论文
共 50 条
  • [41] Visual Voice Activity Detection and Adaptive Threshold Estimation for Speech Recognition
    Song, Taeyup
    Lee, Kyungsun
    Kim, Sung Soo
    Lee, Jae-Won
    Ko, Hanseok
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2015, 34 (04): : 321 - 327
  • [42] Non-Intrusive Bit-Rate Detection of Coded Speech
    Sharma, Dushyant
    Jost, Uwe
    Naylor, Patrick A.
    [J]. 2017 25TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2017, : 1799 - 1803
  • [43] Vowel Onset Point Detection for Low Bit Rate Coded Speech
    Vuppala, Anil Kumar
    Yadav, Jainath
    Chakrabarti, Saswat
    Rao, K. Sreenivasa
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2012, 20 (06): : 1894 - 1903
  • [44] Listener detection of talker stress in low-rate coded speech
    Voran, Stephen
    [J]. 2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 4813 - 4816
  • [45] SPEECH DATA RATE REDUCTION .1. APPLICABILITY OF MODERN ESTIMATION THEORY
    GUNN, JE
    SAGE, AP
    [J]. IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 1973, AES9 (02) : 130 - 138
  • [46] Frequency-domain spectral envelope estimation for low rate coding of speech
    Jelinek, M
    Adoul, JP
    [J]. ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI, 1999, : 253 - 256
  • [47] SINGLE-ENDED PACKET LOSS RATE ESTIMATION OF TRANSMITTED SPEECH SIGNALS
    Mittag, Gabriel
    Moeller, Sebastian
    [J]. 2018 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP 2018), 2018, : 226 - 230
  • [48] A study on model-based error rate estimation for automatic speech recognition
    Huang, CS
    Wang, HC
    Lee, CH
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2003, 11 (06): : 581 - 589
  • [49] Improved channel coding and estimation for Adaptive Multi Rate (AMR) speech transmission
    Hindeland, T
    Kaindl, M
    Hagenauer, J
    Schmautz, M
    Xu, W
    [J]. 2000 IEEE 51ST VEHICULAR TECHNOLOGY CONFERENCE, PROCEEDINGS, VOLS 1-3, 2000, : 1210 - 1214
  • [50] EVALUATION OF SYLLABLE RATE ESTIMATION IN EXPRESSIVE SPEECH AND ITS CONTRIBUTION TO EMOTION RECOGNITION
    Abdelwahab, Mohammed
    Busso, Carlos
    [J]. 2014 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY SLT 2014, 2014, : 472 - 477