Existence detection and embedding rate estimation of blended speech in covert speech communications

被引:0
|
作者
Li, Lijuan [1 ]
Gao, Yong [1 ]
机构
[1] Sichuan Univ, Coll Elect & Informat Engn, Chengdu 610064, Sichuan, Peoples R China
来源
SPRINGERPLUS | 2016年 / 5卷
关键词
Blended speech; Covert speech communication; Embedding rate estimation; Existence detection; Odd-even difference (OED); AUDIO STEGANALYSIS;
D O I
10.1186/s40064-016-2691-6
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Covert speech communications may be used by terrorists to commit crimes through Internet. Steganalysis aims to detect secret information in covert communications to prevent crimes. Herein, based on the average zero crossing rate of the odd-even difference (AZCR-OED), a steganalysis algorithm for blended speech is proposed; it can detect the existence and estimate the embedding rate of blended speech. First, the odd-even difference (OED) of the speech signal is calculated and divided into frames. The average zero crossing rate (ZCR) is calculated for each OED frame, and the minimum average ZCR and AZCR-OED of the entire speech signal are extracted as features. Then, a support vector machine classifier is used to determine whether the speech signal is blended. Finally, a voice activity detection algorithm is applied to determine the hidden location of the secret speech and estimate the embedding rate. The results demonstrate that without attack, the detection accuracy can reach 80 % or more when the embedding rate is greater than 10 %, and the estimated embedding rate is similar to the real value. And when some attacks occur, it can also reach relatively high detection accuracy. The algorithm has high performance in terms of accuracy, effectiveness and robustness.
引用
收藏
页数:15
相关论文
共 50 条
  • [31] Word Error Rate Estimation for Speech Recognition: e-WER
    Ali, Ahmed
    Renals, Steve
    [J]. PROCEEDINGS OF THE 56TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 2, 2018, : 20 - 24
  • [32] Fundamental Frequency Estimation in Speech Signals With Variable Rate Particle Filters
    Zhang, Geliang
    Godsill, Simon
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2016, 24 (05) : 890 - 900
  • [33] Hidden-Markov-model-based voice activity detector with high speech detection rate for speech enhancement
    Veisi, H.
    Sameti, H.
    [J]. IET SIGNAL PROCESSING, 2012, 6 (01) : 54 - 63
  • [34] A 1200 bits/s HSX speech coder for very low bit rate communications
    Gournay, P
    Chartier, F
    [J]. 1998 IEEE WORKSHOP ON SIGNAL PROCESSING SYSTEMS-SIPS 98: DESIGN AND IMPLEMENTATION, 1998, : 347 - 355
  • [35] Multi-modal Voice Activity Detection by Embedding Image Features into Speech Signal
    Abe, Yohei
    Ito, Akinori
    [J]. 2013 NINTH INTERNATIONAL CONFERENCE ON INTELLIGENT INFORMATION HIDING AND MULTIMEDIA SIGNAL PROCESSING (IIH-MSP 2013), 2013, : 271 - 274
  • [36] Hate Speech Detection using Word Embedding and Deep Learning in the Arabic Language Context
    Faris, Hossam
    Aljarah, Ibrahim
    Habib, Maria
    Castillo, Pedro A.
    [J]. ICPRAM: PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION APPLICATIONS AND METHODS, 2020, : 453 - 460
  • [37] Detection of Consonant Errors in Disordered Speech Based on Consonant-Vowel Segment Embedding
    Ng, Si-Ioi
    Ng, Cymie Wing-Yee
    Li, Jingyu
    Lee, Tan
    [J]. INTERSPEECH 2021, 2021, : 2931 - 2935
  • [38] Detection of Speech Related Disorders by Pre-trained Embedding Models Extracted Biomarkers
    Jenei, Attila Zoltan
    Kiss, Gabor
    Sztaho, David
    [J]. SPEECH AND COMPUTER, SPECOM 2022, 2022, 13721 : 279 - 289
  • [39] Multi-Level Error Detection and Concealment Algorithm to Improve Speech Quality in GSM Full Rate Speech Codecs
    王林芳
    刘加
    刘小青
    李明
    [J]. Tsinghua Science and Technology, 2011, 16 (03) : 247 - 255
  • [40] Robust Voicing Detection and Estimation for HMM-Based Speech Synthesis
    Narendra, N. P.
    Rao, K. Sreenivasa
    [J]. CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2015, 34 (08) : 2597 - 2619