A Word-Embedding-Based Steganalysis Method for Linguistic Steganography via Synonym Substitution

被引:22
|
作者
Xiang, Lingyun [1 ,2 ]
Yu, Jingmin [2 ]
Yang, Chunfang [3 ]
Zeng, Daojian [1 ,2 ]
Shen, Xiaobo [4 ]
机构
[1] Changsha Univ Sci & Technol, Hunan Prov Key Lab Intelligent Proc Big Data Tran, Changsha 410114, Hunan, Peoples R China
[2] Changsha Univ Sci & Technol, Sch Comp & Commun Engn, Changsha 410114, Hunan, Peoples R China
[3] Zhengzhou Sci & Technol Inst, Zhengzhou 450001, Henan, Peoples R China
[4] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore 639798, Singapore
来源
IEEE ACCESS | 2018年 / 6卷
基金
中国国家自然科学基金;
关键词
Steganalysis; steganography; word embedding; Skip-gram language model; TF-IDF;
D O I
10.1109/ACCESS.2018.2878273
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The development of steganography technology threatens the security of privacy information in smart campus. To prevent privacy disclosure, a linguistic steganalysis method based on word embedding is proposed to detect the privacy information hidden in synonyms in the texts. With the continuous Skip-gram language model, each synonym and words in its context are represented as word embeddings, which aims to encode semantic meanings of words into low-dimensional dense vectors. The context fitness, which characterizes the suitability of a synonym by its semantic correlations with context words, is effectively estimated by their corresponding word embeddings and weighted by TF-IDF values of context words. By analyzing the differences of context fitness values of synonyms in the same synonym set and the differences of those in the cover and stego text, three features are extracted and fed into a support vector machine classifier for steganalysis task. The experimental results show that the proposed steganalysis improves the average F-value at least 4.8% over two baselines. In addition, the detection performance can be further improved by learning better word embeddings.
引用
收藏
页码:64131 / 64141
页数:11
相关论文
共 50 条
  • [1] Steganalysis on synonym substitution steganography
    Luo, Gang
    Sun, Xingming
    Xiang, Lingyun
    Liu, Yuling
    Gan, Can
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2008, 45 (10): : 1696 - 1703
  • [2] A convolutional neural network-based linguistic steganalysis for synonym substitution steganography
    Xiang, Lingyun
    Guo, Guoqing
    Yu, Jingming
    Sheng, Victor S.
    Yang, Peng
    MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2020, 17 (02) : 1041 - 1058
  • [3] Steganalysis against substitution-based linguistic steganography based on context clusters
    Chen, Zhili
    Huang, Liusheng
    Miao, Haibo
    Yang, Wei
    Meng, Peng
    COMPUTERS & ELECTRICAL ENGINEERING, 2011, 37 (06) : 1071 - 1081
  • [4] Practical Linguistic Steganography using Contextual Synonym Substitution and a Novel Vertex Coding Method
    Chang, Ching-Yun
    Clark, Stephen
    COMPUTATIONAL LINGUISTICS, 2014, 40 (02) : 403 - 448
  • [5] Synonym Based Malay Linguistic Text Steganography
    Muhammad, Hasanah Zulcefli
    Rahman, Sharifah Mumtazah Syed Ahmad Abdul
    Shakil, Asma
    2009 CONFERENCE ON INNOVATIVE TECHNOLOGIES IN INTELLIGENT SYSTEMS AND INDUSTRIAL APPLICATIONS, 2009, : 423 - 427
  • [6] Blind Linguistic Steganalysis against Translation Based Steganography
    Chen, Zhili
    Huang, Liusheng
    Meng, Peng
    Yang, Wei
    Miao, Haibo
    DIGITAL WATERMARKING, 2011, 6526 : 251 - 265
  • [7] Text Semantic Steganalysis Based on Word Embedding
    Zuo, Xin
    Hu, Huanhuan
    Zhang, Weiming
    Yu, Nenghai
    CLOUD COMPUTING AND SECURITY, PT IV, 2018, 11066 : 485 - 495
  • [8] A word-embedding-based approach for accurate identification of corresponding activities
    Shahzad, Khurram
    Kanwal, Safia
    Malik, Kamran
    Aslam, Faisal
    Ali, Muhammad
    COMPUTERS & ELECTRICAL ENGINEERING, 2019, 78 : 218 - 229
  • [9] Steganalysis of DCT-Embedding Based Adaptive Steganography and YASS
    Liu, Qingzhong
    MM&SEC 11: PROCEEDINGS OF THE 2011 ACM SIGMM MULTIMEDIA AND SECURITY WORKSHOP, 2011, : 77 - 85
  • [10] A word-frequency-preserving steganographic method based on synonym substitution
    Xiang, Lingyun
    Yang, Xiao
    Zhang, Jiahe
    Wang, Weizheng
    INTERNATIONAL JOURNAL OF COMPUTATIONAL SCIENCE AND ENGINEERING, 2019, 19 (01) : 132 - 139