Audio Perceptual Hashing Based on NMF and MDCT Coefficients

被引:13
|
作者
Li Jinfeng [1 ]
Wang Hongxia [1 ]
Jing Yi [2 ]
机构
[1] Southwest Jiaotong Univ, Sch Informat & Sci & Technol, Chengdu 610031, Peoples R China
[2] Northeastern Univ, Coll Engn, Boston, MA 02115 USA
基金
中国国家自然科学基金;
关键词
Perceptual audio hashing; Modified discrete cosine transform; Non-negative matrix factorization; Perceptual robustness;
D O I
10.1049/cje.2015.07.024
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Audio perceptual hashing is a digest of audio contents, which is independent of content preserving manipulations, such as MP3 compression, amplitude scaling, noise addition, etc. It provides a fast and reliable tool for identification, retrieval, and authentication of audio signals. A new audio hashing scheme based on non-Negative matrix factorization (NMF) of Modified discrete cosine transform (MDCT) coefficients is proposed. MDCT coefficients, which have been widely used in audio coding,. exhibit good discrimination for different audio contents and highly robustness against content preserving manipulations, especially MDCT based compression such as MP3, AAC, etc. Based on the extraction of MDCT coefficients of the audio frames firstly, NMF is used to construct hash bits. Experiment results demonstrate that, compared with methods mentioned in literature, the proposed scheme exhibits a high efficiency in terms of discrimination, perceptual robustness identification rate and time consumption.
引用
收藏
页码:579 / 583
页数:5
相关论文
共 50 条
  • [1] Audio Perceptual Hashing Based on NMF and MDCT Coefficients
    LI Jinfeng
    WANG Hongxia
    JING Yi
    [J]. Chinese Journal of Electronics, 2015, 24 (03) : 579 - 583
  • [2] MDCT-based perceptual hashing for compressed audio content identification
    Jia, Yuhua
    Yang, Bian
    Li, Mingyu
    Niu, Xiatnu
    [J]. 2007 IEEE NINTH WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING, 2007, : 381 - +
  • [3] Perceptual Hashing Based on Salient Region and NMF
    Wu, Xujun
    Cui, Chen
    Wang, Shen
    [J]. ADVANCES IN INTELLIGENT INFORMATION HIDING AND MULTIMEDIA SIGNAL PROCESSING (IIH-MSP 2021 & FITAT 2021), VOL 1, 2022, 277 : 119 - 127
  • [4] Robust Perceptual Image Hashing Based on Ring Partition and NMF
    Tang, Zhenjun
    Zhang, Xianquan
    Zhang, Shichao
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2014, 26 (03) : 711 - 724
  • [5] The Analysis of an NMF-based Perceptual Image Hashing Scheme
    Hossein, S. Amir
    Tabatabaei, A. E.
    Ruland, Christoph
    [J]. 2013 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY (IEEE ISSPIT 2013), 2013, : 108 - 112
  • [6] Robust Audio Watermarking Based on MDCT Coefficients
    Wang, Mu-Liang
    Lin, Hong-Xun
    Lee, Mn-Ta
    [J]. 2012 SIXTH INTERNATIONAL CONFERENCE ON GENETIC AND EVOLUTIONARY COMPUTING (ICGEC), 2012, : 372 - 375
  • [7] Perceptual Audio Hashing Functions
    Hamza Özer
    Bülent Sankur
    Nasir Memon
    Emin Anarım
    [J]. EURASIP Journal on Advances in Signal Processing, 2005
  • [8] Perceptual audio hashing functions
    [J]. Özer, H. (hozer@uekae.tubitak.gov.tr), 1780, Hindawi Publishing Corporation (2005):
  • [9] Robust perceptual audio hashing
    Özer, H
    Sankur, B
    [J]. PROCEEDINGS OF THE IEEE 12TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, 2004, : 25 - 28
  • [10] Perceptual audio hashing functions
    Özer, H
    Sankur, B
    Memon, N
    Anarim, E
    [J]. EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2005, 2005 (12) : 1780 - 1793