Retrieval Oriented Robust Audio Hashing

Cited by: 0
Authors
Cui, Delong [1]
Zuo, Jinglong [1]
Affiliations
[1] Maoming Univ, Coll Comp & Elect Informat, Maoming, Peoples R China
Source
Keywords
Content-based audio retrieval; audio hash; audio digest; non-negative matrix factorization; lifting-based wavelet; audio retrieval
DOI
10.4028/www.scientific.net/AMR.121-122.854
Chinese Library Classification number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
A robust audio hashing scheme is proposed for content-based audio retrieval (CBAR) applications. First, the audio is divided into fixed-length frames, and the low-frequency and high-frequency components of each frame are obtained by a three-level lifting-based wavelet transform. Second, each audio frame is approximately represented as the product of a base matrix and an encoding (coefficient) matrix using non-negative matrix factorization (NMF). Finally, the sum of each column of the coefficient matrix is calculated and quantized to produce one bit of the hash sequence. Experimental results show that the proposed scheme is robust against MP3 compression, Real compression, filtering, amplitude compression, equalization, echo, and similar operations. It is insensitive to small local changes and is therefore suitable for distinguishing between different audio clips.
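The three steps in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. The abstract does not specify the wavelet filter, the frame length, the matrix shape, the NMF rank, or the quantization rule, so everything below is an assumption chosen for illustration: a Haar wavelet computed by lifting, 1024-sample frames reshaped to 32 x 32, rank-2 NMF with Lee-Seung multiplicative updates, and a median threshold on the column sums. The function names are hypothetical.

```python
import numpy as np

def haar_lifting_step(x):
    """One Haar wavelet level via lifting: split, predict, update."""
    even, odd = x[0::2], x[1::2]
    detail = odd - even            # predict odd samples from even ones
    approx = even + detail / 2.0   # update so approx holds pairwise means
    return approx, detail

def wavelet_features(frame, levels=3):
    """Three-level decomposition; returns coefficient magnitudes (>= 0)."""
    approx, details = np.asarray(frame, dtype=float), []
    for _ in range(levels):
        approx, detail = haar_lifting_step(approx)
        details.append(detail)
    # Low-frequency band first, then high-frequency bands, coarse to fine.
    return np.abs(np.concatenate([approx] + details[::-1]))

def nmf(V, rank, iters=200, seed=0):
    """Multiplicative-update NMF: V ~ W @ H with W, H non-negative."""
    rng = np.random.default_rng(seed)
    W = rng.random((V.shape[0], rank)) + 1e-3   # base matrix
    H = rng.random((rank, V.shape[1])) + 1e-3   # coefficient matrix
    eps = 1e-9                                  # avoids division by zero
    for _ in range(iters):
        H *= (W.T @ V) / (W.T @ W @ H + eps)
        W *= (V @ H.T) / (W @ H @ H.T + eps)
    return W, H

def audio_hash(signal, frame_len=1024, rows=32, rank=2):
    """Hash a 1-D signal; frame_len must be divisible by 8 and by rows."""
    bits = []
    for i in range(len(signal) // frame_len):
        frame = signal[i * frame_len:(i + 1) * frame_len]
        V = wavelet_features(frame).reshape(rows, -1)
        _, H = nmf(V, rank)
        col_sums = H.sum(axis=0)
        # Assumed quantizer: one bit per column sum, thresholded at the median.
        bits.append((col_sums > np.median(col_sums)).astype(np.uint8))
    return np.concatenate(bits)

def hamming_distance(h1, h2):
    """Fraction of differing bits between two equal-length hashes."""
    return float(np.mean(h1 != h2))
```

Under these assumptions, each frame contributes 32 bits. Two clips would be compared with `hamming_distance(audio_hash(a), audio_hash(b))`, where a small distance suggests the same underlying content.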
Pages: 854-859
Page count: 6