Retrieval Oriented Robust Audio Hashing

被引：0

作者：

Cui, Delong ^{[1
]}

Zuo, Jinglong ^{[1
]}

机构：

[1] Maoming Univ, Coll Comp & Elect Informat, Maoming, Peoples R China

来源：

NANOTECHNOLOGY AND COMPUTER ENGINEERING | 2010年 / 121-122卷

关键词：

Content-based audio retrieval; audio hash; audio digest; non-negative matrix factorization; lifting-based wavelet; audio retrieval;

D O I：

10.4028/www.scientific.net/AMR.121-122.854

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

Aiming at content-based audio retrieval (CBAR) applications, a robust audio hashing scheme is proposed. First the audio is divided to frame by fixed length and then low-frequent and high-frequent components are obtained by three-level lifting-based wavelet transformation in every frame. Secondly the audio frame is approximately represented as a product of a base matrix and an encoding matrix, or coefficient matrix, using non-negative matrix factorization (NMF). Finally the sum of each column in the coefficient matrix is calculated, which is then quantized to produce one bit of the hash sequence. Experiment results show that the proposed scheme is robust against Mp3 compression, Real compression, filtering, amplitude compression, equalization, echo, etc. It is insensitive to small local change, and therefore is suitable for distinguishing different audios.

引用

页码：854 / 859

页数：6

共 50 条

[31] Robust talker-independent audio document retrieval
Jones, GJF
Foote, JT
Jones, KS
Young, SJ
1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 311 - 314
[32] ROBUST MULTI-VIEW HASHING FOR CROSS-MODAL RETRIEVAL
Wang, Haitao
Chen, Hui
Meng, Min
Wu, JiGang
2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2019, : 1012 - 1017
[33] Supervised Robust Discrete Multimodal Hashing for Cross-Media Retrieval
Li, Chuan-Xiang
Yan, Ting-Kun
Luo, Xin
Nie, Liqiang
Xu, Xin-Shun
IEEE TRANSACTIONS ON MULTIMEDIA, 2019, 21 (11) : 2863 - 2877
[34] Robust and discrete matrix factorization hashing for cross-modal retrieval
Zhang, Donglin
Wu, Xiao-Jun
PATTERN RECOGNITION, 2022, 122
[35] Supervised Robust Discrete Multimodal Hashing for Cross-Media Retrieval
Yan, Ting-Kun
Xu, Xin-Shun
Guo, Shanqing
Huang, Zi
Wang, Xiao-Lin
CIKM'16: PROCEEDINGS OF THE 2016 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2016, : 1271 - 1280
[36] Daubechies Wavelets Based Robust Audio Fingerprinting for Content-Based Audio Retrieval
Sun, Wei
Lu, Zhe-Ming
Yu, Fa-Xin
Shen, Rong-Jun
INTERNATIONAL JOURNAL OF DIGITAL CRIME AND FORENSICS, 2012, 4 (02) : 49 - 69
[37] Asymmetric Supervised Fusion-Oriented Hashing for Cross-Modal Retrieval
Yang, Zhan
Deng, Xiyin
Guo, Lin
Long, Jun
IEEE TRANSACTIONS ON CYBERNETICS, 2024, 54 (02) : 851 - 864
[38] USING NAiVE TEXT QUERIES FOR ROBUST AUDIO INFORMATION RETRIEVAL
Kim, Samuel
Georgiou, Panayiotis
Narayanan, Shrikanth
Sundaram, Shiva
2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 2406 - 2409
[39] Boosting deep cross-modal retrieval hashing with adversarially robust training
Zhang, Xingwei
Zheng, Xiaolong
Mao, Wenji
Zeng, Daniel Dajun
APPLIED INTELLIGENCE, 2023, 53 (20) : 23698 - 23710
[40] Boosting deep cross-modal retrieval hashing with adversarially robust training
Xingwei Zhang
Xiaolong Zheng
Wenji Mao
Daniel Dajun Zeng
Applied Intelligence, 2023, 53 : 23698 - 23710

← 1 2 3 4 5 →