Audio fingerprinting scheme by temporal filtering for audio identification immune to channel-distortion

被引:0
|
作者
Park, M [1 ]
Kim, HR
Shin, DH
Yang, SH
机构
[1] Informat & Commun Univ, Sch Engn, Taejon, South Korea
[2] Konan Technol Inc, Informat Retrieval & Min Res Team, Seoul, South Korea
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Channel-distortion in real-environment is at issue in music information retrieval system by content-based audio identification technique. As a matter of fact, audio signal is commonly distorted by channel and background noise in case of that it is recorded under real-situation. Recently, Philips published a robust and efficient audio fingerprinting system for audio identification. To extract a robust and efficient audio fingerprint, Philips applied the first derivative (differential) to the frequency-time sequence of perceptual filter-bank energies. In practice, however, it is not sufficient to remove the undesired perturbations. This paper introduces an extension method of the audio fingerprint extraction scheme of Philips that is more immune to channel-distortion. The channel-normalization techniques for temporal filtering are used to lessen the channel effects of real-environment.
引用
收藏
页码:528 / 533
页数:6
相关论文
共 44 条
  • [31] AN IMPROVED SCHEME OF AUDIO WATERMARKING BASED ON TURBO CODES AND CHANNEL EFFECT MODELING
    Majoul, Taoufik
    Raouafi, Fathi
    Jaidane, Meriem
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 353 - 356
  • [32] CAS-TJ: Channel attention shuffle and temporal jigsaw for audio classification
    Kim, Yongmin
    Ko, Kyungdeuk
    Lee, Junyeop
    Ko, Hanseok
    APPLIED ACOUSTICS, 2025, 233
  • [33] PREDICTIVE AUDIO CODING USING RATE-DISTORTION-OPTIMAL PRE- AND POST-FILTERING
    Moussa, Obada Alhaj
    Li, Minyue
    Kleijn, W. Bastiaan
    2011 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2011, : 213 - 216
  • [34] Robust Temporal Registration Scheme for Video Copies Using Visual-Audio Features
    Roopalakshmi, R.
    Venkatesh, Revanur
    Rahul, K. M.
    3RD INTERNATIONAL CONFERENCE ON RECENT TRENDS IN COMPUTING 2015 (ICRTC-2015), 2015, 57 : 385 - 394
  • [35] An audio watermarking scheme based on extended bipolar echo kernel and robust to temporal offsets
    Shine K. P.
    Krishna Kumar S.
    CSI Transactions on ICT, 2015, 3 (2-4) : 111 - 117
  • [36] Detecting Replay Attacks Using Single-Channel Audio: The Temporal Autocorrelation of Speech
    Lee, Shih-Kuang
    Tsao, Yu
    Wang, Hsin-Min
    PROCEEDINGS OF 2022 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2022, : 1984 - 1990
  • [37] Energy Aware Modelling of Inter-Channel Level Difference Distortion Impact on Spatial Audio Perception
    Delgado, Pablo
    Herre, Juergen
    Taghipour, Armin
    Schinkel-Bielefeld, Nadja
    2018 AES INTERNATIONAL CONFERENCE ON SPATIAL REPRODUCTION - AESTHETICS AND SCIENCE, 2018,
  • [38] The Time Course of Audio-Visual Phoneme Identification: a High Temporal Resolution Study
    Sanchez-Garcia, Carolina
    Kandel, Sonia
    Savariaux, Christophe
    Soto-Faraco, Salvador
    MULTISENSORY RESEARCH, 2018, 31 (1-2) : 57 - 78
  • [39] Temporal Filtering of Visual Speech for Audio-Visual Speech Recognition in Acoustically and Visually Challenging Environments
    Lee, Jong-Seok
    Park, Cheol Hoon
    ICMI'07: PROCEEDINGS OF THE NINTH INTERNATIONAL CONFERENCE ON MULTIMODAL INTERFACES, 2007, : 220 - 227
  • [40] A High-Capacity Reversible Data Hiding Scheme Using Dual-Channel Audio
    Yu, Heng
    Wang, Rangding
    Dong, Li
    Yan, Diqun
    Gong, Yongkang
    Lin, Yuzhen
    IEEE ACCESS, 2020, 8 (08) : 162271 - 162278