Consumer-level multimedia event detection through unsupervised audio signal modeling

Authors
Byun, Byungki [1]
Kim, Ilseo [1]
Siniscalchi, Sabato Marco
Lee, Chin-Hui [1]
Affiliations
[1] Georgia Inst Technol, Sch Elect & Comp Engn, Atlanta, GA 30332 USA
Keywords
multimedia event detection; unsupervised audio modeling; acoustic segment models
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This work proposes a novel acoustic characterization approach to the multimedia event detection (MED) task for unconstrained and unstructured consumer-level videos, based on audio signal modeling. The key idea is to characterize the acoustic space of interest with a set of fundamental acoustic units, around which a set of acoustic segment models (ASMs) is built. MED is then addressed with a vector space modeling technique: an incoming audio signal is first decoded into a sequence of acoustic segments, a feature vector is generated from co-occurrence statistics of the acoustic units, and the final MED decision is made by a vector space language classifier. Experimental evidence on the TRECVID 2011 MED task demonstrates the viability of the proposed approach. Furthermore, the approach better accounts for temporal dependencies than previously proposed MFCC bag-of-words approaches.
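The feature-generation step described in the abstract can be illustrated with a minimal sketch. It assumes the audio has already been decoded into a sequence of acoustic unit labels, and it assumes that "co-occurrence statistics" means unigram and adjacent-pair (bigram) relative frequencies; the abstract does not specify the exact statistics, so this is an illustration rather than the authors' implementation. The function name `cooccurrence_features`, the unit labels, and the toy sequence are all hypothetical.

```python
from collections import Counter
from itertools import product


def cooccurrence_features(units, vocab):
    """Map a decoded unit sequence to a fixed-length frequency vector.

    Assumption: the vector holds unigram relative frequencies followed by
    bigram (adjacent-pair) relative frequencies over a fixed unit inventory.
    """
    unigram_counts = Counter(units)
    bigram_counts = Counter(zip(units, units[1:]))

    # Guard against division by zero for empty or single-unit sequences.
    n_unigrams = max(len(units), 1)
    n_bigrams = max(len(units) - 1, 1)

    unigram_part = [unigram_counts[u] / n_unigrams for u in vocab]
    bigram_part = [
        bigram_counts[(a, b)] / n_bigrams for a, b in product(vocab, repeat=2)
    ]
    return unigram_part + bigram_part


# Hypothetical inventory of three acoustic units and a toy decoded sequence.
vocab = ["u0", "u1", "u2"]
decoded = ["u0", "u1", "u1", "u2", "u0", "u1"]
features = cooccurrence_features(decoded, vocab)
```

With a vocabulary of V units the vector has V + V² entries. The bigram entries depend on the order of the units, which is one way such a representation can capture temporal structure that an order-free bag-of-words histogram cannot. The resulting vector would then be passed to a classifier of the reader's choice.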
Pages: 2079-2082
Page count: 4