Consumer-level multimedia event detection through unsupervised audio signal modeling

被引：0

作者：

Byun, Byungki ^{[1
]}

Kim, Ilseo ^{[1
]}

Siniscalchi, Sabato Marco

Lee, Chin-Hui ^{[1
]}

机构：

[1] Georgia Inst Technol, Sch Elect & Comp Engn, Atlanta, GA 30332 USA

来源：

13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3 | 2012年

关键词：

multimedia event detection; unsupervised audio modeling; acoustic segment models;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this work(1), a novel acoustic characterization approach to multimedia event detection (MED) task for unconstrained and unstructured consumer-level videos through audio signal modeling is proposed. The key idea is to characterize the acoustic space of interest with a set of fundamental acoustic units around which a set of acoustic segment models (ASMs) is built. A vector space modeling technique to address MED is here adopted, where an incoming audio signal is first decoded into a sequence of acoustic segments. Then, a feature vector is generated by using co-occurrence statistics of acoustic units, and the MED final decision is implemented with a vector space language classifier. Experimental evidence on the TRECVID2011 MED demonstrates the viability of the proposed approach. Furthermore, it better accounts for temporal dependencies than previously proposed MFCC bag-of-word approaches.

引用

页码：2079 / 2082

页数：4

共 38 条

[1] UNSUPERVISED FEATURE EXTRACTION FOR MULTIMEDIA EVENT DETECTION AND RANKING USING AUDIO CONTENT
Amid, Ehsan
Mesaros, Annamaria
Palomaki, Kalle J.
Laaksonen, Jorma
Kurimo, Mikko
[J]. 2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
[2] An NFC Based Consumer-Level Counterfeit Detection Framework
Saeed, Muhammad Qasim
Bilal, Zeeshan
Walter, Colin D.
[J]. 2013 ELEVENTH ANNUAL INTERNATIONAL CONFERENCE ON PRIVACY, SECURITY AND TRUST (PST), 2013, : 135 - 142
[3] Audio based event detection for multimedia surveillance
Atrey, Pradeep K.
Maddage, Namunu C.
Kankanhalli, Mohan S.
[J]. 2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13, 2006, : 5671 - 5674
[4] Compact Audio Representation for Event Detection in Consumer Media
Zhuang, Xiaodan
Tsakalidis, Stavros
Wu, Shuang
Natarajan, Pradeep
Prasad, Rohit
Natarajan, Prem
[J]. 13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 2087 - 2090
[5] IMPROVED AUDIO FEATURES FOR LARGE-SCALE MULTIMEDIA EVENT DETECTION
Metze, Florian
Rawat, Shourabh
Wang, Yipei
[J]. 2014 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2014,
[6] Audio-Based Multimedia Event Detection with DNNs and Sparse Sampling
Ashraf, Khalid
Elizalde, Benjamin
Iandola, Forrest
Moskewicz, Matthew
Bernd, Julia
Friedland, Gerald
Keutzer, Kurt
[J]. ICMR'15: PROCEEDINGS OF THE 2015 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, 2015, : 611 - 614
[7] Audio Bank: A High-Level Acoustic Signal Representation for Audio Event Recognition
Sandhan, Tushar
Sonowal, Sukanya
Choi, Jin Young
[J]. 2014 14TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS 2014), 2014, : 82 - 87
[8] Recurrent Support Vector Machines for Audio-Based Multimedia Event Detection
Wang, Yun
Metze, Florian
[J]. ICMR'16: PROCEEDINGS OF THE 2016 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, 2016, : 265 - 269
[9] Mental Illness Detection Through Audio Signal Processing
Karmore, Pravin
Karmore, Swapnili
Uparkar, Satyajit
[J]. BIOSCIENCE BIOTECHNOLOGY RESEARCH COMMUNICATIONS, 2020, 13 (14): : 305 - 309
[10] Multi-rate modulation encoding via unsupervised learning for audio event detection
Kothinti, Sandeep Reddy
Elhilali, Mounya
[J]. EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2024, 2024 (01)

← 1 2 3 4 →