Detecting semantic concepts from video using temporal gradients and audio classification

Cited by: 0

Authors:

Rautiainen, M

Seppänen, T

Penttilä, J

Peltola, J

Affiliations:

[1] Univ Oulu, MediaTeam Oulu, FIN-90014 Oulu, Finland

[2] VTT Tech Res Ctr Finland, FIN-90571 Oulu, Finland

Source:

IMAGE AND VIDEO RETRIEVAL, PROCEEDINGS | 2003 / Vol. 2728

Keywords:

DOI:

Not available

Chinese Library Classification:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

In this paper we describe new methods to detect semantic concepts from digital video based on audible and visual content. Temporal Gradient Correlogram captures temporal correlations of gradient edge directions from sampled shot frames. Power-related physical features are extracted from short audio samples in video shots. Video shots containing people, cityscape, landscape, speech or instrumental sound are detected with trained self-organized maps and kNN classification results of audio samples. Test runs and evaluations in TREC 2002 Video Track show consistent performance for Temporal Gradient Correlogram and state-of-the-art precision in audio-based instrumental sound detection.


Pages: 260-270

Page count: 11

50 items in total

[41] Decision Tree Based Depression Classification from Audio Video and Language Information
Yang, Le
Jiang, Dongmei
He, Lang
Pei, Ercheng
Oveneke, Meshia Cedric
Sahli, Hichem
PROCEEDINGS OF THE 6TH INTERNATIONAL WORKSHOP ON AUDIO/VISUAL EMOTION CHALLENGE (AVEC'16), 2016 : 89 - 96
[42] Towards Virtual Audio/Video Environments Using Semantic Service Composition on a Service Oriented Infrastructure
Lalbakhsh, Pooia
Goodarzi, Ayub
Fesharaki, Mehdi N.
INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER CONTROL : ICACC 2009 - PROCEEDINGS, 2009 : 485 - +
[43] Detection and Classification of Moving Vehicle From Video Using Multiple Spatio-Temporal Features
Wang, Yu
Ban, Xiaojuan
Wang, Huan
Wu, Di
Wang, Hao
Yang, Shouqing
Liu, Sinuo
Lai, Jinhui
IEEE ACCESS, 2019, 7 : 80287 - 80299
[44] Robust Temporal Registration Scheme for Video Copies Using Visual-Audio Features
Roopalakshmi, R.
Venkatesh, Revanur
Rahul, K. M.
3RD INTERNATIONAL CONFERENCE ON RECENT TRENDS IN COMPUTING 2015 (ICRTC-2015), 2015, 57 : 385 - 394
[45] From semantic weight to legal ontology via classification of concepts in legal texts
Allison, Neil Grainger
LAW TEACHER, 2023, 57 (02) : 201 - 217
[46] Learning Temporal Relations from Semantic Neighbors for Acoustic Scene Classification
Zhang, Liwen
Han, Jiqing
Shi, Ziqiang
IEEE SIGNAL PROCESSING LETTERS, 2020, 27 : 950 - 954
[47] Extraction of Video Songs from Movies using Audio Features
Darjii, Mittal C.
Patel, Narendra M.
Shah, Zankhana H.
2015 International Symposium on Advanced Computing and Communication (ISACC), 2015 : 60 - 64
[48] Structuring soccer video based on audio classification and segmentation using hidden Markov model
Chen, JY
Li, YH
Lao, SY
Wu, LD
Bai, L
IMAGE AND VIDEO RETRIEVAL, PROCEEDINGS, 2004, 3115 : 98 - 105
[49] Audio classification and segmentation for sports video structure extraction using support vector machine
Bai, Liang
Lao, Song-Yang
Liao, Hu-Xiong
Chen, Jian-Yun
PROCEEDINGS OF 2006 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2006 : 3303 - +
[50] Video semantic concept discovery using multimodal-based association classification
Lin, Lin
Ravitz, Guy
Shyu, Mei-Ling
Chen, Shu-Ching
2007 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-5, 2007 : 859 - +
