Detecting semantic concepts from video using temporal gradients and audio classification

被引:0
|
作者
Rautiainen, M
Seppänen, T
Penttilä, J
Peltola, J
机构
[1] Univ Oulu, MediaTeam Oulu, FIN-90014 Oulu, Finland
[2] VTT Tech Res Ctr Finland, FIN-90571 Oulu, Finland
来源
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we describe new methods to detect semantic concepts from digital video based on audible and visual content. Temporal Gradient Correlogram captures temporal correlations of gradient edge directions from sampled shot frames. Power-related physical features are extracted from short audio samples in video shots. Video shots containing people, cityscape, landscape, speech or instrumental sound are detected with trained self-organized maps and kNN classification results of audio samples. Test runs and evaluations in TREC 2002 Video Track show consistent performance for Temporal Gradient Correlogram and state-of-the-art precision in audio-based instrumental sound detection.
引用
收藏
页码:260 / 270
页数:11
相关论文
共 50 条
  • [41] Decision Tree Based Depression Classification from Audio Video and Language Information
    Yang, Le
    Jiang, Dongmei
    He, Lang
    Pei, Ercheng
    Oveneke, Meshia Cedric
    Sahli, Hichem
    PROCEEDINGS OF THE 6TH INTERNATIONAL WORKSHOP ON AUDIO/VISUAL EMOTION CHALLENGE (AVEC'16), 2016, : 89 - 96
  • [42] Towards Virtual Audio/Video Environments Using Semantic Service Composition on a Service Oriented Infrastructure
    Lalbakhsh, Pooia
    Goodarzi, Ayub
    Fesharaki, Mehdi N.
    INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER CONTROL : ICACC 2009 - PROCEEDINGS, 2009, : 485 - +
  • [43] Detection and Classification of Moving Vehicle From Video Using Multiple Spatio-Temporal Features
    Wang, Yu
    Ban, Xiaojuan
    Wang, Huan
    Wu, Di
    Wang, Hao
    Yang, Shouqing
    Liu, Sinuo
    Lai, Jinhui
    IEEE ACCESS, 2019, 7 : 80287 - 80299
  • [44] Robust Temporal Registration Scheme for Video Copies Using Visual-Audio Features
    Roopalakshmi, R.
    Venkatesh, Revanur
    Rahul, K. M.
    3RD INTERNATIONAL CONFERENCE ON RECENT TRENDS IN COMPUTING 2015 (ICRTC-2015), 2015, 57 : 385 - 394
  • [45] From semantic weight to legal ontology via classification of concepts in legal texts
    Allison, Neil Grainger
    LAW TEACHER, 2023, 57 (02): : 201 - 217
  • [46] Learning Temporal Relations from Semantic Neighbors for Acoustic Scene Classification
    Zhang, Liwen
    Han, Jiqing
    Shi, Ziqiang
    IEEE SIGNAL PROCESSING LETTERS, 2020, 27 : 950 - 954
  • [47] Extraction of Video Songs from Movies using Audio Features
    Darjii, Mittal C.
    Patel, Narendra M.
    Shah, Zankhana H.
    2015 International Symposium on Advanced Computing and Communication (ISACC), 2015, : 60 - 64
  • [48] Structuring soccer video based on audio classification and segmentation using hidden Markov model
    Chen, JY
    Li, YH
    Lao, SY
    Wu, LD
    Bai, L
    IMAGE AND VIDEO RETRIEVAL, PROCEEDINGS, 2004, 3115 : 98 - 105
  • [49] Audio classification and segmentation for sports video structure extraction using support vector machine
    Bai, Liang
    Lao, Song-Yang
    Liao, Hu-Xiong
    Chen, Jian-Yun
    PROCEEDINGS OF 2006 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2006, : 3303 - +
  • [50] Video semantic concept discovery using multimodal-based association classification
    Lin, Lin
    Ravitz, Guy
    Shyu, Mei-Ling
    Chen, Shu-Ching
    2007 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-5, 2007, : 859 - +