Combining the Right Features for Complex Event Recognition

被引:48
|
作者
Tang, Kevin [1 ]
Yao, Bangpeng [1 ]
Li Fei-Fei [1 ]
Koller, Daphne [1 ]
机构
[1] Stanford Univ, Dept Comp Sci, Stanford, CA 94305 USA
来源
2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV) | 2013年
关键词
D O I
10.1109/ICCV.2013.335
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we tackle the problem of combining features extracted from video for complex event recognition. Feature combination is an especially relevant task in video data, as there are many features we can extract, ranging from image features computed from individual frames to video features that take temporal information into account. To combine features effectively, we propose a method that is able to be selective of different subsets of features, as some features or feature combinations may be uninformative for certain classes. We introduce a hierarchical method for combining features based on the AND/OR graph structure, where nodes in the graph represent combinations of different sets of features. Our method automatically learns the structure of the AND/OR graph using score-based structure learning, and we introduce an inference procedure that is able to efficiently compute structure scores. We present promising results and analysis on the difficult and large-scale 2011 TRECVID Multimedia Event Detection dataset [17].
引用
收藏
页码:2696 / 2703
页数:8
相关论文
共 50 条
  • [21] Improvement of speaker recognition by combining residual and prosodic features with acoustic features
    Chen, SH
    Wang, HC
    2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING, 2004, : 93 - 96
  • [22] Face Recognition Combining Eigen Features with a Parzen Classifier
    孙鑫
    刘兵
    刘本永
    Journal of Electronic Science and Technology of China, 2005, (01) : 18 - 21
  • [23] Combining Binaural and Cortical Features for Robust Speech Recognition
    Spille, Constantin
    Kollmeier, Birger
    Meyer, Bernd T.
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2017, 25 (04) : 756 - 767
  • [24] Face Image Recognition Combining Holistic and Local Features
    Pan, Chen
    Cao, Feilong
    ADVANCES IN NEURAL NETWORKS - ISNN 2009, PT 3, PROCEEDINGS, 2009, 5553 : 407 - +
  • [25] Combining Features for Chinese Sign Language Recognition with Kinect
    Geng, Lubo
    Ma, Xin
    Xue, Bingxia
    Wu, Hanbo
    Gu, Jason
    Li, Yibin
    11TH IEEE INTERNATIONAL CONFERENCE ON CONTROL AND AUTOMATION (ICCA), 2014, : 1393 - 1398
  • [26] Robust Speech Recognition Combining Cepstral and Articulatory Features
    Zha, Zhuan-ling
    Hu, Jin
    Zhan, Qing-ran
    Shan, Ya-hui
    Xie, Xiang
    Wang, Jing
    Cheng, Hao-bo
    PROCEEDINGS OF 2017 3RD IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATIONS (ICCC), 2017, : 1401 - 1405
  • [27] Combining Perceptual Features With Diffusion Distance for Face Recognition
    Zhou, Huiyu
    Sadka, Abdul H.
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2011, 41 (05): : 577 - 588
  • [28] Combining Appearance and Geometric Features for Facial Expression Recognition
    Yu, Hui
    Liu, Honghai
    SIXTH INTERNATIONAL CONFERENCE ON GRAPHIC AND IMAGE PROCESSING (ICGIP 2014), 2015, 9443
  • [29] Combining Prosodic and Spectral Features for Mandarin Intonation Recognition
    Bao, Wei
    Li, Ya
    Gu, Mingliang
    Tao, Jianhua
    Chao, Linlin
    Liu, Shanfeng
    2014 9TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2014, : 497 - +
  • [30] Methods for combining the information of various features in speech recognition
    Wang, Chengyou
    Tang, Shuqi
    Liang, Diannong
    Chen, Huihuang
    Tang, Chaojing
    Shengxue Xuebao/Acta Acustica, 1997, 22 (02): : 111 - 115