Combining the Right Features for Complex Event Recognition

被引：48

作者：

Tang, Kevin ^{[1
]}

Yao, Bangpeng ^{[1
]}

Li Fei-Fei ^{[1
]}

Koller, Daphne ^{[1
]}

机构：

[1] Stanford Univ, Dept Comp Sci, Stanford, CA 94305 USA

来源：

2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV) | 2013年

关键词：

D O I：

10.1109/ICCV.2013.335

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper, we tackle the problem of combining features extracted from video for complex event recognition. Feature combination is an especially relevant task in video data, as there are many features we can extract, ranging from image features computed from individual frames to video features that take temporal information into account. To combine features effectively, we propose a method that is able to be selective of different subsets of features, as some features or feature combinations may be uninformative for certain classes. We introduce a hierarchical method for combining features based on the AND/OR graph structure, where nodes in the graph represent combinations of different sets of features. Our method automatically learns the structure of the AND/OR graph using score-based structure learning, and we introduce an inference procedure that is able to efficiently compute structure scores. We present promising results and analysis on the difficult and large-scale 2011 TRECVID Multimedia Event Detection dataset [17].

引用

页码：2696 / 2703

页数：8

共 50 条

[21] Improvement of speaker recognition by combining residual and prosodic features with acoustic features
Chen, SH
Wang, HC
2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING, 2004, : 93 - 96
[22] Face Recognition Combining Eigen Features with a Parzen Classifier
孙鑫
刘兵
刘本永
Journal of Electronic Science and Technology of China, 2005, (01) : 18 - 21
[23] Combining Binaural and Cortical Features for Robust Speech Recognition
Spille, Constantin
Kollmeier, Birger
Meyer, Bernd T.
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2017, 25 (04) : 756 - 767
[24] Face Image Recognition Combining Holistic and Local Features
Pan, Chen
Cao, Feilong
ADVANCES IN NEURAL NETWORKS - ISNN 2009, PT 3, PROCEEDINGS, 2009, 5553 : 407 - +
[25] Combining Features for Chinese Sign Language Recognition with Kinect
Geng, Lubo
Ma, Xin
Xue, Bingxia
Wu, Hanbo
Gu, Jason
Li, Yibin
11TH IEEE INTERNATIONAL CONFERENCE ON CONTROL AND AUTOMATION (ICCA), 2014, : 1393 - 1398
[26] Robust Speech Recognition Combining Cepstral and Articulatory Features
Zha, Zhuan-ling
Hu, Jin
Zhan, Qing-ran
Shan, Ya-hui
Xie, Xiang
Wang, Jing
Cheng, Hao-bo
PROCEEDINGS OF 2017 3RD IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATIONS (ICCC), 2017, : 1401 - 1405
[27] Combining Perceptual Features With Diffusion Distance for Face Recognition
Zhou, Huiyu
Sadka, Abdul H.
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2011, 41 (05): : 577 - 588
[28] Combining Appearance and Geometric Features for Facial Expression Recognition
Yu, Hui
Liu, Honghai
SIXTH INTERNATIONAL CONFERENCE ON GRAPHIC AND IMAGE PROCESSING (ICGIP 2014), 2015, 9443
[29] Combining Prosodic and Spectral Features for Mandarin Intonation Recognition
Bao, Wei
Li, Ya
Gu, Mingliang
Tao, Jianhua
Chao, Linlin
Liu, Shanfeng
2014 9TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2014, : 497 - +
[30] Methods for combining the information of various features in speech recognition
Wang, Chengyou
Tang, Shuqi
Liang, Diannong
Chen, Huihuang
Tang, Chaojing
Shengxue Xuebao/Acta Acustica, 1997, 22 (02): : 111 - 115

← 1 2 3 4 5 →