An Adaptive Semisupervised Feature Analysis for Video Semantic Recognition

Cited by: 277
Authors
Luo, Minnan [1 ]
Chang, Xiaojun [2 ]
Nie, Liqiang [3 ]
Yang, Yi [4 ]
Hauptmann, Alexander G. [2 ]
Zheng, Qinghua [1 ]
Affiliations
[1] Xi An Jiao Tong Univ, Dept Comp Sci, SPKLSTN Lab, Xian 710049, Shaanxi, Peoples R China
[2] Carnegie Mellon Univ, Sch Comp Sci, Pittsburgh, PA 15213 USA
[3] Shandong Univ, Sch Comp Sci & Technol, Jinan 250100, Shandong, Peoples R China
[4] Univ Technol Sydney, Ctr Quantum Computat & Intelligent Syst, Sydney, NSW 2007, Australia
Funding
National Science Foundation (USA)
Keywords
Feature selection; manifold regularization; semisupervised learning; video semantic recognition
DOI
10.1109/TCYB.2017.2647904
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Video semantic recognition usually suffers from the curse of dimensionality and a shortage of high-quality labeled instances; as a result, semisupervised feature selection has attracted increasing attention for its efficiency and comprehensibility. Most previous methods assume that videos that are close to each other (neighbors) have similar labels, and they characterize the intrinsic local structure through a predetermined graph over both labeled and unlabeled data. However, besides the parameter-tuning problem underlying the construction of the graph, the affinity measurement in the original feature space usually suffers from the curse of dimensionality. Additionally, the predetermined graph is constructed separately from the feature selection procedure, which might degrade performance in video semantic recognition. In this paper, we propose a novel semisupervised feature selection method from a new perspective. The primary assumption underlying our model is that instances with similar labels should have a larger probability of being neighbors. Instead of using a predetermined similarity graph, we incorporate the exploration of the local structure into the procedure of joint feature selection so as to learn the optimal graph simultaneously. Moreover, an adaptive loss function is used to measure the label fitness, which significantly enhances the model's robustness to videos with a small or substantial loss. We propose an efficient alternating optimization algorithm to solve the resulting challenging problem, together with theoretical analyses of its convergence and computational complexity. Finally, extensive experimental results on benchmark datasets illustrate the effectiveness and superiority of the proposed approach on tasks related to video semantic recognition.
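The abstract does not state the formula of the adaptive loss or how features are finally ranked, so the following is only an illustrative sketch under stated assumptions. It assumes a sigma-adaptive loss of the form sum_i (1 + sigma) * ||r_i||^2 / (||r_i|| + sigma), a form used elsewhere in the robust-learning literature, which behaves like a sum of l2 norms for small sigma and like a sum of squared l2 norms for large sigma. It also assumes that features are ranked by the row norms of a learned projection matrix. The names `adaptive_loss`, `rank_features`, `residuals`, `sigma`, and `W` are hypothetical and not taken from the paper.

```python
import numpy as np


def adaptive_loss(residuals, sigma):
    """Assumed sigma-adaptive loss over the rows of a residual matrix.

    Each row r_i contributes (1 + sigma) * ||r_i||^2 / (||r_i|| + sigma).
    Requires sigma > 0 so that zero rows contribute exactly 0.
    """
    if sigma <= 0:
        raise ValueError("sigma must be positive")
    norms = np.linalg.norm(residuals, axis=1)  # one l2 norm per instance
    return float(np.sum((1.0 + sigma) * norms ** 2 / (norms + sigma)))


def rank_features(W):
    """Rank features by the l2 norm of the rows of W, largest first.

    W is a hypothetical (n_features x n_classes) projection matrix;
    a larger row norm is read as a more important feature.
    """
    row_norms = np.linalg.norm(W, axis=1)
    return np.argsort(-row_norms)


if __name__ == "__main__":
    R = np.array([[3.0, 4.0], [0.0, 0.0]])  # toy residuals, row norms 5 and 0
    for s in (1e-9, 1.0, 1e9):
        print(s, adaptive_loss(R, s))
    W = np.array([[0.1, 0.0], [3.0, 4.0], [1.0, 0.0]])
    print(rank_features(W))
```

In this toy setting a small `sigma` makes a large residual count roughly linearly, while a large `sigma` makes it count roughly quadratically, which is one way to read the robustness to small or substantial loss mentioned in the abstract.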
Pages: 648-660
Number of pages: 13
Related papers
50 results in total
  • [1] Semisupervised Feature Selection via Spline Regression for Video Semantic Recognition
    Han, Yahong
    Yang, Yi
    Yan, Yan
    Ma, Zhigang
    Sebe, Nicu
    Zhou, Xiaofang
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2015, 26 (02) : 252 - 264
  • [2] Manifold Adaptive Kernel Semisupervised Discriminant Analysis for Gait Recognition
    Wang, Ziqiang
    Sun, Xia
    Sun, Lijun
    Huang, Yuchun
    [J]. ADVANCES IN MECHANICAL ENGINEERING, 2013,
  • [3] ADAPTIVE ADJUSTMENT WITH SEMANTIC FEATURE SPACE FOR ZERO-SHOT RECOGNITION
    Guo, Jingcai
    Guo, Song
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 3287 - 3291
  • [4] A feature selection framework for video semantic recognition via integrated cross-media analysis and embedded learning
    Zhang, Jianguang
    Han, Yahong
    Jiang, Jianmin
    Zhou, Zhongrun
    An, Da
    Liu, JieJing
    Song, Zhifei
    [J]. EURASIP JOURNAL ON IMAGE AND VIDEO PROCESSING, 2019, 2019 (1)
  • [5] A feature selection framework for video semantic recognition via integrated cross-media analysis and embedded learning
    Jianguang Zhang
    Yahong Han
    Jianmin Jiang
    Zhongrun Zhou
    Da An
    JieJing Liu
    Zhifei Song
    [J]. EURASIP Journal on Image and Video Processing, 2019
  • [6] MULTI-LEVEL FEATURE ANALYSIS FOR SEMANTIC CATEGORY RECOGNITION
    Sridharan, Harini
    Cheriyadat, Anil
    [J]. 2013 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2013, : 4371 - 4374
  • [7] Semantic multimedia analysis for content-adaptive video streaming
    Tekalp, A. Murat
    [J]. 2006 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1-11, PROCEEDINGS, 2006, : 2089 - 2092
  • [8] Semantic video analysis for adaptive content delivery and automatic description
    Cavallaro, A
    Steiger, O
    Ebrahimi, T
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2005, 15 (10) : 1200 - 1209
  • [9] Invariant Feature Analysis in Gait Recognition Based on Video Stream
    Kang, Xiaoli
    Bai, Guifeng
    Li, Hui
    Zhang, XinXin
    [J]. BASIC & CLINICAL PHARMACOLOGY & TOXICOLOGY, 2020, 127 : 135 - 135
  • [10] Semisupervised Discriminant Multimanifold Analysis for Action Recognition
    Xu, Zengmin
    Hu, Ruimin
    Chen, Jun
    Chen, Chen
    Jiang, Junjun
    Li, Jiaofen
    Li, Hongyang
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2019, 30 (10) : 2951 - 2962