Perceptual Visual Feature Learning With Applications in Sports Educational Image Understanding

被引:0
|
作者
Liu, Tengsheng [1 ]
Xu, Minghui [2 ]
机构
[1] Wuhan Inst Technol, Dept Phys Educ, Wuhan 430070, Peoples R China
[2] Jinhua Polytech, Key Lab Crop Harvesting Equipment Technol Zhejiang, Jinhua 321017, Peoples R China
关键词
Perceptual; feature fusion; local-global; active learning; deep architecture; SCENE; CLASSIFICATION; SEGMENTATION; MANIFOLD; MODEL;
D O I
10.1109/ACCESS.2024.3377657
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Effectively understanding the semantics of sophisticated sceneries is a key module in plenty of artificial intelligence (AI) systems. In this article, we optimally fuse multi-channel perceptual visual features for recognizing scenic pictures with complex spatial configurations, focusing on formulating a deep hierarchical model to actively discover human gaze allocation. In detail, to uncover semantically/visually important patches within each scenery, we utilize the BING objectness descriptor to rapidly and accurately localize multi-scale objects or their components. Subsequently, a local-global feature fusion scenario is proposed to dynamically combine the multiple low-level features from multiple scenic patches. To simulate how humans perceiving semantically/visually important scenic patches, we design a robust deep active learning (RDAL) paradigm that sequentially derives gaze shift path (GSP) and hierarchically learns deep GSP features in a unified architecture. Notably, the key advantage of RDAL is the high tolerance of label noise by adding an elaborately-designed sparse penalty. That is, the contaminated and redundant deep GSP features can be implicitly abandoned. Finally, the refined deep GSP features are integrated into a multi-label SVM for recognizing sceneries of different categories. Empirical comparisons showed that: 1) our method performs competitively on six generic scenery set (average accuracy 2% similar to 4.3% higher than the second best performer), and 2) our deep GSP feature is particularly discriminative to our compiled sport educational image set (average accuracy 7.7% higher than the second best performer).
引用
收藏
页码:41168 / 41179
页数:12
相关论文
共 50 条
  • [21] Image Comparison by Compound Disjoint Information with Applications to Perceptual Visual Quality Assessment, Image Registration and Tracking
    Zhaohui Sun
    Anthony Hoogs
    [J]. International Journal of Computer Vision, 2010, 88 : 461 - 488
  • [22] Unsupervised learning of perceptual feature combinations
    Tamosiunaite, Minija
    Tetzlaff, Christian
    Woergoetter, Florentin
    [J]. PLOS COMPUTATIONAL BIOLOGY, 2024, 20 (03)
  • [23] Understanding Masked Image Modeling via Learning Occlusion Invariant Feature
    Kong, Xiangwen
    Zhang, Xiangyu
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 6241 - 6251
  • [24] Robust Multiview Feature Learning for RGB-D Image Understanding
    Zha, Zheng-Jun
    Yang, Yang
    Tang, Jinhui
    Wang, Meng
    Chua, Tat-Seng
    [J]. ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2015, 6 (02)
  • [25] Composition of Visual Feature Vector Pattern for Deep Learning in Image Forensics
    Rhee, Kang Hyeon
    [J]. IEEE ACCESS, 2020, 8 : 188970 - 188980
  • [26] Perceptual feature selection for semantic image classification
    Depalov, Dejan
    Pappas, Thrasyvoulos N.
    Li, Dongge
    Gandhi, Bhavan
    [J]. 2006 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP 2006, PROCEEDINGS, 2006, : 2921 - +
  • [27] Application of image content feature retrieval based on deep learning in sports public industry
    Xu, Nianli
    Liu, Fengying
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2020, 39 (02) : 1867 - 1877
  • [28] Visual Understanding via Multi-Feature Shared Learning With Global Consistency
    Zhang, Lei
    Zhang, David
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2016, 18 (02) : 247 - 259
  • [29] Understanding brain plasticity in perceptual learning
    Anja Stemme
    Gustavo Deco
    Elmar Lang
    [J]. BMC Neuroscience, 10 (Suppl 1)
  • [30] Iterative Learning Control for Image Based Visual Servoing Applications
    Sutanto, Erick
    Alleyne, Andrew G.
    [J]. 2014 AMERICAN CONTROL CONFERENCE (ACC), 2014,