Perceptual Visual Feature Learning With Applications in Sports Educational Image Understanding

被引:0
|
作者
Liu, Tengsheng [1 ]
Xu, Minghui [2 ]
机构
[1] Wuhan Inst Technol, Dept Phys Educ, Wuhan 430070, Peoples R China
[2] Jinhua Polytech, Key Lab Crop Harvesting Equipment Technol Zhejiang, Jinhua 321017, Peoples R China
关键词
Perceptual; feature fusion; local-global; active learning; deep architecture; SCENE; CLASSIFICATION; SEGMENTATION; MANIFOLD; MODEL;
D O I
10.1109/ACCESS.2024.3377657
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Effectively understanding the semantics of sophisticated sceneries is a key module in plenty of artificial intelligence (AI) systems. In this article, we optimally fuse multi-channel perceptual visual features for recognizing scenic pictures with complex spatial configurations, focusing on formulating a deep hierarchical model to actively discover human gaze allocation. In detail, to uncover semantically/visually important patches within each scenery, we utilize the BING objectness descriptor to rapidly and accurately localize multi-scale objects or their components. Subsequently, a local-global feature fusion scenario is proposed to dynamically combine the multiple low-level features from multiple scenic patches. To simulate how humans perceiving semantically/visually important scenic patches, we design a robust deep active learning (RDAL) paradigm that sequentially derives gaze shift path (GSP) and hierarchically learns deep GSP features in a unified architecture. Notably, the key advantage of RDAL is the high tolerance of label noise by adding an elaborately-designed sparse penalty. That is, the contaminated and redundant deep GSP features can be implicitly abandoned. Finally, the refined deep GSP features are integrated into a multi-label SVM for recognizing sceneries of different categories. Empirical comparisons showed that: 1) our method performs competitively on six generic scenery set (average accuracy 2% similar to 4.3% higher than the second best performer), and 2) our deep GSP feature is particularly discriminative to our compiled sport educational image set (average accuracy 7.7% higher than the second best performer).
引用
收藏
页码:41168 / 41179
页数:12
相关论文
共 50 条
  • [1] Profiles of visual perceptual learning in feature space
    Shen, Shiqi
    Sun, Yueling
    Lu, Jiachen
    Li, Chu
    Chen, Qinglin
    Mo, Ce
    Fang, Fang
    Zhang, Xilin
    [J]. ISCIENCE, 2024, 27 (03)
  • [2] Interference and feature specificity in visual perceptual learning
    Yotsumoto, Yuko
    Chang, Li-hung
    Watanabe, Takeo
    Sasaki, Yuka
    [J]. VISION RESEARCH, 2009, 49 (21) : 2611 - 2623
  • [3] A Visual Perceptual Descriptor with Depth Feature for Image Retrieval
    Wang, Tianyang
    Qin, Zhengrui
    [J]. NINTH INTERNATIONAL CONFERENCE ON DIGITAL IMAGE PROCESSING (ICDIP 2017), 2017, 10420
  • [4] Evidence for feature and location learning in human visual perceptual learning
    Manuela Moreno-Fernandez, Maria
    Mohd Salleh, Nurizzati
    Prados, Jose
    [J]. PSICOLOGICA, 2015, 36 (02): : 185 - 204
  • [5] Improving Visual Representation Learning through Perceptual Understanding
    Tukra, Samyakh
    Hoffman, Frederick
    Chatfield, Ken
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 14486 - 14495
  • [6] Visual perceptual learning of a task-irrelevant feature of the stimulus
    Galliussi, Jessica
    Grzeczkowski, Lukasz
    Gerbino, Walter
    Herzog, Michael
    Bernardis, Paolo
    [J]. PERCEPTION, 2016, 45 : 321 - 321
  • [7] SPATIAL PROGRESSION OF PERCEPTUAL LEARNING IN VISUAL FEATURE CONJUNCTION SEARCH
    Reavis, Eric
    Frank, Sebastian
    Tse, Peter
    [J]. JOURNAL OF COGNITIVE NEUROSCIENCE, 2013, : 233 - 233
  • [8] Visual perceptual learning
    Lu, Zhong-Lin
    Hua, Tianmiao
    Huang, Chang-Bing
    Zhou, Yifeng
    Dosher, Barbara Anne
    [J]. NEUROBIOLOGY OF LEARNING AND MEMORY, 2011, 95 (02) : 145 - 151
  • [9] Immersion into visual media: New applications of image understanding
    Kanade, T
    [J]. IEEE INTELLIGENT SYSTEMS & THEIR APPLICATIONS, 1996, 11 (01): : 73 - 80
  • [10] Visual perceptual learning
    Shi, ZZ
    Li, QY
    Zheng, Z
    [J]. PROCEEDINGS OF THE 2005 INTERNATIONAL CONFERENCE ON NEURAL NETWORKS AND BRAIN, VOLS 1-3, 2005, : PL75 - PL80