Perceptual Visual Feature Learning With Applications in Sports Educational Image Understanding

Times cited: 0

Authors:

Liu, Tengsheng [1]

Xu, Minghui [2]

Affiliations:

[1] Wuhan Inst Technol, Dept Phys Educ, Wuhan 430070, Peoples R China

[2] Jinhua Polytech, Key Lab Crop Harvesting Equipment Technol Zhejiang, Jinhua 321017, Peoples R China

Source:

IEEE ACCESS | 2024 / Vol. 12

Keywords:

Perceptual; feature fusion; local-global; active learning; deep architecture; SCENE; CLASSIFICATION; SEGMENTATION; MANIFOLD; MODEL;

DOI:

10.1109/ACCESS.2024.3377657

Chinese Library Classification number:

TP [Automation technology, computer technology];

Discipline classification code:

0812;

Abstract:

Effectively understanding the semantics of complex scenes is a key component of many artificial intelligence (AI) systems. In this article, we fuse multi-channel perceptual visual features to recognize scenic pictures with complex spatial configurations, focusing on a deep hierarchical model that actively discovers human gaze allocation. In detail, to uncover semantically or visually important patches within each scene, we use the BING objectness descriptor to rapidly and accurately localize multi-scale objects or their components. Subsequently, a local-global feature fusion scheme is proposed to dynamically combine the multiple low-level features from multiple scenic patches. To simulate how humans perceive semantically or visually important scenic patches, we design a robust deep active learning (RDAL) paradigm that sequentially derives the gaze shift path (GSP) and hierarchically learns deep GSP features in a unified architecture. Notably, the key advantage of RDAL is its high tolerance to label noise, achieved by adding an elaborately designed sparse penalty. That is, contaminated and redundant deep GSP features can be implicitly discarded. Finally, the refined deep GSP features are fed into a multi-label SVM to recognize sceneries of different categories. Empirical comparisons showed that: 1) our method performs competitively on six generic scenery sets (average accuracy 2% to 4.3% higher than the second-best performer), and 2) our deep GSP feature is particularly discriminative on our compiled sports educational image set (average accuracy 7.7% higher than the second-best performer).
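The abstract does not give the form of the sparse penalty, so the following is only an illustrative sketch of the general idea that a sparsity-inducing penalty can discard whole feature dimensions. It assumes a row-wise L2,1-style penalty applied through its proximal operator (group soft-thresholding); the weight matrix, the threshold `lam`, and the function names are hypothetical and are not taken from the paper.

```python
import math


def group_soft_threshold(rows, lam):
    """Proximal operator of lam * (sum of row L2 norms), an L2,1-style penalty.

    Each row is shrunk toward zero; a row whose norm is at most lam becomes
    exactly zero, which is how such a penalty can drop an entire feature.
    """
    out = []
    for row in rows:
        norm = math.sqrt(sum(v * v for v in row))
        scale = max(0.0, 1.0 - lam / norm) if norm > 0 else 0.0
        out.append([scale * v for v in row])
    return out


def kept_features(rows):
    """Indices of rows that survived shrinkage (at least one non-zero entry)."""
    return [i for i, row in enumerate(rows) if any(v != 0.0 for v in row)]


# Hypothetical weight matrix: one row per feature dimension (assumption).
weights = [
    [3.0, 4.0],   # norm 5.0: strong feature
    [0.3, 0.4],   # norm about 0.5: weak, treated here as noisy or redundant
    [0.0, 0.0],   # already unused
    [-6.0, 8.0],  # norm 10.0: strong feature
]

shrunk = group_soft_threshold(weights, lam=1.0)
selected = kept_features(shrunk)
print(selected)
```

In this toy setting the weak and unused rows are zeroed out while the two strong rows are only shrunk, so a downstream classifier would see just the surviving dimensions.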


Pages: 41168-41179

Page count: 12
