Perceptual Visual Feature Learning With Applications in Sports Educational Image Understanding

被引：0

作者：

Liu, Tengsheng ^{[1
]}

Xu, Minghui ^{[2
]}

机构：

[1] Wuhan Inst Technol, Dept Phys Educ, Wuhan 430070, Peoples R China

[2] Jinhua Polytech, Key Lab Crop Harvesting Equipment Technol Zhejiang, Jinhua 321017, Peoples R China

来源：

IEEE ACCESS | 2024年 / 12卷

关键词：

Perceptual; feature fusion; local-global; active learning; deep architecture; SCENE; CLASSIFICATION; SEGMENTATION; MANIFOLD; MODEL;

D O I：

10.1109/ACCESS.2024.3377657

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Effectively understanding the semantics of sophisticated sceneries is a key module in plenty of artificial intelligence (AI) systems. In this article, we optimally fuse multi-channel perceptual visual features for recognizing scenic pictures with complex spatial configurations, focusing on formulating a deep hierarchical model to actively discover human gaze allocation. In detail, to uncover semantically/visually important patches within each scenery, we utilize the BING objectness descriptor to rapidly and accurately localize multi-scale objects or their components. Subsequently, a local-global feature fusion scenario is proposed to dynamically combine the multiple low-level features from multiple scenic patches. To simulate how humans perceiving semantically/visually important scenic patches, we design a robust deep active learning (RDAL) paradigm that sequentially derives gaze shift path (GSP) and hierarchically learns deep GSP features in a unified architecture. Notably, the key advantage of RDAL is the high tolerance of label noise by adding an elaborately-designed sparse penalty. That is, the contaminated and redundant deep GSP features can be implicitly abandoned. Finally, the refined deep GSP features are integrated into a multi-label SVM for recognizing sceneries of different categories. Empirical comparisons showed that: 1) our method performs competitively on six generic scenery set (average accuracy 2% similar to 4.3% higher than the second best performer), and 2) our deep GSP feature is particularly discriminative to our compiled sport educational image set (average accuracy 7.7% higher than the second best performer).

引用

页码：41168 / 41179

页数：12

共 50 条

[1] Profiles of visual perceptual learning in feature space
Shen, Shiqi
Sun, Yueling
Lu, Jiachen
Li, Chu
Chen, Qinglin
Mo, Ce
Fang, Fang
Zhang, Xilin
[J]. ISCIENCE, 2024, 27 (03)
[2] Interference and feature specificity in visual perceptual learning
Yotsumoto, Yuko
Chang, Li-hung
Watanabe, Takeo
Sasaki, Yuka
[J]. VISION RESEARCH, 2009, 49 (21) : 2611 - 2623
[3] A Visual Perceptual Descriptor with Depth Feature for Image Retrieval
Wang, Tianyang
Qin, Zhengrui
[J]. NINTH INTERNATIONAL CONFERENCE ON DIGITAL IMAGE PROCESSING (ICDIP 2017), 2017, 10420
[4] Evidence for feature and location learning in human visual perceptual learning
Manuela Moreno-Fernandez, Maria
Mohd Salleh, Nurizzati
Prados, Jose
[J]. PSICOLOGICA, 2015, 36 (02): : 185 - 204
[5] Improving Visual Representation Learning through Perceptual Understanding
Tukra, Samyakh
Hoffman, Frederick
Chatfield, Ken
[J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 14486 - 14495
[6] Visual perceptual learning of a task-irrelevant feature of the stimulus
Galliussi, Jessica
Grzeczkowski, Lukasz
Gerbino, Walter
Herzog, Michael
Bernardis, Paolo
[J]. PERCEPTION, 2016, 45 : 321 - 321
[7] SPATIAL PROGRESSION OF PERCEPTUAL LEARNING IN VISUAL FEATURE CONJUNCTION SEARCH
Reavis, Eric
Frank, Sebastian
Tse, Peter
[J]. JOURNAL OF COGNITIVE NEUROSCIENCE, 2013, : 233 - 233
[8] Visual perceptual learning
Lu, Zhong-Lin
Hua, Tianmiao
Huang, Chang-Bing
Zhou, Yifeng
Dosher, Barbara Anne
[J]. NEUROBIOLOGY OF LEARNING AND MEMORY, 2011, 95 (02) : 145 - 151
[9] Immersion into visual media: New applications of image understanding
Kanade, T
[J]. IEEE INTELLIGENT SYSTEMS & THEIR APPLICATIONS, 1996, 11 (01): : 73 - 80
[10] Visual perceptual learning
Shi, ZZ
Li, QY
Zheng, Z
[J]. PROCEEDINGS OF THE 2005 INTERNATIONAL CONFERENCE ON NEURAL NETWORKS AND BRAIN, VOLS 1-3, 2005, : PL75 - PL80

← 1 2 3 4 5 →