Deep feature voting: a semantic-driven and local context-aware approach for image classification

被引:0
|
作者
Xu, Ye [1 ]
Duan, Lihua [1 ]
Huang, Conggui [1 ]
Huang, Chongpeng [1 ]
机构
[1] Wuxi Inst Technol, Sch IoT Technol, 1600 Gaolang West Rd, Wuxi 214121, Jiangsu, Peoples R China
关键词
Image classification; Deep learning model; Deep feature; Voting; Decision tree; NEURAL-NETWORKS; FEATURE FUSION; CNN; MODELS;
D O I
10.1007/s11042-023-17881-7
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In the context of addressing new image classification tasks with insufficient training samples via pre-trained deep learning models, the methods based on the Bag-of-Deep-Visual-Words (BoDVW) model have achieved higher classification accuracy across various image classification tasks compared to directly using the new classification layer of the pre-trained model for classification. These methods perform a sequence of operations on the input image - deep feature extraction, feature encoding, and feature pooling - to obtain an image representation vector, which is then fed into classifiers for classification. However, they ignore two crucial aspects: the high-level semantic characteristics of deep features and their local context within the feature space, which limits the image classification performance. To address this issue, we propose a new image classification method with a unique workflow. Specifically, our method identifies low-entropy local regions in the feature space by constructing multiple decision trees, using the set of labelled deep features built from training images. For a given image, the voting vector of each deep feature from the image is calculated based on the category label distributions of the low-entropy local regions where it is located. This vector reflects the degree of support that the feature provides for the hypothesis that it belongs to each category. The voting vectors of all features are aggregated according to image regions of different sizes and positions to obtain the representation vector of the image. The representation vectors of testing images are input into Support Vector Machines (SVMs) trained using those of training images to predict their categories. Experimental results on six public datasets show that our method achieves higher classification accuracy by 0.07% to 3.6% (averaging at 0.8%) compared to two BoDVW methods, and by 0.1% to 10.69% (averaging at 2.69%) compared to directly using the new classification layer of the pre-trained model for classification. These results demonstrate the effectiveness of considering the high-level semantic characteristics of deep features and their local context within the feature space for image classification. Importantly, the unique workflow of our method opens up new potential avenues for improving classification performance. These include increasing the number of local regions where deep features primarily originate from one or a few image categories, improving the accuracy of low-entropy local region identification, and developing an end-to-end deep learning model based on this workflow. While maintaining classification accuracy comparable to recent works, our method offers notable potential for the advancement of the image classification field.
引用
收藏
页码:58607 / 58643
页数:37
相关论文
共 50 条
  • [31] MULTICAO: A SEMANTIC APPROACH TO CONTEXT-AWARE ADAPTATION DECISION TAKING
    Barbosa, Vitor
    Andrade, Maria Teresa
    2009 10TH INTERNATIONAL WORKSHOP ON IMAGE ANALYSIS FOR MULTIMEDIA INTERACTIVE SERVICES, 2009, : 133 - 136
  • [32] Semantic context-aware attention UNET for lung cancer segmentation and classification
    Balachandran, Sangeetha
    Ranganathan, Vidhyapriya
    INTERNATIONAL JOURNAL OF IMAGING SYSTEMS AND TECHNOLOGY, 2023, 33 (03) : 822 - 836
  • [33] Feature-attention module for context-aware image-to-image translation
    Jing Bai
    Ran Chen
    Min Liu
    The Visual Computer, 2020, 36 : 2145 - 2159
  • [34] Feature-attention module for context-aware image-to-image translation
    Bai, Jing
    Chen, Ran
    Liu, Min
    VISUAL COMPUTER, 2020, 36 (10-12): : 2145 - 2159
  • [35] An Approach for Feature Modeling of Context-Aware Software Product Line
    Fernandes, Paula
    Werner, Claudia
    Teixeira, Eldanae
    JOURNAL OF UNIVERSAL COMPUTER SCIENCE, 2011, 17 (05) : 807 - 829
  • [36] Context-aware and local-aware fusion with transformer for medical image segmentation
    Xiao, Hanguang
    Li, Li
    Liu, Qiyuan
    Zhang, Qihang
    Liu, Junqi
    Liu, Zhi
    PHYSICS IN MEDICINE AND BIOLOGY, 2024, 69 (02):
  • [37] An automatic histopathological image segmentation network based on global context-aware module and deep feature aggregation
    Shi, Xu
    Zhou, Fanlin
    Wang, Long
    Fu, Yan
    Wu, Ruoyu
    Wu, Jian
    Li, Yu
    Huang, Hong
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 247
  • [38] CFDIL: a context-aware feature deep interaction learning for app recommendation
    Qingbo Hao
    Ke Zhu
    Chundong Wang
    Peng Wang
    Xiuliang Mo
    Zhen Liu
    Soft Computing, 2022, 26 : 4755 - 4770
  • [39] Target tracking algorithm based on context-aware deep feature compression
    Wang Y.
    Wang A.
    Wang R.
    Liu H.
    Iwahori Y.
    International Journal of Performability Engineering, 2019, 15 (07) : 1802 - 1812
  • [40] CFDIL: a context-aware feature deep interaction learning for app recommendation
    Hao, Qingbo
    Zhu, Ke
    Wang, Chundong
    Wang, Peng
    Mo, Xiuliang
    Liu, Zhen
    SOFT COMPUTING, 2022, 26 (10) : 4755 - 4770