Deep feature voting: a semantic-driven and local context-aware approach for image classification

被引:0
|
作者
Xu, Ye [1 ]
Duan, Lihua [1 ]
Huang, Conggui [1 ]
Huang, Chongpeng [1 ]
机构
[1] Wuxi Inst Technol, Sch IoT Technol, 1600 Gaolang West Rd, Wuxi 214121, Jiangsu, Peoples R China
关键词
Image classification; Deep learning model; Deep feature; Voting; Decision tree; NEURAL-NETWORKS; FEATURE FUSION; CNN; MODELS;
D O I
10.1007/s11042-023-17881-7
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In the context of addressing new image classification tasks with insufficient training samples via pre-trained deep learning models, the methods based on the Bag-of-Deep-Visual-Words (BoDVW) model have achieved higher classification accuracy across various image classification tasks compared to directly using the new classification layer of the pre-trained model for classification. These methods perform a sequence of operations on the input image - deep feature extraction, feature encoding, and feature pooling - to obtain an image representation vector, which is then fed into classifiers for classification. However, they ignore two crucial aspects: the high-level semantic characteristics of deep features and their local context within the feature space, which limits the image classification performance. To address this issue, we propose a new image classification method with a unique workflow. Specifically, our method identifies low-entropy local regions in the feature space by constructing multiple decision trees, using the set of labelled deep features built from training images. For a given image, the voting vector of each deep feature from the image is calculated based on the category label distributions of the low-entropy local regions where it is located. This vector reflects the degree of support that the feature provides for the hypothesis that it belongs to each category. The voting vectors of all features are aggregated according to image regions of different sizes and positions to obtain the representation vector of the image. The representation vectors of testing images are input into Support Vector Machines (SVMs) trained using those of training images to predict their categories. Experimental results on six public datasets show that our method achieves higher classification accuracy by 0.07% to 3.6% (averaging at 0.8%) compared to two BoDVW methods, and by 0.1% to 10.69% (averaging at 2.69%) compared to directly using the new classification layer of the pre-trained model for classification. These results demonstrate the effectiveness of considering the high-level semantic characteristics of deep features and their local context within the feature space for image classification. Importantly, the unique workflow of our method opens up new potential avenues for improving classification performance. These include increasing the number of local regions where deep features primarily originate from one or a few image categories, improving the accuracy of low-entropy local region identification, and developing an end-to-end deep learning model based on this workflow. While maintaining classification accuracy comparable to recent works, our method offers notable potential for the advancement of the image classification field.
引用
收藏
页码:58607 / 58643
页数:37
相关论文
共 50 条
  • [1] Context-aware automated ICD coding: A semantic-driven approach
    Reshma, O. K.
    Saleena, N.
    Nazeer, K. A. Abdul
    INFORMATION SYSTEMS, 2025, 132
  • [2] Context-Aware Feature Selection and Classification
    Wang, Juanyan
    Bilgic, Mustafa
    PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 4317 - 4325
  • [3] Semantic Storytelling Automation: A Context-Aware and Metadata-Driven Approach
    Viana, Paula
    Carvalho, Pedro
    Andrade, Maria Teresa
    Jonker, Pieter P.
    Papanikolaou, Vasileios
    Teixeira, Ines N.
    Vilaca, Luis
    Pinto, Jose P.
    Costa, Tiago
    MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 4491 - 4493
  • [4] Geostatistics for Context-Aware Image Classification
    Codevilla, Felipe
    Botelho, Silvia S. C.
    Duarte, Nelson
    Purkis, Samuel
    Shihavuddin, A. S. M.
    Garcia, Rafael
    Gracias, Nuno
    COMPUTER VISION SYSTEMS (ICVS 2015), 2015, 9163 : 228 - 239
  • [5] Semantic Context-Aware Image Style Transfer
    Liao, Yi-Sheng
    Huang, Chun-Rong
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 1911 - 1923
  • [6] Semantic-Driven Context Aggregation Network for Underwater Image Enhancement
    Shi, Dongxiang
    Ma, Long
    Liu, Risheng
    Fan, Xin
    Luo, Zhongxuan
    PATTERN RECOGNITION AND COMPUTER VISION,, PT III, 2021, 13021 : 29 - 40
  • [7] An efficient context-aware approach for whole-slide image classification
    Shen, Hongru
    Wu, Jianghua
    Shen, Xilin
    Hu, Jiani
    Liu, Jilei
    Zhang, Qiang
    Sun, Yan
    Chen, Kexin
    Li, Xiangchun
    ISCIENCE, 2023, 26 (12)
  • [8] Context-Aware Residual Module for Image Classification
    Bai, Jing
    Chen, Ran
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 3388 - 3395
  • [9] Context-Aware Image Inpainting with Learned Semantic Priors
    Zhang, Wendong
    Zhu, Junwei
    Tai, Ying
    Wang, Yunbo
    Chu, Wenqing
    Ni, Bingbing
    Wang, Chengjie
    Yang, Xiaokang
    PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 1323 - 1329
  • [10] Multiscale Context-Aware Ensemble Deep KELM for Efficient Hyperspectral Image Classification
    Xi, Bobo
    Li, Jiaojiao
    Li, Yunsong
    Song, Rui
    Sun, Weiwei
    Du, Qian
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2021, 59 (06): : 5114 - 5130