Deep feature voting: a semantic-driven and local context-aware approach for image classification

Citations: 0
Authors
Xu, Ye [1]
Duan, Lihua [1]
Huang, Conggui [1]
Huang, Chongpeng [1]
Affiliations
[1] Wuxi Inst Technol, Sch IoT Technol, 1600 Gaolang West Rd, Wuxi 214121, Jiangsu, Peoples R China
Keywords
Image classification; Deep learning model; Deep feature; Voting; Decision tree; NEURAL-NETWORKS; FEATURE FUSION; CNN; MODELS
DOI
10.1007/s11042-023-17881-7
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
When a new image classification task with few training samples is tackled using pre-trained deep learning models, methods based on the Bag-of-Deep-Visual-Words (BoDVW) model have achieved higher classification accuracy across various tasks than directly using a new classification layer on the pre-trained model. These methods apply a sequence of operations to the input image (deep feature extraction, feature encoding, and feature pooling) to obtain an image representation vector, which is then fed to a classifier. However, they ignore two crucial aspects, the high-level semantic characteristics of deep features and their local context within the feature space, and this limits classification performance. To address this issue, we propose a new image classification method with a distinct workflow. Specifically, our method builds multiple decision trees on the set of labelled deep features extracted from training images and uses them to identify low-entropy local regions in the feature space. For a given image, the voting vector of each deep feature is calculated from the category label distributions of the low-entropy local regions in which the feature falls. This vector reflects the degree of support for the hypothesis that the feature belongs to each category. The voting vectors of all features are aggregated over image regions of different sizes and positions to obtain the representation vector of the image. The representation vectors of test images are then fed to Support Vector Machines (SVMs), trained on the representation vectors of the training images, to predict their categories.
Experimental results on six public datasets show that our method improves classification accuracy by 0.07% to 3.6% (0.8% on average) over two BoDVW methods, and by 0.1% to 10.69% (2.69% on average) over directly using the new classification layer of the pre-trained model. These results demonstrate that accounting for the high-level semantic characteristics of deep features and their local context within the feature space benefits image classification. The workflow also opens up new avenues for improving classification performance: increasing the number of local regions whose deep features originate primarily from one or a few image categories, improving the accuracy of low-entropy local region identification, and developing an end-to-end deep learning model based on this workflow. While its classification accuracy is comparable to that of recent works, our method therefore leaves notable room for further advances in image classification.
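The voting and aggregation steps described in the abstract can be illustrated with a minimal, standard-library Python sketch. It is not the authors' implementation. The function names (`entropy`, `voting_vector`, `pool_votes`), the entropy threshold `max_entropy`, the averaging of leaf label distributions, and the sum pooling over rectangular regions are all assumptions made for illustration; the abstract does not specify these details. Each decision-tree leaf is represented simply by the list of training labels that fall into it.

```python
import math


def entropy(counts):
    """Shannon entropy (in bits) of a list of per-class counts."""
    total = sum(counts)
    if total == 0:
        return 0.0
    return -sum((c / total) * math.log2(c / total) for c in counts if c > 0)


def voting_vector(leaf_labels_per_tree, num_classes, max_entropy):
    """Vote of one deep feature (hypothetical formulation).

    leaf_labels_per_tree holds, for each tree, the training labels found in
    the leaf (local region) where the feature lands. Only leaves whose label
    entropy is at most max_entropy contribute; their class distributions are
    averaged. Averaging is an assumption, not taken from the paper.
    """
    votes = [0.0] * num_classes
    used = 0
    for labels in leaf_labels_per_tree:
        if not labels:
            continue
        counts = [labels.count(k) for k in range(num_classes)]
        if entropy(counts) > max_entropy:
            continue  # skip high-entropy (ambiguous) regions
        for k in range(num_classes):
            votes[k] += counts[k] / len(labels)
        used += 1
    return [v / used for v in votes] if used else votes


def pool_votes(vote_grid, regions):
    """Sum-pool voting vectors over image regions and concatenate.

    vote_grid maps a feature's (row, col) position to its voting vector.
    Each region is a half-open rectangle (row0, row1, col0, col1).
    Sum pooling is an assumption, not taken from the paper.
    """
    num_classes = len(next(iter(vote_grid.values())))
    representation = []
    for r0, r1, c0, c1 in regions:
        pooled = [0.0] * num_classes
        for (r, c), vote in vote_grid.items():
            if r0 <= r < r1 and c0 <= c < c1:
                for k in range(num_classes):
                    pooled[k] += vote[k]
        representation.extend(pooled)
    return representation
```

In a full pipeline, the concatenated vector returned by `pool_votes` would play the role of the image representation vector that is passed to an SVM; that final step is omitted here.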
Pages: 58607-58643
Number of pages: 37