Adopting Attention and Cross-Layer Features for Fine-Grained Representation

Cited by: 3
Authors
Sun, Fayou [1]
Ngo, Hea Choon [1]
Sek, Yong Wee [1]
Affiliations
[1] Univ Teknikal Malaysia Melaka, Fac Informat & Commun Technol, Ctr Adv Comp Technol, Durian Tunggal 76100, Malacca, Malaysia
Keywords
Feature extraction; Representation learning; Semantics; Transformers; Sun; Convolution; Task analysis; Associating cross-layer features; attention-based operations; self-attention; CLNET;
DOI
10.1109/ACCESS.2022.3195907
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Fine-grained visual classification (FGVC) is a challenging task because it depends on discriminative feature representations. Attention-based methods show great potential for FGVC, but they neglect that mining inter-layer feature relations in depth helps refine feature learning. Similarly, methods that associate cross-layer features achieve significant feature enhancement, but they lose the long-distance dependencies between elements. Most previous research treats these two approaches independently and overlooks that they are mutually correlated and can reinforce feature learning together. We therefore combine the respective advantages of the two approaches to promote fine-grained feature representations. In this paper, we propose a novel network, CLNET, which applies an attention mechanism and cross-layer features to obtain feature representations. Specifically, CLNET 1) adopts self-attention to capture long-range dependencies for each element, 2) associates cross-layer features to reinforce feature learning, and 3) integrates attention-based operations between output and input to cover more feature regions. Experiments verify that CLNET yields new state-of-the-art performance on three widely used fine-grained benchmarks: CUB-200-2011, Stanford Cars, and FGVC-Aircraft. Our code is available at https://github.com/dlearing/CLNET.git.
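The first two ideas in the abstract (self-attention over feature elements, then association of features from different layers) can be illustrated with a toy sketch. This is not the authors' implementation: the function names `self_attention` and `fuse_cross_layer`, the identity query/key/value projections, and the element-wise sum used for fusion are all assumptions made for illustration, and the third component (attention between output and input) is not covered.

```python
import math


def softmax(row):
    """Numerically stable softmax over a list of floats."""
    m = max(row)
    exps = [math.exp(x - m) for x in row]
    total = sum(exps)
    return [e / total for e in exps]


def self_attention(tokens):
    """Scaled dot-product self-attention.

    Assumption: queries, keys, and values are the tokens themselves
    (identity projections), so every element attends to every other
    element regardless of distance.
    """
    d = len(tokens[0])
    out = []
    for q in tokens:
        scores = [sum(a * b for a, b in zip(q, k)) / math.sqrt(d) for k in tokens]
        weights = softmax(scores)
        out.append([sum(w * v[j] for w, v in zip(weights, tokens)) for j in range(d)])
    return out


def fuse_cross_layer(shallow, deep):
    """Associate two layers' features by element-wise sum (hypothetical choice)."""
    return [[a + b for a, b in zip(s, t)] for s, t in zip(shallow, deep)]


# Three feature elements of dimension 2 standing in for a shallow layer's output.
shallow = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
deep = self_attention(shallow)           # features after long-range mixing
fused = fuse_cross_layer(shallow, deep)  # cross-layer association
```

Each attended row is a convex combination of the input rows, so the fused features keep the shallow layer's local detail while adding context from all other elements.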
Pages: 82376-82383
Number of pages: 8