Fine-Grained Image Classification Based on Multi-Modal Features and Enhanced Alignment

被引:0
|
作者
Han, Jing [1 ]
Zhang, Tianpeng [1 ]
Lyu, Xueqiang [1 ]
机构
[1] Beijing Key Laboratory of Internet Culture and Digital Dissemination Research, Beijing Information Science and Technology University, Beijing,100101, China
关键词
D O I
10.13190/j.jbupt.2023-140
中图分类号
学科分类号
摘要
13
引用
收藏
页码:130 / 135
相关论文
共 50 条
  • [1] Multi-modal Knowledge-Enhanced Fine-Grained Image Classification
    Cheng, Suyan
    Zhang, Feifei
    Zhou, Haoliang
    Xu, Changsheng
    PATTERN RECOGNITION AND COMPUTER VISION, PT V, PRCV 2024, 2025, 15035 : 333 - 346
  • [2] MKTformer: Fine-grained Meter Classification Based on Multi-modal Knowledge Transfer
    Zheng, Zhaoye
    Zhang, Ke
    Shi, Chaojun
    Zheng, Fei
    2023 ASIA PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE, APSIPA ASC, 2023, : 1564 - 1570
  • [3] Multi-Modal Reasoning Graph for Scene-Text Based Fine-Grained Image Classification and Retrieval
    Mafla, Andres
    Dey, Sounak
    Biten, Ali Furkan
    Gomez, Lluis
    Karatzas, Dimosthenis
    2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WACV 2021, 2021, : 4022 - 4032
  • [4] Multi-modal hierarchical fusion network for fine-grained paper classification
    Tan Yue
    Yong Li
    Jiedong Qin
    Zonghai Hu
    Multimedia Tools and Applications, 2024, 83 : 31527 - 31543
  • [5] Multi-modal hierarchical fusion network for fine-grained paper classification
    Yue, Tan
    Li, Yong
    Qin, Jiedong
    Hu, Zonghai
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (11) : 31527 - 31543
  • [6] Learning enhanced features and inferring twice for fine-grained image classification
    Nie, Xuan
    Chai, Bosong
    Wang, Luyao
    Liao, Qiyu
    Xu, Min
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (10) : 14799 - 14813
  • [7] Learning enhanced features and inferring twice for fine-grained image classification
    Xuan Nie
    Bosong Chai
    Luyao Wang
    Qiyu Liao
    Min Xu
    Multimedia Tools and Applications, 2023, 82 : 14799 - 14813
  • [8] Fine-Grained Context and Multi-modal Alignment for Freehand 3D Ultrasound Reconstruction
    Yan, Zhongnuo
    Yang, Xin
    Luo, Mingyuan
    Chen, Jiongquan
    Chen, Rusi
    Liu, Lian
    Ni, Dong
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2024, PT VII, 2024, 15007 : 340 - 349
  • [9] Multi-Modal Domain Adaptation for Fine-Grained Action Recognition
    Munro, Jonathan
    Damen, Dima
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 119 - 129
  • [10] Multi-Modal Domain Adaptation for Fine-grained Action Recognition
    Munro, Jonathan
    Damen, Dima
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 3723 - 3726