Transformer-based neural architecture search for effective visible-infrared person re-identification

被引：0

作者：

Sarker, Prodip Kumar ^{[1
]}

机构：

[1] Begum Rokeya Univ, Dept Comp Sci & Engn, Rangpur 5400, Bangladesh

来源：

NEUROCOMPUTING | 2025年 / 620卷

关键词：

Transformer; Neural architecture search; Attention mechanism; Feature extraction; Cross-modality;

D O I：

10.1016/j.neucom.2024.129257

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Visible-infrared person re-identification (VI-reID) is a complex task insecurity and video surveillance that aims to identify and match a person captured by various non-overlapping cameras. In recent years, there has been a notable advancement in reID owing to the development of transformer-based architectures. Although many existing methods emphasize on learning both modality-specific and shared features, challenges remain in fully exploiting the complementary information between infrared and visible modalities. Consequently, there is still opportunity to increase retrieval performance by effectively comprehending and integrating cross- modality semantic information. These designs often have problems with model complexity and time-consuming processes. To tackle these issues, we employ a novel transformer-based neural architecture search (TNAS) deep learning approach for effective VI-reID. To alleviate modality gaps, we first introduce a global-local transformer (GLT) module that captures features at both global and local levels across different modalities, contributing to better feature representation and matching. Then, an efficient neural architecture search (NAS) module is developed to search for the optimal transformer-based architecture, which further enhances the performance of VI-reID. Additionally, we introduce distillation loss and modality discriminative (MD) loss to examine the potential consistency between different modalities to promote intermodality separation between classes and intramodality compactness within classes. Experimental results on two challenging benchmark datasets illustrate that our developed model achieves state-of-the-art results, outperforming existing VI-reID methods.

引用

页数：10

共 50 条

[1] A simple but effective vision transformer framework for visible-infrared person re-identification
Li, Yudong
Zhao, Sanyuan
Shen, Jianbing
COMPUTER VISION AND IMAGE UNDERSTANDING, 2024, 249
[2] Attributes Based Visible-Infrared Person Re-identification
Zheng, Aihua
Feng, Mengya
Pan, Peng
Jiang, Bo
Luo, Bin
PATTERN RECOGNITION AND COMPUTER VISION, PT I, PRCV 2022, 2022, 13534 : 254 - 266
[3] Cross-Modality Transformer for Visible-Infrared Person Re-Identification
Jiang, Kongzhu
Zhang, Tianzhu
Liu, Xiang
Qian, Bingqiao
Zhang, Yongdong
Wu, Feng
COMPUTER VISION - ECCV 2022, PT XIV, 2022, 13674 : 480 - 496
[4] A guidance and alignment transformer model for visible-infrared person re-identification
Huang, Linyu
Xue, Zijie
Ning, Qian
Guo, Yong
Li, Yongsheng
MULTIMEDIA SYSTEMS, 2025, 31 (02)
[5] Occluded Visible-Infrared Person Re-Identification
Feng, Yujian
Ji, Yimu
Wu, Fei
Gao, Guangwei
Gao, Yang
Liu, Tianliang
Liu, Shangdong
Jing, Xiao-Yuan
Luo, Jiebo
IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 1401 - 1413
[6] CM-NAS: Cross-Modality Neural Architecture Search for Visible-Infrared Person Re-Identification
Fu, Chaoyou
Hu, Yibo
Wu, Xiang
Shi, Hailin
Mei, Tao
He, Ran
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 11803 - 11812
[7] Spatial-Channel Enhanced Transformer for Visible-Infrared Person Re-Identification
Zhao, Jiaqi
Wang, Hanzheng
Zhou, Yong
Yao, Rui
Chen, Silin
Saddik, Abdulmotaleb El
IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 3668 - 3680
[8] Structure-Aware Positional Transformer for Visible-Infrared Person Re-Identification
Chen, Cuiqun
Ye, Mang
Qi, Meibin
Wu, Jingjing
Jiang, Jianguo
Lin, Chia-Wen
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 2352 - 2364
[9] Interaction and Alignment for Visible-Infrared Person Re-Identification
Gong, Jiahao
Zhao, Sanyuan
Lam, Kin-Man
2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 2253 - 2259
[10] Visible-Infrared Person Re-Identification via Cross-Modality Interaction Transformer
Feng, Yujian
Yu, Jian
Chen, Feng
Ji, Yimu
Wu, Fei
Liu, Shangdon
Jing, Xiao-Yuan
IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 7647 - 7659

← 1 2 3 4 5 →