DSAP: Dynamic Sparse Attention Perception Matcher for Accurate Local Feature Matching
被引:0
|
作者:
Dai, Kun
论文数: 0引用数: 0
h-index: 0
机构:
Harbin Inst Technol, State Key Lab Robot & Syst, Harbin 150006, Peoples R ChinaHarbin Inst Technol, State Key Lab Robot & Syst, Harbin 150006, Peoples R China
Dai, Kun
[1
]
Wang, Ke
论文数: 0引用数: 0
h-index: 0
机构:
Harbin Inst Technol, State Key Lab Robot & Syst, Harbin 150006, Peoples R China
Harbin Inst Technol, Zhengzhou Res Inst, Harbin 150006, Peoples R ChinaHarbin Inst Technol, State Key Lab Robot & Syst, Harbin 150006, Peoples R China
Wang, Ke
[2
,3
]
Xie, Tao
论文数: 0引用数: 0
h-index: 0
机构:
Harbin Inst Technol, State Key Lab Robot & Syst, Harbin 150006, Peoples R China
Yangtze River Delta HIT Robot Technol Res Inst, Wuhu 241000, Peoples R ChinaHarbin Inst Technol, State Key Lab Robot & Syst, Harbin 150006, Peoples R China
Xie, Tao
[1
,4
]
Sun, Tao
论文数: 0引用数: 0
h-index: 0
机构:
Yangtze River Delta HIT Robot Technol Res Inst, Wuhu 241000, Peoples R ChinaHarbin Inst Technol, State Key Lab Robot & Syst, Harbin 150006, Peoples R China
Sun, Tao
[4
]
Zhang, Jinhang
论文数: 0引用数: 0
h-index: 0
机构:
Harbin Inst Technol, State Key Lab Robot & Syst, Harbin 150006, Peoples R ChinaHarbin Inst Technol, State Key Lab Robot & Syst, Harbin 150006, Peoples R China
Zhang, Jinhang
[1
]
Kong, Qingjia
论文数: 0引用数: 0
h-index: 0
机构:
Harbin Inst Technol, State Key Lab Robot & Syst, Harbin 150006, Peoples R ChinaHarbin Inst Technol, State Key Lab Robot & Syst, Harbin 150006, Peoples R China
Kong, Qingjia
[1
]
Jiang, Zhiqiang
论文数: 0引用数: 0
h-index: 0
机构:
Harbin Inst Technol, State Key Lab Robot & Syst, Harbin 150006, Peoples R ChinaHarbin Inst Technol, State Key Lab Robot & Syst, Harbin 150006, Peoples R China
Jiang, Zhiqiang
[1
]
Li, Ruifeng
论文数: 0引用数: 0
h-index: 0
机构:
Harbin Inst Technol, State Key Lab Robot & Syst, Harbin 150006, Peoples R ChinaHarbin Inst Technol, State Key Lab Robot & Syst, Harbin 150006, Peoples R China
Li, Ruifeng
[1
]
Zhao, Lijun
论文数: 0引用数: 0
h-index: 0
机构:
Harbin Inst Technol, State Key Lab Robot & Syst, Harbin 150006, Peoples R China
Harbin Inst Technol, Zhengzhou Res Inst, Harbin 150006, Peoples R ChinaHarbin Inst Technol, State Key Lab Robot & Syst, Harbin 150006, Peoples R China
Zhao, Lijun
[2
,3
]
Omar, Mohamed
论文数: 0引用数: 0
h-index: 0
机构:
Harbin Inst Technol, State Key Lab Robot & Syst, Harbin 150006, Peoples R ChinaHarbin Inst Technol, State Key Lab Robot & Syst, Harbin 150006, Peoples R China
Omar, Mohamed
[1
]
机构:
[1] Harbin Inst Technol, State Key Lab Robot & Syst, Harbin 150006, Peoples R China
[2] Harbin Inst Technol, State Key Lab Robot & Syst, Harbin 150006, Peoples R China
[3] Harbin Inst Technol, Zhengzhou Res Inst, Harbin 150006, Peoples R China
[4] Yangtze River Delta HIT Robot Technol Res Inst, Wuhu 241000, Peoples R China
Deep learning;
dynamic attention perception;
local feature matching;
relative pose estimation;
sparse attention;
visual localization;
D O I:
10.1109/TIM.2024.3370781
中图分类号:
TM [电工技术];
TN [电子技术、通信技术];
学科分类号:
0808 ;
0809 ;
摘要:
Local feature matching, which aims to establish the matches between image pairs, is a pivotal component of multiple visual applications. While current transformer-based works exhibit remarkable performance, they mechanically alternate self- and cross-attention in a predetermined order without considering their prioritization, culminating in inadequate enhancement of visual descriptors. Moreover, when calculating attention matrices to integrate global context, current methods only explicitly model the correlation among the feature channels without taking their importance into account, leaving insufficient message propagation. In this work, we develop a dynamic sparse attention perception (DSAP) matcher to tackle the aforementioned issues. To resolve the first issue, DSAP presents a dynamic perception strategy (DPS) that enables the network to dynamically implement feature enhancement via modifying both forward and backward propagation. During forward propagation, DPS assigns a learnable perception score to each transformer layer and employs an exponential moving average algorithm (EMA) to calculate the current score. After that, DPS utilizes an indicator function to binarize the score, allowing DSAP to adaptively determine the appropriate utilization of self- or cross-attention at the current iteration. During backward propagation, DPS employs a gradient estimator that adjusts the gradient of perception scores, thus rendering them differentiable. To tackle the second issue, DSAP introduces a weighted sparse transformer (WSFormer) that recalibrates attention matrices by concurrently considering both channel importance and channel correlation. WSFormer predicts attention vectors to weight attention matrices while constructing multiple sparse attention matrices to integrate various global messages, thus highlighting informative channels and inhibiting redundant message propagation. Extensive experiments in public datasets and real environments demonstrate that DSAP achieves exceptional performances across various downstream tasks, including relative pose estimation and visual localization. The code is available at https://github.com/mooncake199809/DSAP.
机构:
Cent South Univ, Sch Automat, Changsha, Hunan, Peoples R ChinaCent South Univ, Sch Automat, Changsha, Hunan, Peoples R China
Wang, Yun
Song, Mengmeng
论文数: 0引用数: 0
h-index: 0
机构:
Cent South Univ, Sch Automat, Changsha, Hunan, Peoples R China
Harbin Inst Technol, Sch Elect Engn & Automat, Harbin, Heilongjiang, Peoples R ChinaCent South Univ, Sch Automat, Changsha, Hunan, Peoples R China
Song, Mengmeng
Yang, Dazhi
论文数: 0引用数: 0
h-index: 0
机构:
Harbin Inst Technol, Sch Elect Engn & Automat, Harbin, Heilongjiang, Peoples R ChinaCent South Univ, Sch Automat, Changsha, Hunan, Peoples R China
机构:
Southwest Jiaotong Univ, Sch Comp & Artificial Intelligence, Chengdu 611756, Peoples R China
Southwest Jiaotong Univ, Natl Engn Lab Integrated Transportat Big Data Appl, Chengdu 611756, Peoples R China
Minist Educ, Engn Res Ctr Sustainable Urban Intelligent Transpo, Chengdu 611756, Peoples R China
Southwest Jiaotong Univ, Key Lab Sichuan Prov, Mfg Ind Chains Collaborat & Informat Support Techn, Chengdu 611756, Peoples R ChinaSouthwest Jiaotong Univ, Sch Comp & Artificial Intelligence, Chengdu 611756, Peoples R China
Yang, Yong
Chen, Hongmei
论文数: 0引用数: 0
h-index: 0
机构:
Southwest Jiaotong Univ, Sch Comp & Artificial Intelligence, Chengdu 611756, Peoples R China
Southwest Jiaotong Univ, Natl Engn Lab Integrated Transportat Big Data Appl, Chengdu 611756, Peoples R China
Minist Educ, Engn Res Ctr Sustainable Urban Intelligent Transpo, Chengdu 611756, Peoples R China
Southwest Jiaotong Univ, Key Lab Sichuan Prov, Mfg Ind Chains Collaborat & Informat Support Techn, Chengdu 611756, Peoples R ChinaSouthwest Jiaotong Univ, Sch Comp & Artificial Intelligence, Chengdu 611756, Peoples R China
Chen, Hongmei
Mi, Yong
论文数: 0引用数: 0
h-index: 0
机构:
Southwest Jiaotong Univ, Sch Comp & Artificial Intelligence, Chengdu 611756, Peoples R China
Southwest Jiaotong Univ, Natl Engn Lab Integrated Transportat Big Data Appl, Chengdu 611756, Peoples R China
Minist Educ, Engn Res Ctr Sustainable Urban Intelligent Transpo, Chengdu 611756, Peoples R China
Southwest Jiaotong Univ, Key Lab Sichuan Prov, Mfg Ind Chains Collaborat & Informat Support Techn, Chengdu 611756, Peoples R ChinaSouthwest Jiaotong Univ, Sch Comp & Artificial Intelligence, Chengdu 611756, Peoples R China
Mi, Yong
Luo, Chuan
论文数: 0引用数: 0
h-index: 0
机构:
Sichuan Univ, Coll Comp Sci, Chengdu 610065, Peoples R ChinaSouthwest Jiaotong Univ, Sch Comp & Artificial Intelligence, Chengdu 611756, Peoples R China
Luo, Chuan
Horng, Shi-Jinn
论文数: 0引用数: 0
h-index: 0
机构:
Asia Univ, Dept Comp Sci & Informat Engn, Taichung 41354, Taiwan
China Med Univ, China Med Univ Hosp, Dept Med Res, Taichung, TaiwanSouthwest Jiaotong Univ, Sch Comp & Artificial Intelligence, Chengdu 611756, Peoples R China
Horng, Shi-Jinn
Li, Tianrui
论文数: 0引用数: 0
h-index: 0
机构:
Southwest Jiaotong Univ, Sch Comp & Artificial Intelligence, Chengdu 611756, Peoples R China
Southwest Jiaotong Univ, Natl Engn Lab Integrated Transportat Big Data Appl, Chengdu 611756, Peoples R China
Minist Educ, Engn Res Ctr Sustainable Urban Intelligent Transpo, Chengdu 611756, Peoples R China
Southwest Jiaotong Univ, Key Lab Sichuan Prov, Mfg Ind Chains Collaborat & Informat Support Techn, Chengdu 611756, Peoples R ChinaSouthwest Jiaotong Univ, Sch Comp & Artificial Intelligence, Chengdu 611756, Peoples R China