TANet: Transformer-based asymmetric network for RGB-D salient object detection
被引:6
|
作者:
Liu, Chang
论文数: 0引用数: 0
h-index: 0
机构:
Northeastern Univ, Shenyang, Liaoning, Peoples R ChinaNortheastern Univ, Shenyang, Liaoning, Peoples R China
Liu, Chang
[1
]
Yang, Gang
论文数: 0引用数: 0
h-index: 0
机构:
Northeastern Univ, Shenyang, Liaoning, Peoples R China
Northeastern Univ, Wenhua Rd, Shenyang 110000, Liaoning, Peoples R ChinaNortheastern Univ, Shenyang, Liaoning, Peoples R China
Yang, Gang
[1
,3
]
Wang, Shuo
论文数: 0引用数: 0
h-index: 0
机构:
Northeastern Univ, Shenyang, Liaoning, Peoples R ChinaNortheastern Univ, Shenyang, Liaoning, Peoples R China
Wang, Shuo
[1
]
Wang, Hangxu
论文数: 0引用数: 0
h-index: 0
机构:
Northeastern Univ, Shenyang, Liaoning, Peoples R China
DUT Artificial Intelligence Inst, Dalian, Peoples R ChinaNortheastern Univ, Shenyang, Liaoning, Peoples R China
Wang, Hangxu
[1
,2
]
Zhang, Yunhua
论文数: 0引用数: 0
h-index: 0
机构:
Northeastern Univ, Shenyang, Liaoning, Peoples R ChinaNortheastern Univ, Shenyang, Liaoning, Peoples R China
Zhang, Yunhua
[1
]
Wang, Yutao
论文数: 0引用数: 0
h-index: 0
机构:
Northeastern Univ, Shenyang, Liaoning, Peoples R ChinaNortheastern Univ, Shenyang, Liaoning, Peoples R China
Wang, Yutao
[1
]
机构:
[1] Northeastern Univ, Shenyang, Liaoning, Peoples R China
[2] DUT Artificial Intelligence Inst, Dalian, Peoples R China
[3] Northeastern Univ, Wenhua Rd, Shenyang 110000, Liaoning, Peoples R China
Existing RGB-D salient object detection methods mainly rely on a symmetric two-stream Convolutional Neural Network (CNN)-based network to extract RGB and depth channel features separately. However, there are two problems with the symmetric conventional network structure: first, the ability of CNN in learning global contexts is limited; second, the symmetric two-stream structure ignores the inherent differences between modalities. In this study, a Transformer-based asymmetric network is proposed to tackle the issues mentioned above. The authors employ the powerful feature extraction capability of Transformer to extract global semantic information from RGB data and design a lightweight CNN backbone to extract spatial structure information from depth data without pre-training. The asymmetric hybrid encoder effectively reduces the number of parameters in the model while increasing speed without sacrificing performance. Then, a cross-modal feature fusion module which enhances and fuses RGB and depth features with each other is designed. Finally, the authors add edge prediction as an auxiliary task and propose an edge enhancement module to generate sharper contours. Extensive experiments demonstrate that our method achieves superior performance over 14 state-of-the-art RGB-D methods on six public datasets. The code of the authors will be released at .
机构:
Civil Aviat Univ China, Coll Comp Sci & Technol, Tianjin 300300, Peoples R China
Three Gorges Univ, Hubei Key Lab Intelligent Vis Based Monitoring Hy, Yichang 443002, Peoples R ChinaCivil Aviat Univ China, Coll Comp Sci & Technol, Tianjin 300300, Peoples R China
Huang, Rui
Xing, Yan
论文数: 0引用数: 0
h-index: 0
机构:
Civil Aviat Univ China, Coll Comp Sci & Technol, Tianjin 300300, Peoples R ChinaCivil Aviat Univ China, Coll Comp Sci & Technol, Tianjin 300300, Peoples R China
Xing, Yan
Zou, Yaobin
论文数: 0引用数: 0
h-index: 0
机构:
Three Gorges Univ, Hubei Key Lab Intelligent Vis Based Monitoring Hy, Yichang 443002, Peoples R ChinaCivil Aviat Univ China, Coll Comp Sci & Technol, Tianjin 300300, Peoples R China
机构:
Henan Univ, Sch Artificial Intelligence, Zhengzhou 450046, Peoples R China
Int Joint Res Lab Cooperat Vehicular Networks Hena, Zhengzhou 450046, Peoples R ChinaHenan Univ, Sch Artificial Intelligence, Zhengzhou 450046, Peoples R China
Du, Haishun
Zhang, Zhen
论文数: 0引用数: 0
h-index: 0
机构:
Henan Univ, Sch Artificial Intelligence, Zhengzhou 450046, Peoples R China
Int Joint Res Lab Cooperat Vehicular Networks Hena, Zhengzhou 450046, Peoples R ChinaHenan Univ, Sch Artificial Intelligence, Zhengzhou 450046, Peoples R China
Zhang, Zhen
Zhang, Minghao
论文数: 0引用数: 0
h-index: 0
机构:
Henan Univ, Sch Artificial Intelligence, Zhengzhou 450046, Peoples R ChinaHenan Univ, Sch Artificial Intelligence, Zhengzhou 450046, Peoples R China
Zhang, Minghao
Qiao, Kangyi
论文数: 0引用数: 0
h-index: 0
机构:
Henan Univ, Sch Artificial Intelligence, Zhengzhou 450046, Peoples R ChinaHenan Univ, Sch Artificial Intelligence, Zhengzhou 450046, Peoples R China
机构:
Shanghai Institute for Advanced Communication and Data Science, Shanghai University, Shanghai, ChinaShanghai Institute for Advanced Communication and Data Science, Shanghai University, Shanghai, China
Li, Gongyang
Liu, Zhi
论文数: 0引用数: 0
h-index: 0
机构:
Shanghai Institute for Advanced Communication and Data Science, Shanghai University, Shanghai, ChinaShanghai Institute for Advanced Communication and Data Science, Shanghai University, Shanghai, China
Liu, Zhi
Chen, Minyu
论文数: 0引用数: 0
h-index: 0
机构:
Shanghai Institute for Advanced Communication and Data Science, Shanghai University, Shanghai, ChinaShanghai Institute for Advanced Communication and Data Science, Shanghai University, Shanghai, China
Chen, Minyu
Bai, Zhen
论文数: 0引用数: 0
h-index: 0
机构:
Shanghai Institute for Advanced Communication and Data Science, Shanghai University, Shanghai, ChinaShanghai Institute for Advanced Communication and Data Science, Shanghai University, Shanghai, China
Bai, Zhen
Lin, Weisi
论文数: 0引用数: 0
h-index: 0
机构:
Department of Computer Science, Stony Brook University, Stony Brook,NY, United StatesShanghai Institute for Advanced Communication and Data Science, Shanghai University, Shanghai, China
Lin, Weisi
Ling, Haibin
论文数: 0引用数: 0
h-index: 0
机构:
Department of Computer Science, Stony Brook University, Stony Brook,NY, United StatesShanghai Institute for Advanced Communication and Data Science, Shanghai University, Shanghai, China
机构:
Beijing Univ Technol, Fac Informat Technol, Beijing, Peoples R ChinaBeijing Univ Technol, Fac Informat Technol, Beijing, Peoples R China
Liang, Fangfang
Duan, Lijuan
论文数: 0引用数: 0
h-index: 0
机构:
Beijing Univ Technol, Fac Informat Technol, Beijing, Peoples R China
Beijing Key Lab Trusted Comp, Beijing, Peoples R China
Natl Engn Lab Crit Technol Informat Secur Classif, Beijing, Peoples R ChinaBeijing Univ Technol, Fac Informat Technol, Beijing, Peoples R China
Duan, Lijuan
Ma, Wei
论文数: 0引用数: 0
h-index: 0
机构:
Beijing Univ Technol, Fac Informat Technol, Beijing, Peoples R China
Beijing Key Lab Trusted Comp, Beijing, Peoples R China
Natl Engn Lab Crit Technol Informat Secur Classif, Beijing, Peoples R ChinaBeijing Univ Technol, Fac Informat Technol, Beijing, Peoples R China
Ma, Wei
Qiao, Yuanhua
论文数: 0引用数: 0
h-index: 0
机构:
Beijing Univ Technol, Coll Appl Sci, Beijing, Peoples R ChinaBeijing Univ Technol, Fac Informat Technol, Beijing, Peoples R China
Qiao, Yuanhua
Miao, Jun
论文数: 0引用数: 0
h-index: 0
机构:
Beijing Informat Sci & Technol Univ, Sch Comp Sci, Beijing Key Lab Internet Culture & Digital Dissem, Beijing, Peoples R ChinaBeijing Univ Technol, Fac Informat Technol, Beijing, Peoples R China
Miao, Jun
Ye, Qixiang
论文数: 0引用数: 0
h-index: 0
机构:
Univ Chinese Acad Sci, Beijing, Peoples R ChinaBeijing Univ Technol, Fac Informat Technol, Beijing, Peoples R China