Vision Transformer (ViT) has achieved promising results in single-label image classification compared with conventional neural network-based models. Nevertheless, few ViT-related studies have explored label dependencies in multi-label image recognition. To this end, we propose STMG, which combines a transformer with a graph convolutional network (GCN) to extract image features and learn label dependencies for multi-label image recognition. STMG consists of an image representation learning module and a label co-occurrence embedding module. First, in the image representation learning module, to avoid computing the similarity between every pair of patches, we adopt Swin Transformer instead of ViT to generate the image feature for each input image. Second, in the label co-occurrence embedding module, we design a two-layer GCN that adaptively captures label dependencies and outputs the label co-occurrence embeddings. Finally, STMG fuses the image feature with the label co-occurrence embeddings to produce the classification results, and is trained with the commonly used multi-label classification loss function and an L2-norm loss function. We conduct extensive experiments on two multi-label image datasets, MS-COCO and FLICKR25K. Experimental results demonstrate that STMG achieves better convergence efficiency and classification results than state-of-the-art multi-label image recognition methods. Our code is open-sourced and publicly available on GitHub: https://github.com/lzHZWZ/STMG.
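The fusion step described above can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: it assumes the two-layer GCN turns label embeddings into one classifier vector per label and that fusion is a dot product with the image feature, as is common in GCN-based multi-label models. The image feature is a random placeholder standing in for the Swin Transformer output, the adjacency matrix is a fixed toy co-occurrence graph rather than an adaptively learned one, and all function names and dimensions are hypothetical. The L2-norm loss is omitted because the abstract does not specify its form.

```python
import numpy as np


def normalize_adjacency(adj):
    """Symmetric normalization D^{-1/2} (A + I) D^{-1/2} of a label graph."""
    a_hat = adj + np.eye(adj.shape[0])          # add self-loops
    d_inv_sqrt = 1.0 / np.sqrt(a_hat.sum(axis=1))
    return a_hat * d_inv_sqrt[:, None] * d_inv_sqrt[None, :]


def gcn_label_classifiers(label_emb, adj, w1, w2, slope=0.2):
    """Two-layer GCN: label embeddings (C, d) -> label classifiers (C, D)."""
    a_norm = normalize_adjacency(adj)
    hidden = a_norm @ label_emb @ w1
    hidden = np.where(hidden > 0, hidden, slope * hidden)  # LeakyReLU
    return a_norm @ hidden @ w2


def fuse(image_feat, classifiers):
    """Fuse image features (B, D) with label classifiers (C, D) -> logits (B, C)."""
    return image_feat @ classifiers.T


def multilabel_bce(logits, targets):
    """Numerically stable binary cross-entropy averaged over all labels."""
    per_label = (np.maximum(logits, 0) - logits * targets
                 + np.log1p(np.exp(-np.abs(logits))))
    return per_label.mean()


# Toy sizes (assumptions): 4 labels, 8-d label embeddings, 32-d image feature.
rng = np.random.default_rng(0)
num_labels, emb_dim, hidden_dim, feat_dim, batch = 4, 8, 16, 32, 2

adj = (rng.random((num_labels, num_labels)) > 0.5).astype(float)
adj = np.maximum(adj, adj.T)                    # make the graph symmetric
np.fill_diagonal(adj, 0.0)                      # self-loops are added later

label_emb = rng.normal(size=(num_labels, emb_dim))
w1 = rng.normal(size=(emb_dim, hidden_dim)) * 0.1
w2 = rng.normal(size=(hidden_dim, feat_dim)) * 0.1

image_feat = rng.normal(size=(batch, feat_dim))  # placeholder for Swin output
targets = rng.integers(0, 2, size=(batch, num_labels)).astype(float)

classifiers = gcn_label_classifiers(label_emb, adj, w1, w2)
logits = fuse(image_feat, classifiers)
loss = multilabel_bce(logits, targets)
print(logits.shape, float(loss))
```

In this sketch each row of `classifiers` scores one label, so labels that are connected in the graph share information through the normalized adjacency before they are compared with the image feature.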