Deep LSAC for Fine-Grained Recognition

Cited by: 7
Authors
Lin, Di [1]
Wang, Yi [2]
Liang, Lingyu [3]
Li, Ping [4]
Chen, C. L. Philip [5, 6, 7]
Affiliations
[1] Tianjin Univ, Coll Intelligence & Comp, Tianjin 300072, Peoples R China
[2] Shenzhen Univ, Sch Biomed Engn, Shenzhen 518060, Peoples R China
[3] South China Univ Technol, Sch Elect & Informat Engn, Guangzhou 510641, Guangdong, Peoples R China
[4] Hong Kong Polytech Univ, Dept Comp, Hong Kong, Peoples R China
[5] South China Univ Technol, Sch Comp Sci & Engn, Guangzhou 510006, Peoples R China
[6] Dalian Maritime Univ, Nav Coll, Dalian 116026, Peoples R China
[7] Univ Macau, Fac Sci & Technol, Macau 999078, Peoples R China
Keywords
Shape; Training; Image segmentation; Task analysis; Neural networks; Adaptation models; Semantics; Convolutional neural network (CNN); fine-grained recognition; object detection; pose alignment; semantic segmentation; SEGMENTATION;
DOI
10.1109/TNNLS.2020.3027603
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Fine-grained recognition emphasizes the identification of subtle differences among object categories given objects that appear in different shapes and poses. These variances should be reduced for reliable recognition. We propose a fine-grained recognition system that incorporates localization, segmentation, alignment, and classification in a unified deep neural network. The input to the classification module includes functions that enable backward-propagation (BP) in constructing the solver. Our major contribution is to propose a valve linkage function (VLF) for BP chaining and form our deep localization, segmentation, alignment, and classification (LSAC) system. The VLF can adaptively compromise errors of classification and alignment when training the LSAC model. It in turn helps to update the localization and segmentation. We evaluate our framework on two widely used fine-grained object data sets. The performance confirms the effectiveness of our LSAC system.
Pages: 200-214
Page count: 15