Improving Object Recognition of CNNs with Multiple Queries and HMMs

被引:1
|
作者
Czuni, Laszlo [1 ]
Nagy, Amr M. [1 ]
机构
[1] Univ Pannonia, Egyet Str 10, Veszprem, Hungary
关键词
Computer vision; object recognition; VGG16; Hidden Markov Model; information fusion;
D O I
10.1117/12.2559393
中图分类号
O43 [光学];
学科分类号
070207 ; 0803 ;
摘要
In our paper we combine neural networks with Hidden Markov Models for multiview object recognition. While convolutional neural networks are very efficient in object recognition there is still need for improvements in many practical cases. For example if the training is not satisfactory or the object localization is not solved with the neural network then information fusion from several images and from inertial sensors can still help a lot to improve recognition rate. In our use case we are to recognize objects from several directions with the VGG16 network. We assume that no localization of objects is possible on the images due to the lack of bounding box annotations, we have to recognize the objects even if they occupy only about 25% of the field of view. To overcome this problem we propose to use a Hidden Markov Model approach where the consecutive queries, shots taken from different viewing directions, are first evaluated with VGG16 inference and then with the Viterbi algorithm. The role of the later is to estimate the most probable sequence of poses of candidates (from the predefined 8 horizontal views in our experiments), thus we can select the most probable object. The approach, as evaluated with different number of queries over a set of 40 objects from the COIL-100 dataset, can result in significant increase of hit rate compared to one shot recognition or to combining individual shots without the HMM model.
引用
收藏
页数:7
相关论文
共 50 条
  • [41] Moving Object Detection With Deep CNNs
    Zhu, Haidi
    Yan, Xin
    Tang, Hongying
    Chang, Yuchao
    Li, Baoqing
    Yuan, Xiaobing
    IEEE ACCESS, 2020, 8 : 29729 - 29741
  • [42] Heterogeneous Face Recognition with CNNs
    Saxena, Shreyas
    Verbeek, Jakob
    COMPUTER VISION - ECCV 2016 WORKSHOPS, PT III, 2016, 9915 : 483 - 491
  • [43] DetTrack: An Algorithm for Multiple Object Tracking by Improving Occlusion Object Detection
    Gao, Xinyue
    Wang, Zhengyou
    Wang, Xiaofan
    Zhang, Shuo
    Zhuang, Shanna
    Wang, Hui
    ELECTRONICS, 2024, 13 (01)
  • [44] Object Recognition by Integrating Multiple Image Segmentations
    Pantofaru, Caroline
    Schmid, Cordelia
    Hebert, Martial
    COMPUTER VISION - ECCV 2008, PT III, PROCEEDINGS, 2008, 5304 : 481 - +
  • [45] Multiple spatial pooling for visual object recognition
    Huang, Yongzhen
    Wu, Zifeng
    Wang, Liang
    Song, Chunfeng
    NEUROCOMPUTING, 2014, 129 : 225 - 231
  • [46] Incremental Multiple Kernel Learning for Object Recognition
    Kembhavi, Aniruddha
    Siddiquie, Behjat
    Miezianko, Roland
    McCloskey, Scott
    Davis, Larry S.
    2009 IEEE 12TH INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2009, : 638 - 645
  • [47] Integration of multiple methods for robust object recognition
    Mansur, A.
    Kuno, Yoshinori
    PROCEEDINGS OF SICE ANNUAL CONFERENCE, VOLS 1-8, 2007, : 1985 - 1990
  • [48] Multiple Visual Object Recognition For Poster Detection
    Kuzhan, Abdullah
    Ozden, Kemal Egemen
    ICECCO'12: 9TH INTERNATIONAL CONFERENCE ON ELECTRONICS, COMPUTER AND COMPUTATION, 2012, : 301 - 304
  • [49] Generic object recognition using multiple representations
    Das, S
    Bhanu, B
    Ho, CC
    IMAGE AND VISION COMPUTING, 1996, 14 (05) : 323 - 338
  • [50] Integrating multiple model views for object recognition
    Ferrari, V
    Tuytelaars, T
    Van Gool, L
    PROCEEDINGS OF THE 2004 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 2, 2004, : 105 - 112