Unsupervised image categorization and object localization using topic models and correspondences between images

被引:0
|
作者
Liu, David [1 ]
Chen, Tsuhan [1 ]
机构
[1] Carnegie Mellon Univ, Dept Elect & Comp Engn, Pittsburgh, PA 15213 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Topic models from the text understanding literature have shown promising results in unsupervised image categorization and object localization. Categories are treated as topics, and words are formed by vector quantizing local descriptors of image patches. Limitations of topic models include their weakness in localizing objects, and the requirement of a fairly large proportion of words coming from the object. We present a new approach that employs correspondences between images to provide information about object configuration, which in turn enhances the reliability of object localization and categorization. This approach is efficient, as it requires only a small number of correspondences. We show improved categorization and localization performance on real and synthetic data. Moreover, we can push the limits of topic models when the proportion of words coming from the object is very low.
引用
收藏
页码:191 / 197
页数:7
相关论文
共 50 条
  • [31] SCENE INTERPRETATION FOR SAR IMAGES USING SUPERVISED TOPIC MODELS
    Liu, Bin
    Wang, Huanyu
    Wang, Kaizhi
    Liu, Xingzhao
    Yu, Wenxian
    2011 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2011, : 3807 - 3810
  • [32] PromptMTopic: Unsupervised Multimodal Topic Modeling of Memes using Large Language Models
    Prakash, Nirmalendu
    Wang, Han
    Hoang, Nguyen Khoi
    Hee, Ming Shan
    Lee, Roy Ka-Wei
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 621 - 631
  • [33] Localization of a mobile robot using the image of a moving object
    Lee, JM
    Son, K
    Lee, MC
    Choi, JC
    Han, SH
    Lee, MH
    IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2003, 50 (03) : 612 - 619
  • [34] Image categorization using Fisher kernels of non-iid image models
    Cinbis, Ramazan Gokberk
    Verbeek, Jakob
    Schmid, Cordelia
    2012 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2012, : 2184 - 2191
  • [35] CONTROLLING ROBOTS USING IMAGE PROCESSING FOR OBJECT LOCALIZATION
    Neustupa, Zdenek
    Neustupa, Zdenek
    GEOCONFERENCE ON INFORMATICS, GEOINFORMATICS AND REMOTE SENSING - CONFERENCE PROCEEDINGS, VOL I, 2013, : 53 - 60
  • [36] Cross-domain object detection using unsupervised image translation
    Arruda, Vinicius F.
    Berriel, Rodrigo F.
    Paixao, Thiago M.
    Badue, Claudine
    De Souza, Alberto F.
    Sebe, Nicu
    Oliveira-Santos, Thiago
    EXPERT SYSTEMS WITH APPLICATIONS, 2022, 192
  • [37] Unsupervised Object Discovery from Images by Mining Local Features Using Hashing
    Pineda, Gibran Fuentes
    Koga, Hisashi
    Watanabe, Toshinori
    PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS, COMPUTER VISION, AND APPLICATIONS, PROCEEDINGS, 2009, 5856 : 978 - 985
  • [38] Retrieval of images using Pixel based object models
    Mukunoki, M
    Minoh, M
    Ikeda, K
    ADVANCES IN INTELLIGENT COMPUTING - IPMU '94, 1995, 945 : 557 - 566
  • [39] Multi-class binary object categorization using blurred shape models
    Escalera, Sergio
    Fornes, Alicia
    Pujol, Oriol
    Llados, Josep
    Radeva, Petia
    PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS AND APPLICATIONS, PROCEEDINGS, 2007, 4756 : 142 - 151
  • [40] Effective Object Localization in Images by Calculating Ratio and Distance Between Pixels
    Bohush, Rykhard
    Ablameyko, Sergey
    Adamovskiy, Yahor
    Glukhov, Dmitry
    PROCEEDINGS OF THE 1ST INTERNATIONAL WORKSHOP ON INTELLIGENT INFORMATION TECHNOLOGIES & SYSTEMS OF INFORMATION SECURITY (INTELITSIS 2020), VOL 1, 2020, 2623 : 57 - 66