Approximate Image Matching using Strings of Bag-of-Visual Words Representation

被引:0
|
作者
Hong Thinh Nguyen [1 ]
Barat, Cecile
Ducottet, Christophe
机构
[1] Univ Lyon, F-42023 St Etienne, France
关键词
Edit Distance; String of Histograms; Bag-of-Visual Words; Image Classification; EDIT-DISTANCE;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The Spatial Pyramid Matching approach has become very popular to model images as sets of local bag-of-words. The image comparison is then done region-by-region with an intersection kernel. Despite its success, this model presents some limitations: the grid partitioning is predefined and identical for all images and the matching is sensitive to intra-and inter-class variations. In this paper, we propose a novel approach based on approximate string matching to overcome these limitations and improve the results. First, we introduce a new image representation as strings of ordered bag-of-words. Second, we present a new edit distance specifically adapted to strings of histograms in the context of image comparison. This distance identifies local alignments between subregions and allows to remove sequences of similar subregions to better match two images. Experiments on 15 Scenes and Caltech 101 show that the proposed approach outperforms the classical spatial pyramid representation and most existing concurrent methods for classification presented in recent years.
引用
收藏
页码:345 / 353
页数:9
相关论文
共 50 条
  • [21] Performance evaluation of large-scale object recognition system using bag-of-visual words model
    Kim, Min-Uk
    Yoon, Kyoungro
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2015, 74 (07) : 2499 - 2517
  • [22] A Hamming Embedding Kernel with Informative Bag-of-Visual Words for Video Semantic Indexing
    Wang, Feng
    Zhao, Wan-Lei
    Ngo, Chong-Wah
    Merialdo, Bernard
    [J]. ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2014, 10 (03)
  • [23] Improvement the Bag of Words Image Representation Using Spatial Information
    Farhangi, Mohammad Mehdi
    Soryani, Mohsen
    Fathy, Mahmood
    [J]. ADVANCES IN COMPUTING AND INFORMATION TECHNOLOGY, VOL 2, 2013, 177 : 681 - 690
  • [24] Bag-of-Visual Words for Word-Wise Video Script Identification: A Study
    Sharma, Nabin
    Mandal, Ranju
    Sharma, Rabi
    Pal, Umapada
    Blumenstein, Michael
    [J]. 2015 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2015,
  • [25] Image Classification Model Using Visual Bag of Semantic Words
    Qi, Yali
    Zhang, Guoshan
    Li, Yeli
    [J]. PATTERN RECOGNITION AND IMAGE ANALYSIS, 2019, 29 (03) : 404 - 414
  • [26] Offline Signature Verification Based on Bag-of-Visual Words Model Using KAZE Features and Weighting Schemes
    Okawa, Manabu
    [J]. PROCEEDINGS OF 29TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, (CVPRW 2016), 2016, : 252 - 258
  • [27] Image Retrieval using Extended Bag-of-Visual-Words
    Bhattacharya, Nandita
    Sil, Jaya
    [J]. 2016 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2016, : 1969 - 1975
  • [28] Image Classification Model Using Visual Bag of Semantic Words
    Yali Qi
    Guoshan Zhang
    Yeli Li
    [J]. Pattern Recognition and Image Analysis, 2019, 29 : 404 - 414
  • [29] ON-ROAD VEHICLE CLASSIFICATION BASED ON RANDOM NEURAL NETWORK AND BAG-OF-VISUAL WORDS
    Hussain, Khaled F.
    Moussa, Ghada S.
    [J]. PROBABILITY IN THE ENGINEERING AND INFORMATIONAL SCIENCES, 2016, 30 (03) : 403 - 412
  • [30] Region Matching Techniques for Spatial Bag of Visual Words Based Image Category Recognition
    Viitaniemi, Ville
    Laaksonen, Jorma
    [J]. ARTIFICIAL NEURAL NETWORKS-ICANN 2010, PT I, 2010, 6352 : 531 - 540