Transportation Object Detection with Bag of Visual Words Model by PLSA and MLP

被引:3
|
作者
Song, Hyun Chul [1 ]
Choi, Kwang Nam [1 ]
机构
[1] Chung Ang Univ, Sch Comp Sci & Engn, Chung Ang, South Korea
来源
MOBILE NETWORKS & APPLICATIONS | 2018年 / 23卷 / 04期
基金
新加坡国家研究基金会;
关键词
Transportation detection; Bag of visual words; Multi-layer perceptron; Probabilistic latent semantic analysis; Scale-invariant feature transform; FRAMEWORK; FEATURES;
D O I
10.1007/s11036-018-1075-2
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Visual big data is an essential and significant research topic, due to its diverse applications. In this paper, a new visual detection method for transportation is proposed based on probabilistic latent semantic analysis with visual data. We detect the distinctiveness by integrating three steps as follows: first, representing the co-ocurrence matrix of images, which were vectorized using the bag of visual words (BoVW) framework; then calculating the histograms of the visual words of each class; and finally applying the test images as the visual words. A multilayer perceptron (MLP) is used as the classification method in our system. The visual words are extracted by sampling the patches from the current image. A new topology of the neural network for the BoVW model is proposed, and management of the learning rate by reducing at specific iterations is exploited. The Probabilistic latent semantic analysis (PLSA) is compared to the MLP using the Caltech 256 datasets. The classes used include cars, motorbikes, and horses. The results of the experiment show that the MLP outperforms current methods in predicting transportation objects, and properly approximates the transportation detection function with extracted local features. It shows that the proposed method yields about 4.4% higher accuracy than the conventional PLSA for all classes.
引用
收藏
页码:1103 / 1110
页数:8
相关论文
共 50 条
  • [1] Transportation Object Detection with Bag of Visual Words Model by PLSA and MLP
    Hyun Chul Song
    Kwang Nam Choi
    Mobile Networks and Applications, 2018, 23 : 1103 - 1110
  • [2] Bag of Visual Words Method based on PLSA and Chi-Square Model for Object Category
    Zhao, Yongwei
    Peng, Tianqiang
    Li, Bicheng
    Ke, Shengcai
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2015, 9 (07): : 2633 - 2648
  • [3] Retrieval of pathological retina images using Bag of Visual Words and pLSA model
    Sreejini, K. S.
    Govindan, V. K.
    ENGINEERING SCIENCE AND TECHNOLOGY-AN INTERNATIONAL JOURNAL-JESTECH, 2019, 22 (03): : 777 - 785
  • [4] A Novel Bag-of-Visual-Words Approach for Geospatial Object Detection
    Aytekin, Caglar
    Alatan, A. Aydin
    OPTICAL PATTERN RECOGNITION XXII, 2011, 8055
  • [5] An object detection and classification method for underwater visual images based on the bag-of-words model
    Zhang, Tianchi
    Li, Qian
    Liu, Xing
    PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART M-JOURNAL OF ENGINEERING FOR THE MARITIME ENVIRONMENT, 2023, 237 (02) : 487 - 497
  • [6] WEIGHTED BAG OF VISUAL WORDS FOR OBJECT RECOGNITION
    San Biagio, Marco
    Bazzani, Loris
    Cristani, Marco
    Murino, Vittorio
    2014 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2014, : 2734 - 2738
  • [7] CELLULAR AUTOMATA BAG OF VISUAL WORDS FOR OBJECT RECOGNITION
    Mironical, Ionut
    Ionescu, Bogdan
    Dogaru, Radu
    UNIVERSITY POLITEHNICA OF BUCHAREST SCIENTIFIC BULLETIN SERIES C-ELECTRICAL ENGINEERING AND COMPUTER SCIENCE, 2015, 77 (04): : 107 - 118
  • [8] Extended Bag of Visual Words for Face Detection
    Montazer, Gholam Ali
    Soltanshahi, Mohammad Ali
    Giveki, Davar
    ADVANCES IN COMPUTATIONAL INTELLIGENCE, PT I (IWANN 2015), 2015, 9094 : 503 - 510
  • [9] Learning Bag of Visual Words for Motorbike Detection
    Ngoc Dung Thai
    Thanh Sach Le
    Nam Thoai
    Hamamoto, Kazuhiko
    2014 13TH INTERNATIONAL CONFERENCE ON CONTROL AUTOMATION ROBOTICS & VISION (ICARCV), 2014, : 1045 - 1050
  • [10] Saliency map driven image retrieval combining the bag-of-words model and PLSA
    Giouvanakis, Emmanouil
    Kotropoulos, Constantine
    2014 19TH INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING (DSP), 2014, : 280 - 285