Data-driven two-layer visual dictionary structure learning

Cited by: 3
Authors
Yu, Xiangchun [1]
Yu, Zhezhou [1]
Wu, Lei [1]
Pang, Wei [2]
Lin, Chenghua [2]
Affiliations
[1] Jilin Univ, Coll Comp Sci & Technol, Dept Computat Intelligence, Changchun, Jilin, Peoples R China
[2] Univ Aberdeen, Sch Nat & Comp Sci, Dept Comp Sci, Meston Bldg, Aberdeen, Scotland
Keywords
statistical modeling; overfitting; visual dictionary; Bayesian nonparametric model; deep learning; latent Dirichlet allocation; hierarchical model; words; bag; representation; features
DOI
10.1117/1.JEI.28.2.023006
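Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification code
0808; 0809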
Abstract
An important issue in statistical modeling is determining model complexity from the scale of the data, so that overfitting can be mitigated effectively when big data are not available. We adopt a data-driven approach that automatically determines the number of model components. To extract more robust features, we propose a framework for data-driven two-layer structure visual dictionary learning (DTSVDL). It divides visual dictionary structure learning into two levels: the attribute layer and the detail layer. In the attribute layer, the attributes of the image dataset are learned by a data-driven Bayesian nonparametric model. In the detail layer, the detailed information over the attributes is further explored and refined, and the attributes are weighted by the number of effective observations associated with each attribute. The proposed approach has three main advantages: (1) the two-layer structure makes the learned visual dictionary more expressive; (2) the number of components in the attribute layer is determined automatically from the data; and (3) because the components are determined according to the scale of the visual words, the model mitigates overfitting well. In comparisons with stacked autoencoders, stacked denoising autoencoders, LeNet-5, speeded-up robust features, and the pretrained deep learning model ImageNet-VGG-F, our approach achieves satisfactory image categorization results on two benchmark datasets. Specifically, it achieves higher categorization performance than the classical approaches on the 15 scene categories and action datasets. We conclude that the resulting DTSVDL generalizes well because of the attribute information and discriminates well because of the detailed information. In other words, the visual dictionary learned by our algorithm is more expressive and discriminative.
(C) 2019 SPIE and IS&T
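The abstract does not specify which Bayesian nonparametric model the attribute layer uses. As a rough illustration of the two ideas it describes (letting the data decide the number of components, then weighting each attribute by its number of observations), the sketch below substitutes DP-means, a k-means-like algorithm derived from the Dirichlet process mixture model. This is a stand-in technique, not the authors' method; the function names, the toy 2-D "descriptors", and the `penalty` value are all assumptions made for the example.

```python
import random


def sq_dist(a, b):
    """Squared Euclidean distance between two points."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def centroid(points):
    """Coordinate-wise mean of a non-empty list of points."""
    dims = len(points[0])
    return tuple(sum(p[d] for p in points) / len(points) for d in range(dims))


def dp_means(points, penalty, max_iter=100):
    """DP-means (stand-in for the paper's unspecified nonparametric model).

    Works like k-means, except that a point whose squared distance to every
    existing center exceeds `penalty` opens a new cluster, so the number of
    clusters is determined by the data rather than fixed in advance.
    """
    centers = [centroid(points)]
    assign = [0] * len(points)
    for _ in range(max_iter):
        changed = False
        for i, p in enumerate(points):
            dists = [sq_dist(p, c) for c in centers]
            k = min(range(len(centers)), key=dists.__getitem__)
            if dists[k] > penalty:
                centers.append(tuple(p))  # open a new cluster at this point
                k = len(centers) - 1
            if assign[i] != k:
                assign[i] = k
                changed = True
        # Recompute centers, dropping clusters that lost all their members.
        members = [[] for _ in centers]
        for p, k in zip(points, assign):
            members[k].append(p)
        keep = [k for k, m in enumerate(members) if m]
        remap = {old: new for new, old in enumerate(keep)}
        centers = [centroid(members[k]) for k in keep]
        assign = [remap[k] for k in assign]
        if not changed:
            break
    return centers, assign


def attribute_weights(assign, n_clusters):
    """Weight each cluster by its share of the observations (hypothetical
    reading of 'weighted by the number of effective observations')."""
    counts = [0] * n_clusters
    for k in assign:
        counts[k] += 1
    return [c / len(assign) for c in counts]


# Toy data: three well-separated groups of 20, 30, and 50 points.
random.seed(0)
blobs = [((0.0, 0.0), 20), ((10.0, 0.0), 30), ((0.0, 10.0), 50)]
points = [
    (cx + random.uniform(-0.5, 0.5), cy + random.uniform(-0.5, 0.5))
    for (cx, cy), n in blobs
    for _ in range(n)
]

centers, assign = dp_means(points, penalty=9.0)
weights = attribute_weights(assign, len(centers))
print(len(centers), weights)
```

On this toy data the number of clusters is never passed in; it follows from `penalty`, which plays a role loosely analogous to a concentration parameter. A smaller penalty yields more, finer clusters, and a larger one yields fewer.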
Pages: 15