Data-driven two-layer visual dictionary structure learning

被引:3
|
作者
Yu, Xiangchun [1 ]
Yu, Zhezhou [1 ]
Wu, Lei [1 ]
Pang, Wei [2 ]
Lin, Chenghua [2 ]
机构
[1] Jilin Univ, Coll Comp Sci & Technol, Dept Computat Intelligence, Changchun, Jilin, Peoples R China
[2] Univ Aberdeen, Sch Nat & Comp Sci, Dept Comp Sci, Meston Bldg, Aberdeen, Scotland
关键词
statistical modeling; overfitting; visual dictionary; Bayesian nonparametric model; deep learning; LATENT DIRICHLET ALLOCATION; HIERARCHICAL MODEL; WORDS; BAG; REPRESENTATION; FEATURES;
D O I
10.1117/1.JEI.28.2.023006
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
An important issue in statistical modeling is to determine the complexity of the model based on the scale of data so as to effectively mitigate the model's overfitting problems without big data. We adopt a data-driven approach to automatically determine the number of components of the model. In order to better extract robust features, we propose a framework of data-driven two-layer structure visual dictionary learning (DTSVDL). It works by dividing the visual dictionary structure learning into two levels: the attribute layer and the detail layer. In the attribute layer, the attributes of the image dataset are learned, and these attributes are obtained by a data-driven Bayesian nonparametric model. Then, in the detail layer, the detailed information over attributes is further explored and refined, and the attributes are weighted by the number of effective observations associated with each attribute. Our proposed approach has three main advantages: (1) the two-layer structure makes our building visual dictionary be more expressive; (2) the number of components in the attribute layer can be determined automatically from the data; (3) the components are automatically determined based on the scale of visual words; therefore, our model can well mitigate the overfitting problem. In addition, by comparing with stacked autoencoders, stacked denoising autoencoders, LeNet-5, speeded-up robust features, and pretrained deep learning model ImageNet-VGG-F algorithms, we find that our approach achieves satisfactory image categorization results on two benchmark datasets. Specifically, higher categorization performance is achieved than by the classical approaches on 15 scene categories and action datasets. We conclude that the resulting DTSVDL possesses a good generality derived from attribute information as well as an excellent distinction derived from detailed information. In other words, the visual dictionary learned by our algorithm is more expressive and discriminatory. (C) 2019 SPIE and IS&T
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Two-layer binary tree data-driven model for valve stiction
    Chen, Si-Lu
    Tan, Kok Kiong
    Huang, Sunan
    [J]. INDUSTRIAL & ENGINEERING CHEMISTRY RESEARCH, 2008, 47 (08) : 2842 - 2848
  • [2] A two-layer ensemble learning framework for data-driven soft sensor of the diesel attributes in an industrial hydrocracking process
    Wang, Yalin
    Wu, Dongzhe
    Yuan, Xiaofeng
    [J]. JOURNAL OF CHEMOMETRICS, 2019, 33 (12)
  • [3] Data-driven reduced order modeling of a two-layer quasi-geostrophic ocean model
    Besabe, Lander
    Girfoglio, Michele
    Quaini, Annalisa
    Rozza, Gianluigi
    [J]. Results in Engineering, 2025, 25
  • [4] Data-driven learning: From Collins Cobuild Dictionary to ChatGPT
    Flowerdew, John
    [J]. LANGUAGE TEACHING, 2024,
  • [5] A GP Based Two-Layer Framework for Data-Driven Modeling of Swarm Self-Organizing Rules
    Wang, Tao
    Peng, Xingguang
    Wu, Yapei
    Gao, Jian
    [J]. 2019 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2019, : 174 - 181
  • [6] A model-based data-driven dictionary learning for seismic data representation
    Yarman, Can Evren
    Kumar, Rajiv
    Rickett, James
    [J]. GEOPHYSICAL PROSPECTING, 2018, 66 (01) : 98 - 123
  • [7] Learning the structure of wind: A data-driven nonlocal turbulence model for the atmospheric boundary layer
    Keith, B.
    Khristenko, U.
    Wohlmuth, B.
    [J]. PHYSICS OF FLUIDS, 2021, 33 (09)
  • [8] Seismic data denoising based on data-driven tight frame dictionary learning method
    ZHENG Jialiang
    WANG Deli
    ZHANG Liang
    [J]. Global Geology, 2020, 23 (04) : 241 - 246
  • [9] Data-driven templates with dictionary learning and sparse representations for photometric redshift estimation
    Frontera-Pons, J.
    Sureau, F.
    Bobin, J.
    Kilbinger, M.
    [J]. ASTRONOMY AND COMPUTING, 2023, 44
  • [10] The construction of a dictionary for a two-layer Chinese morphological analyzer
    Goh, Chooi-Ling
    Lu, Jia
    Cheng, Yuchang
    Asahara, Masayuki
    Matsumoto, Yuji
    [J]. PACLIC 20: PROCEEDINGS OF THE 20TH PACIFIC ASIA CONFERENCE ON LANGUAGE, INFORMATION AND COMPUTATION, 2006, : 332 - 340