Data-driven two-layer visual dictionary structure learning

Cited by: 3
Authors
Yu, Xiangchun [1]
Yu, Zhezhou [1]
Wu, Lei [1]
Pang, Wei [2]
Lin, Chenghua [2]
Affiliations
[1] Jilin Univ, Coll Comp Sci & Technol, Dept Computat Intelligence, Changchun, Jilin, Peoples R China
[2] Univ Aberdeen, Sch Nat & Comp Sci, Dept Comp Sci, Meston Bldg, Aberdeen, Scotland
Keywords
statistical modeling; overfitting; visual dictionary; Bayesian nonparametric model; deep learning; LATENT DIRICHLET ALLOCATION; HIERARCHICAL MODEL; WORDS; BAG; REPRESENTATION; FEATURES
DOI
10.1117/1.JEI.28.2.023006
Chinese Library Classification
TM [Electrical engineering]; TN [Electronic and communication technology]
Discipline classification code
0808; 0809
Abstract
An important issue in statistical modeling is determining the complexity of a model from the scale of the data, so that overfitting can be mitigated effectively when big data are not available. We adopt a data-driven approach that automatically determines the number of components of the model. To extract more robust features, we propose a framework for data-driven two-layer structure visual dictionary learning (DTSVDL). It divides visual dictionary structure learning into two levels: the attribute layer and the detail layer. In the attribute layer, the attributes of the image dataset are learned by a data-driven Bayesian nonparametric model. In the detail layer, the detailed information over the attributes is further explored and refined, and the attributes are weighted by the number of effective observations associated with each attribute. The proposed approach has three main advantages: (1) the two-layer structure makes the resulting visual dictionary more expressive; (2) the number of components in the attribute layer is determined automatically from the data; (3) the components are determined automatically based on the scale of the visual words, so the model can mitigate the overfitting problem well. In addition, in comparisons with stacked autoencoders, stacked denoising autoencoders, LeNet-5, speeded-up robust features, and the pretrained deep learning model ImageNet-VGG-F, our approach achieves satisfactory image categorization results on two benchmark datasets. Specifically, it achieves higher categorization performance than the classical approaches on the 15 scene categories and action datasets. We conclude that the resulting DTSVDL has good generality derived from attribute information as well as excellent distinctiveness derived from detailed information. In other words, the visual dictionary learned by our algorithm is more expressive and discriminative.
© 2019 SPIE and IS&T
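The two-layer idea in the abstract can be illustrated with a small sketch. This is not the authors' DTSVDL implementation: the abstract does not specify the Bayesian nonparametric model, so DP-means (a hard-clustering relative of Dirichlet process mixtures) is swapped in for the attribute layer, plain k-means stands in for the detail layer, and each attribute's weight is assumed to be its share of the observations. All function names, the threshold `lam`, and the toy data are hypothetical.

```python
import random

def sq_dist(a, b):
    return sum((x - y) ** 2 for x, y in zip(a, b))

def mean(points):
    n = len(points)
    return tuple(sum(p[d] for p in points) / n for d in range(len(points[0])))

def dp_means(points, lam, iters=20):
    """Attribute layer (assumed stand-in): DP-means opens a new cluster
    whenever a point's squared distance to every centre exceeds lam,
    so the number of clusters is not fixed in advance."""
    centers = [mean(points)]
    assign = [0] * len(points)
    for _ in range(iters):
        for i, p in enumerate(points):
            dists = [sq_dist(p, c) for c in centers]
            k = min(range(len(centers)), key=dists.__getitem__)
            if dists[k] > lam:
                centers.append(p)
                k = len(centers) - 1
            assign[i] = k
        # Recompute centres and drop clusters that ended up empty.
        groups = [[p for p, a in zip(points, assign) if a == k]
                  for k in range(len(centers))]
        kept = [k for k, g in enumerate(groups) if g]
        remap = {k: j for j, k in enumerate(kept)}
        centers = [mean(groups[k]) for k in kept]
        assign = [remap[a] for a in assign]
    return centers, assign

def kmeans(points, k, iters=10):
    """Detail layer (assumed stand-in): Lloyd's k-means inside one attribute."""
    centers = [points[i * len(points) // k] for i in range(k)]
    for _ in range(iters):
        groups = [[] for _ in range(k)]
        for p in points:
            j = min(range(k), key=lambda j: sq_dist(p, centers[j]))
            groups[j].append(p)
        centers = [mean(g) if g else c for g, c in zip(groups, centers)]
    return centers

def two_layer_dictionary(points, lam, words_per_attribute=2):
    centers, assign = dp_means(points, lam)
    dictionary = []
    for k, c in enumerate(centers):
        members = [p for p, a in zip(points, assign) if a == k]
        dictionary.append({
            "attribute": c,
            # Assumption: weight = fraction of observations in this attribute.
            "weight": len(members) / len(points),
            "details": kmeans(members, min(words_per_attribute, len(members))),
        })
    return dictionary

# Toy descriptors: three well-separated 2-D groups of 30 points each.
rng = random.Random(0)
blobs = [(0.0, 0.0), (20.0, 0.0), (0.0, 20.0)]
points = [(cx + rng.uniform(-1, 1), cy + rng.uniform(-1, 1))
          for cx, cy in blobs for _ in range(30)]

dictionary = two_layer_dictionary(points, lam=9.0)
for entry in dictionary:
    print(entry["attribute"], entry["weight"], len(entry["details"]))
```

In this sketch the threshold `lam` plays the role that the nonparametric prior plays in the abstract: it lets the data decide how many attribute-level components exist, while the per-attribute word count is kept fixed only for simplicity.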
Pages: 15