Data-driven two-layer visual dictionary structure learning

Cited by: 3

Authors:

Yu, Xiangchun [1]

Yu, Zhezhou [1]

Wu, Lei [1]

Pang, Wei [2]

Lin, Chenghua [2]

Affiliations:

[1] Jilin Univ, Coll Comp Sci & Technol, Dept Computat Intelligence, Changchun, Jilin, Peoples R China

[2] Univ Aberdeen, Sch Nat & Comp Sci, Dept Comp Sci, Meston Bldg, Aberdeen, Scotland

Source:

JOURNAL OF ELECTRONIC IMAGING | 2019 / Vol. 28 / Issue 02

Keywords:

statistical modeling; overfitting; visual dictionary; Bayesian nonparametric model; deep learning; LATENT DIRICHLET ALLOCATION; HIERARCHICAL MODEL; WORDS; BAG; REPRESENTATION; FEATURES;

DOI:

10.1117/1.JEI.28.2.023006

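Chinese Library Classification (CLC) number:

TM [Electrical engineering]; TN [Electronic technology, communication technology];

Discipline classification code:

0808 ; 0809 ;

Abstract: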

An important issue in statistical modeling is determining the complexity of the model from the scale of the data, so that overfitting can be mitigated effectively when big data are unavailable. We adopt a data-driven approach to automatically determine the number of components of the model. To extract more robust features, we propose a framework of data-driven two-layer structure visual dictionary learning (DTSVDL). It divides visual dictionary structure learning into two levels: the attribute layer and the detail layer. In the attribute layer, the attributes of the image dataset are learned by a data-driven Bayesian nonparametric model. Then, in the detail layer, the detailed information over attributes is further explored and refined, and the attributes are weighted by the number of effective observations associated with each attribute. Our proposed approach has three main advantages: (1) the two-layer structure makes the constructed visual dictionary more expressive; (2) the number of components in the attribute layer is determined automatically from the data; (3) the components are determined automatically based on the scale of visual words, so our model mitigates the overfitting problem well. In addition, in comparisons with stacked autoencoders, stacked denoising autoencoders, LeNet-5, speeded-up robust features, and the pretrained deep learning model ImageNet-VGG-F, we find that our approach achieves satisfactory image categorization results on two benchmark datasets. Specifically, it achieves higher categorization performance than the classical approaches on the 15 scene categories and action datasets. We conclude that the resulting DTSVDL possesses good generality derived from attribute information as well as excellent distinctiveness derived from detailed information. In other words, the visual dictionary learned by our algorithm is more expressive and discriminative.
(C) 2019 SPIE and IS&T
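
The abstract's idea that the number of components is set by the data rather than fixed in advance can be illustrated with a minimal sketch. This is not the authors' DTSVDL implementation: it assumes a Chinese restaurant process, a standard Bayesian nonparametric prior, as a stand-in for the attribute-layer model, and it assumes that "weighted by the number of effective observations" can be approximated by normalized per-component counts. The names `crp_assign`, `component_weights`, and the concentration parameter `alpha` are hypothetical.

```python
import random


def crp_assign(n_items, alpha, seed=0):
    """Sample component sizes from a Chinese restaurant process.

    Assumption: this stands in for a data-driven Bayesian nonparametric
    model; it is not the model used in the paper.
    """
    rng = random.Random(seed)
    counts = []  # counts[k] = number of items assigned to component k
    for i in range(n_items):
        # Item i joins component k with probability counts[k] / (i + alpha)
        # and opens a new component with probability alpha / (i + alpha).
        r = rng.uniform(0, i + alpha)
        acc = 0.0
        for k, c in enumerate(counts):
            acc += c
            if r < acc:
                counts[k] += 1
                break
        else:
            counts.append(1)  # no existing component chosen: create one
    return counts


def component_weights(counts):
    """Weight each component by its share of the observations (assumed scheme)."""
    total = sum(counts)
    return [c / total for c in counts]


# The number of components is not fixed; it depends on how much data is seen.
for n in (100, 1000, 10000):
    sizes = crp_assign(n, alpha=1.0)
    print(n, len(sizes), max(component_weights(sizes)))
```

Under this prior the expected number of components grows roughly logarithmically with the number of observations, which is one way a model's complexity can track the scale of the data instead of being chosen by hand.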

Pages: 15

