A Unified Search Framework for Data Augmentation and Neural Architecture on Small-Scale Image Data Sets

被引:1
|
作者
Zhang, Jianwei [1 ]
Zhang, Lei [1 ]
Li, Dong [1 ]
Wang, Lituan [1 ]
机构
[1] Sichuan Univ, Coll Comp Sci, Chengdu 610065, Peoples R China
基金
中国国家自然科学基金;
关键词
Automated data augmentation (auto augmentation); automated machine learning; neural architecture search (NAS);
D O I
10.1109/TCDS.2023.3274177
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data augmentation is an effective technique to enrich the training data's diversity and reduce the risk of overfitting. However, different data sets have distinct preferences on various augmentation techniques. Recently, automated data augmentation (auto-augmentation), which could engineer augmentation policy automatically, drew a growing interest. Previous auto-augmentation methods usually utilize a density matching (DM) strategy, which highly depends on a large data scale to ensure a precise policy evaluation. When facing small-scale data sets, it usually achieves an inferior performance. To address the problem, an improved method named Augmented DM is proposed by augmenting the train data with policies uniformly sampled from a prior distribution, making the policy evaluation more precise. Moreover, we propose a unified search framework for data augmentation and neural architecture (USAA) by formulating the search processes with one formulation. As a result, both optimal augmentation policy and neural architecture could be obtained within one round of the search process. Extensive experiments have been conducted on a bag of medical data sets with general small scales, and the results show that the proposed Augmented DM and USAA can outperform the state-of-the-art auto-augmentation and AutoML approaches, respectively.
引用
收藏
页码:501 / 510
页数:10
相关论文
共 50 条
  • [1] A data-driven framework for the stochastic reconstruction of small-scale features with application to climate data sets
    Wan, Zhong Yi
    Dodov, Boyko
    Lessig, Christian
    Dijkstra, Henk
    Sapsis, Themistoklis P.
    JOURNAL OF COMPUTATIONAL PHYSICS, 2021, 442
  • [2] Optimizing Data Augmentation for Semantic Segmentation on Small-Scale Dataset
    Ma, Rui
    Tao, Pin
    Tang, Huiyun
    ICCCV 2019: PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON CONTROL AND COMPUTER VISION, 2019, : 77 - 81
  • [3] PreAugNet: improve data augmentation for industrial defect classification with small-scale training data
    Isack Farady
    Chih-Yang Lin
    Ming-Ching Chang
    Journal of Intelligent Manufacturing, 2024, 35 : 1233 - 1246
  • [4] PreAugNet: improve data augmentation for industrial defect classification with small-scale training data
    Farady, Isack
    Lin, Chih-Yang
    Chang, Ming-Ching
    JOURNAL OF INTELLIGENT MANUFACTURING, 2024, 35 (03) : 1233 - 1246
  • [5] APPLE DISEASE RECOGNITION BASED ON SMALL-SCALE DATA SETS
    Song, Chenyong
    Wang, Dongwei
    Bai, Haoran
    Sun, Weihao
    APPLIED ENGINEERING IN AGRICULTURE, 2021, 37 (03) : 481 - 490
  • [6] A data-driven model of drop size prediction based on artificial neural networks using small-scale data sets
    Bo Wang
    Han Zhou
    Shan Jing
    Qiang Zheng
    Wenjie Lan
    Shaowei Li
    Chinese Journal of Chemical Engineering, 2024, 66 (02) : 71 - 83
  • [7] A data-driven model of drop size prediction based on artificial neural networks using small-scale data sets
    Wang, Bo
    Zhou, Han
    Jing, Shan
    Zheng, Qiang
    Lan, Wenjie
    Li, Shaowei
    CHINESE JOURNAL OF CHEMICAL ENGINEERING, 2024, 66 : 71 - 83
  • [8] A Unified Framework From Face Image Restoration to Data Augmentation Using Generative Prior
    You, Jiawei
    Huang, Ganyu
    Han, Tianyuan
    Yang, Haoze
    Shen, Liping
    IEEE ACCESS, 2023, 11 : 2907 - 2919
  • [9] Search for a small-scale neutrino anisotropy with three years of IceCube data
    Bernhard, Anna
    6TH INTERNATIONAL WORKSHOP ON VERY LARGE VOLUME NEUTRINO TELESCOPES (VLVVT 13), 2014, 1630 : 70 - 73
  • [10] Improving classification accuracy using data augmentation on small data sets
    Moreno-Barea, Francisco J.
    Jerez, Jose M.
    Franco, Leonardo
    EXPERT SYSTEMS WITH APPLICATIONS, 2020, 161 (161)