EASY DATA AUGMENTATION METHOD FOR CLASSIFICATION TASKS

被引:1
|
作者
Liu Guohang [1 ]
Zhang Shibin [1 ]
Tang Haozhe [1 ]
Yang Lu [1 ]
Lu Jiazhong [1 ]
Huang Yuanyuan [1 ]
机构
[1] Chengdu Univ Informat Technol, Sch Cybersecur, Chengdu 610225, Peoples R China
基金
中国国家自然科学基金;
关键词
Data augmentation; Machine learning; Deep learning; K-means;
D O I
10.1109/ICCWAMTIP51612.2020.9317525
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In recent years, the rapid development of deep learning has made great progress in artificial intelligence-related technologies. With the deepening of artificial intelligence research, machine learning is applied to more and more fields. Although machine learning has many advantages and has achieved considerable results, machine learning and its related learning algorithms still face some related challenges. That is, the lack of sufficient training data or uneven class balance in the data set. In deep artificial neural networks, a large amount of training data is needed to learn effectively, and collecting such training data is often expensive and laborious. Data Augmentation overcomes this problem by artificially expanding the training set and label retention conversion. In this article, we propose a method to expand the data set based on ball k-means. The experiments on the Four-class, Digits, Iris and Breast-cancer data sets prove the efficiency of the method-fast execution speed, low computational complexity, and the effectiveness of the method. In these experiments, the quality of the training model has been improved, but the learning time stays the same as when the enhancement method is unused.
引用
收藏
页码:166 / 169
页数:4
相关论文
共 50 条
  • [1] EDA: Easy Data Augmentation Techniques for Boosting Performance on Text Classification Tasks
    Wei, Jason
    Zou, Kai
    [J]. 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 6382 - 6388
  • [2] Iterative Translation-Based Data Augmentation Method for Text Classification Tasks
    Lee, Sangwon
    Liu, Ling
    Choi, Wonik
    [J]. IEEE ACCESS, 2021, 9 : 160437 - 160445
  • [3] Data Augmentation Methods for Enhancing Robustness in Text Classification Tasks
    Tang, Huidong
    Kamei, Sayaka
    Morimoto, Yasuhiko
    [J]. ALGORITHMS, 2023, 16 (01)
  • [4] EasyAug: An Automatic Textual Data Augmentation Platform for Classification Tasks
    Qiu, Siyuan
    Xu, Binxia
    Zhang, Jie
    Wang, Yafang
    Shen, Xiaoyu
    de Melo, Gerard
    Long, Chong
    Li, Xiaolong
    [J]. WWW'20: COMPANION PROCEEDINGS OF THE WEB CONFERENCE 2020, 2020, : 249 - 252
  • [5] Text Smoothing: Enhance Various Data Augmentation Methods on Text Classification Tasks
    Wu, Xing
    Gao, Chaochen
    Lin, Meng
    Zang, Liangjun
    Hu, Songlin
    [J]. PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022): (SHORT PAPERS), VOL 2, 2022, : 871 - 875
  • [6] Data augmentation using virtual word insertion techniques in text classification tasks
    Long, Zhigao
    Li, Hong
    Shi, Jiawen
    Ma, Xin
    [J]. EXPERT SYSTEMS, 2024, 41 (04)
  • [7] The Performance Research of the Data Augmentation Method for Image Classification
    Zhang, Ruirui
    Zhou, Bolin
    Lu, Chang
    Ma, Manzeng
    [J]. MATHEMATICAL PROBLEMS IN ENGINEERING, 2022, 2022
  • [8] EPiDA: An Easy Plug-in Data Augmentation Framework for High Performance Text Classification
    Zhao, Minyi
    Zhang, Lu
    Xu, Yi
    Ding, Jiandong
    Guan, Jihong
    Zhou, Shuigeng
    [J]. NAACL 2022: THE 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES, 2022, : 4742 - 4752
  • [9] MDA: Multimodal Data Augmentation Framework for Boosting Performance on Sentiment/Emotion Classification Tasks
    Xu, Nan
    Mao, Wenji
    Wei, Penghui
    Zeng, Daniel
    [J]. IEEE INTELLIGENT SYSTEMS, 2021, 36 (06) : 3 - 11
  • [10] Data independent sequence augmentation method for acoustic scene classification
    Zhang, Teng
    Zhang, Kailai
    Wu, Ji
    [J]. 19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 3289 - 3293