Adaptive Data Compression for Classification Problems

Cited by: 0
Authors
Pourkamali-Anaraki, Farhad [1 ]
Bennette, Walter D. [2 ]
Affiliations
[1] Univ Massachusetts, Dept Comp Sci, Lowell, MA 01854 USA
[2] Air Force Res Lab, Informat Directorate, Rome, NY 13441 USA
Keywords
Data compression; Data models; Training data; Neural networks; Computational modeling; Training; Task analysis; Adaptive algorithms; Supervised learning; Classification algorithms; Compression algorithms; Class-imbalanced data
DOI
10.1109/ACCESS.2021.3130551
Chinese Library Classification number
TP [Automation technology, computer technology]
Subject classification code
0812
Abstract
Data subset selection is a crucial task in deploying machine learning algorithms under strict constraints regarding memory and computation resources. Despite extensive research in this area, a practical difficulty is the lack of rigorous strategies for identifying the optimal size of the reduced data to regulate trade-offs between accuracy and efficiency. Furthermore, existing methods are often built around specific machine learning models, and translating existing theoretical results into practice is challenging for practitioners. To address these problems, we propose two adaptive compression algorithms for classification problems by formulating data subset selection in the form of interactive teaching. The user interacts with the learning task to adapt to the unique structure of the problem at hand, developing an iterative importance sampling scheme. We also propose to couple importance sampling and a diversity criterion to further control the evolution of the data summary over the rounds of interaction. We conduct extensive experiments on several data sets, including imbalanced and multiclass data, and various classification algorithms, such as ensemble learning and neural networks. Our results demonstrate the performance, efficiency, and ease of implementation of the underlying framework.
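The abstract's general idea, iterative importance sampling combined with a diversity criterion, can be illustrated with a minimal sketch. This is not the authors' algorithm: the nearest-centroid learner, the importance weights (1.0 for misclassified points, 0.05 otherwise), the farthest-point diversity rule, and the names `adaptive_subset`, `rounds`, `batch`, and `pool` are all assumptions chosen for illustration.

```python
import numpy as np

def predict(centroids, X):
    # Index of the nearest centroid (squared Euclidean distance) for each row of X.
    d = ((X[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=2)
    return d.argmin(axis=1)

def adaptive_subset(X, y, rounds=5, batch=10, pool=3, seed=0):
    """Grow a data summary over several rounds (illustrative sketch only)."""
    rng = np.random.default_rng(seed)
    classes = np.unique(y)
    # Seed the summary with one random point per class so every class is present.
    selected = [int(rng.choice(np.flatnonzero(y == c))) for c in classes]
    for _ in range(rounds):
        Xs, ys = X[selected], y[selected]
        # Fit a nearest-centroid learner on the current summary (assumed learner).
        centroids = np.stack([Xs[ys == c].mean(axis=0) for c in classes])
        pred = classes[predict(centroids, X)]
        # Importance weights: points the current learner gets wrong matter more.
        w = np.where(pred != y, 1.0, 0.05)
        w[selected] = 0.0  # never resample points already in the summary
        n_avail = int(np.count_nonzero(w))
        if n_avail == 0:
            break
        # Importance sampling: draw a candidate pool larger than the batch.
        k = min(batch * pool, n_avail)
        cand = rng.choice(len(X), size=k, replace=False, p=w / w.sum())
        # Diversity: greedily keep the candidate farthest from the summary.
        for _ in range(min(batch, k)):
            d = ((X[cand][:, None, :] - X[selected][None, :, :]) ** 2).sum(axis=2)
            best = int(cand[d.min(axis=1).argmax()])
            selected.append(best)
            cand = cand[cand != best]
    return np.array(selected)
```

Calling `adaptive_subset(X, y)` on a feature matrix `X` and label vector `y` returns the row indices of the summary. The number of rounds controls the summary size, which loosely mirrors the accuracy-versus-efficiency trade-off the abstract mentions.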
Pages: 157654-157669
Page count: 16
Related papers
50 in total
  • [41] Adaptive Automatic Monitoring System with Data Compression
    Antonyuk, E. M.
    Varshayskiy, I. E.
    Antonyuk, P. E.
    PROCEEDINGS OF 2019 XXII INTERNATIONAL CONFERENCE ON SOFT COMPUTING AND MEASUREMENTS (SCM), 2019, : 179 - 180
  • [42] INNOVATIONS APPROACH TO ADAPTIVE DATA COMPRESSION IN DATA-TRANSMISSION
    MARK, JW
    IEEE TRANSACTIONS ON COMMUNICATIONS, 1974, CO22 (10) : 1618 - 1629
  • [43] ADAPTIVE DATA AUGMENTATION FOR IMAGE CLASSIFICATION
    Fawzi, Alhussein
    Samulowitz, Horst
    Turaga, Deepak
    Frossard, Pascal
    2016 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2016, : 3688 - 3692
  • [44] Adaptive Oversampling for Imbalanced Data Classification
    Ertekin, Seyda
    INFORMATION SCIENCES AND SYSTEMS 2013, 2013, 264 : 261 - 269
  • [45] ADAPTIVE BAYESIAN CLASSIFICATION OF SPATIAL DATA
    KLEIN, R
    PRESS, SJ
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1992, 87 (419) : 844 - 851
  • [46] Fractal Image Compression With Adaptive Quardtree Partitioning And Archetype Classification
    Nandi, Utpal
    Mandal, Jyotsna Kumar
    2015 IEEE INTERNATIONAL CONFERENCE ON RESEARCH IN COMPUTATIONAL INTELLIGENCE AND COMMUNICATION NETWORKS (ICRCICN), 2015, : 56 - 60
  • [47] Adaptive Stereo Image Joint Compression Based on Characteristics Classification
    Li, Shizhong
    Chen, Zhe
    Liu, Xiaofeng
    Ran, Zhaochun
    MIPPR 2011: MULTISPECTRAL IMAGE ACQUISITION, PROCESSING, AND ANALYSIS, 2011, 8002
  • [48] Measurement of Data Complexity for Classification Problems with Unbalanced Data
    Anwar, Nafees
    Jones, Geoff
    Ganesh, Siva
    STATISTICAL ANALYSIS AND DATA MINING, 2014, 7 (03) : 194 - 211
  • [49] Improving the Accuracy and Efficiency of Compression-based Dissimilarity Measure using Information Quantity in Data Classification Problems
    Takamoto A.
    Kohara Y.
    Yoshida M.
    Umemura K.
    Transactions of the Japanese Society for Artificial Intelligence, 2023, 38 (01) : 1 - 15
  • [50] A Self-Adaptive Fireworks Algorithm for Classification Problems
    Xue, Yu
    Zhao, Binping
    Ma, Tinghuai
    Pang, Wei
    IEEE ACCESS, 2018, 6 : 44406 - 44416