Adaptive Data Compression for Classification Problems

被引:0
|
作者
Pourkamali-Anaraki, Farhad [1 ]
Bennette, Walter D. [2 ]
机构
[1] Univ Massachusetts, Dept Comp Sci, Lowell, MA 01854 USA
[2] Air Force Res Lab, Informat Directorate, Rome, NY 13441 USA
关键词
Data compression; Data models; Training data; Neural networks; Computational modeling; Training; Task analysis; Adaptive algorithms; supervised learning; classification algorithms; compression algorithms; CLASS-IMBALANCED DATA;
D O I
10.1109/ACCESS.2021.3130551
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Data subset selection is a crucial task in deploying machine learning algorithms under strict constraints regarding memory and computation resources. Despite extensive research in this area, a practical difficulty is the lack of rigorous strategies for identifying the optimal size of the reduced data to regulate trade-offs between accuracy and efficiency. Furthermore, existing methods are often built around specific machine learning models, and translating existing theoretical results into practice is challenging for practitioners. To address these problems, we propose two adaptive compression algorithms for classification problems by formulating data subset selection in the form of interactive teaching. The user interacts with the learning task at hand to adapt to the unique structure of the problem at hand, developing an iterative importance sampling scheme. We also propose to couple importance sampling and a diversity criterion to further control the evolution of the data summary over the rounds of interaction. We conduct extensive experiments on several data sets, including imbalanced and multiclass data, and various classification algorithms, such as ensemble learning and neural networks. Our results demonstrate the performance, efficiency, and ease of implementation of the underlying framework.
引用
收藏
页码:157654 / 157669
页数:16
相关论文
共 50 条
  • [31] A NEURAL APPROACH TO DATA-COMPRESSION AND CLASSIFICATION
    KRATZER, KP
    LECTURE NOTES IN ARTIFICIAL INTELLIGENCE, 1991, 541 : 250 - 263
  • [32] Adaptive compression of DICOM-image data
    Hludov, S
    Engel, T
    Meinel, C
    ELECTRONIC IMAGING: PROCESSING, PRINTING, AND PUBLISHING IN COLOR, 1998, 3409 : 260 - 266
  • [33] The Lossless Adaptive Binomial Data Compression Method
    Borysenko, Oleksiy
    Matsenko, Svitlana
    Salgals, Toms
    Spolitis, Sandis
    Bobrovs, Vjaceslavs
    APPLIED SCIENCES-BASEL, 2022, 12 (19):
  • [34] AN ADAPTIVE ALGORITHM FOR THE COMPRESSION OF COMPUTER-DATA
    RAMABADRAN, TV
    COHN, DL
    IEEE TRANSACTIONS ON COMMUNICATIONS, 1989, 37 (04) : 317 - 324
  • [35] Text Ranking and Classification using Data Compression
    Kasturi, Nitya
    Markov, Igor L.
    WORKSHOP AT NEURIPS 2021, VOL 163, 2021, 163 : 48 - 53
  • [36] An Intelligent, Adaptive, and Flexible Data Compression Framework
    Devarajan, Hariharan
    Kougkas, Anthony
    Sun, Xian-He
    2019 19TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING (CCGRID), 2019, : 82 - 91
  • [37] An adaptive character wordlength algorithm for data compression
    Al-Bahadili, Hussein
    Hussain, Shakir M.
    COMPUTERS & MATHEMATICS WITH APPLICATIONS, 2008, 55 (06) : 1250 - 1256
  • [38] ADAPTIVE DCT FOR IMAGE-DATA COMPRESSION
    DENATALE, FGB
    DESOLI, GS
    GIUSTO, DD
    VERNAZZA, G
    EUROPEAN TRANSACTIONS ON TELECOMMUNICATIONS, 1992, 3 (04): : 359 - 366
  • [39] Data acquisition and compression in adaptive mechanical systems
    Zhang, WH
    Michaelis, B
    WHERE INSTRUMENTATION IS GOING - CONFERENCE PROCEEDINGS, VOLS 1 AND 2, 1998, : 1110 - 1115
  • [40] A LOCALLY ADAPTIVE DATA-COMPRESSION SCHEME
    HORSPOOL, RN
    CORMACK, GV
    COMMUNICATIONS OF THE ACM, 1987, 30 (09) : 792 - 794