Compression and machine learning: A new perspective on feature space vectors

被引:0
|
作者
Sculley, D. [1 ]
Brodley, Carla E. [1 ]
机构
[1] Tufts Univ, Dept Comp Sci, Medford, MA 02155 USA
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The use of compression algorithms in machine learning tasks such as clustering and classification has appeared in a variety of fields, sometimes with the promise of reducing problems of explicit feature selection. The theoretical justification for such methods has been founded on an tipper bound on Kolmogorov complexity and an idealized information space. An alternate view shows compression algorithms implicitly map strings into implicit feature space vectors, and compression-based similarity measures compute similarity within these feature spaces. Thus, compression-based methods are not a "parameter free" magic bullet for feature selection and data representation, but are instead concrete similarity measures within defined feature spaces, and are therefore akin to explicit feature vector models used in standard machine learning algorithms. To underscore this point, we find theoretical and empirical connections between traditional machine learning vector models and compression, encouraging cross-fertilization in future work.
引用
收藏
页码:332 / +
页数:4
相关论文
共 50 条
  • [1] Feature selection in machine learning: A new perspective
    Cai, Jie
    Luo, Jiawei
    Wang, Shulin
    Yang, Sheng
    [J]. NEUROCOMPUTING, 2018, 300 : 70 - 79
  • [2] Differentiation and Integration of Machine Learning Feature Vectors
    Mu, Xinying
    Pavel, Ana B.
    Kon, Mark
    [J]. 2016 15TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2016), 2016, : 611 - 616
  • [3] Exploring Chemical Reaction Space with Machine Learning Models: Representation and Feature Perspective
    Ding, Yuheng
    Qiang, Bo
    Chen, Qixuan
    Liu, Yiqiao
    Zhang, Liangren
    Liu, Zhenming
    [J]. JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2024, 64 (08) : 2955 - 2970
  • [4] Clustering in extreme learning machine feature space
    He, Qing
    Jin, Xin
    Du, Changying
    Zhuang, Fuzhen
    Shi, Zhongzhi
    [J]. NEUROCOMPUTING, 2014, 128 : 88 - 95
  • [5] Deep learning-based Feature compression for Video Coding for Machine
    Do, Jihoon
    Lee, Jooyoung
    Kim, Younhee
    Jeong, Se Yoon
    Choi, Jin Soo
    [J]. INTERNATIONAL WORKSHOP ON ADVANCED IMAGING TECHNOLOGY (IWAIT) 2022, 2022, 12177
  • [6] Machine Learning Model to Map Tribocorrosion Regimes in Feature Space
    Ramachandran, Rahul
    [J]. COATINGS, 2021, 11 (04)
  • [7] Deep Discriminative Feature Learning and Feature Space Transformation for Scalable Machine Fault Diagnosis
    Sreekumar, K. T.
    Kumar, C. Santhosh
    Ramachandran, K. I.
    [J]. IEEE ACCESS, 2024, 12 : 107944 - 107958
  • [8] Underwater Cylinder Recognition Using Machine Learning with DFT-based Feature Vectors
    Seo, Yoojeong
    On, Baeksan
    Jang, Beomhui
    Im, Sungbin
    Seo, Iksu
    [J]. 2018 INTERNATIONAL CONFERENCE ON ELECTRONICS, INFORMATION, AND COMMUNICATION (ICEIC), 2018, : 168 - 169
  • [9] Genetic & Evolutionary Biometrics: Feature Extraction from a Machine Learning Perspective
    Shelton, Joseph
    Alford, Aniesha
    Small, Lasanio
    Leflore, Derrick
    Williams, Jared
    Adams, Joshua
    Dozier, Gerry
    Bryant, Kelvin
    Abegaz, Tamirat
    Ricanek, Karl
    [J]. 2012 PROCEEDINGS OF IEEE SOUTHEASTCON, 2012,
  • [10] A Machine Learning Model for Accurate Cardiopulmonary Resuscitation Chest Compression Feature Extraction
    Lalitha, V
    Nithya, M.
    Samundeswari, S.
    Srinivasan, Srinidhi
    [J]. 2021 IEEE INTERNATIONAL WOMEN IN ENGINEERING (WIE) CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING (WIECON-ECE), 2022, : 33 - 36