Automatic Feature Generation for Machine Learning Based Optimizing Compilation

被引:55
|
作者
Leather, Hugh [1 ]
Bonilla, Edwin [1 ]
O'Boyle, Michael [1 ]
机构
[1] Univ Edinburgh, Sch Informat, Edinburgh, Midlothian, Scotland
关键词
D O I
10.1109/CGO.2009.21
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Recent work has shown that machine learning can automate and in some cases outperform hand crafted compiler optimizations. Central to such an approach is that machine learning techniques typically rely upon summaries or features of the program. The quality of these features is critical to the accuracy of the resulting machine learned algorithm; no machine learning method will work well with poorly chosen features. However, due to the size and complexity of programs, theoretically there are an infinite number of potential features to choose from. The compiler writer now has to expend effort in choosing the best features from this space. This paper develops a novel mechanism to automatically find those features which most improve the quality of the machine learned heuristic. The feature space is described by a grammar and is then searched with genetic programming and predictive modeling. We apply this technique to loop unrolling in GCC 4.3.1 and evaluate our approach on a Pentium 6. On a benchmark suite of 57 programs, GCC's hard-coded heuristic achieves only 3% of the maximum performance available, while a state of the art machine learning approach with hand-coded features obtains 59%. Our feature generation technique is able to achieve 76% of the maximum available speedup, outperforming existing approaches.
引用
收藏
页码:81 / 91
页数:11
相关论文
共 50 条
  • [31] Optimizing IoT intrusion detection system: feature selection versus feature extraction in machine learning
    Jing Li
    Mohd Shahizan Othman
    Hewan Chen
    Lizawati Mi Yusuf
    [J]. Journal of Big Data, 11
  • [32] Automatic question generation based on sentence structure analysis using machine learning approach
    Blstak, Miroslav
    Rozinajova, Viera
    [J]. NATURAL LANGUAGE ENGINEERING, 2022, 28 (04) : 487 - 517
  • [33] TASO: Optimizing Deep Learning Computation with Automatic Generation of Graph Substitutions
    Jia, Zhihao
    Padon, Oded
    Thomas, James
    Warszawski, Todd
    Zaharia, Matei
    Aiken, Alex
    [J]. PROCEEDINGS OF THE TWENTY-SEVENTH ACM SYMPOSIUM ON OPERATING SYSTEMS PRINCIPLES (SOSP '19), 2019, : 47 - 62
  • [34] FEATURE-BASED MODELING FOR AUTOMATIC MESH GENERATION
    UNRUH, V
    ANDERSON, DC
    [J]. ENGINEERING WITH COMPUTERS, 1992, 8 (01) : 1 - 12
  • [35] Deep Learning and Linguistic Feature Based Automatic Multiple Choice Question Generation from Text
    Agarwal, Rajat
    Negi, Vaishnav
    Kalra, Akshat
    Mittal, Ankush
    [J]. DISTRIBUTED COMPUTING AND INTELLIGENT TECHNOLOGY, ICDCIT 2022, 2022, 13145 : 260 - 264
  • [36] Automatic Feature Learning for Glaucoma Detection Based on Deep Learning
    Chen, Xiangyu
    Xu, Yanwu
    Yan, Shuicheng
    Wong, Damon Wing Kee
    Wong, Tien Yin
    Liu, Jiang
    [J]. MEDICAL IMAGE COMPUTING AND COMPUTER-ASSISTED INTERVENTION, PT III, 2015, 9351 : 669 - 677
  • [37] Sentiment Analysis using Feature Generation And Machine Learning Approach
    Srivastava, Roopam
    Bharti, P. K.
    Verma, Parul
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION, AND INTELLIGENT SYSTEMS (ICCCIS), 2021, : 86 - 91
  • [38] Using Feature Selection with Machine Learning for Generation of Insurance Insights
    Taha, Ayman
    Cosgrave, Bernard
    Mckeever, Susan
    [J]. APPLIED SCIENCES-BASEL, 2022, 12 (06):
  • [39] Automatic Detection of Seizures Using Extreme Learning Machine with a Single Feature
    Qin, Yingmei
    Han, Chunxiao
    Lu, Meili
    Wang, Ruofan
    Yang, Li
    Che, Yanqiu
    [J]. 2018 37TH CHINESE CONTROL CONFERENCE (CCC), 2018, : 4430 - 4433
  • [40] Machine Learning Feature Based Job Scheduling for Distributed Machine Learning Clusters
    Wang, Haoyu
    Liu, Zetian
    Shen, Haiying
    [J]. IEEE-ACM TRANSACTIONS ON NETWORKING, 2023, 31 (01) : 58 - 73