On ML-Based Program Translation: Perils and Promises

被引:1
|
作者
Malyala, Aniketh [1 ]
Zhou, Katelyn [1 ]
Ray, Baishakhi [2 ]
Chakraborty, Saikat [3 ]
机构
[1] Silver Creek High Sch, San Jose, CA 95121 USA
[2] Columbia Univ, New York, NY USA
[3] Microsoft Res, Redmond, WA USA
关键词
Code generation; code translation; program transformation;
D O I
10.1109/ICSE-NIER58687.2023.00017
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
With the advent of new and advanced programming languages, it becomes imperative to migrate legacy software to new programming languages. Unsupervised Machine Learning-based Program Translation could play an essential role in such migration, even without a sufficiently sizeable reliable corpus of parallel source code. However, these translators are far from perfect due to their statistical nature. This work investigates unsupervised program translators and where and why they fail. With in-depth error analysis of such failures, we have identified that the cases where such translators fail follow a few particular patterns. With this insight, we develop a rule-based program mutation engine, which pre-processes the input code if the input follows specific patterns and post-process the output if the output follows certain patterns. We show that our code processing tool, in conjunction with the program translator, can form a hybrid program translator and significantly improve the state-of-the-art. In the future, we envision an end-to-end program translation tool where programming domain knowledge can be embedded into an ML-based translation pipeline using pre- and post-processing steps.
引用
收藏
页码:60 / 65
页数:6
相关论文
共 50 条
  • [31] ML-based tracking algorithms for MIMO-OFDM
    Oberli, Christian
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2007, 6 (07) : 2630 - 2639
  • [32] ML-Based Predictive Modelling of Stock Market Returns
    Bogdanova, Boryana
    Stancheva-Todorova, Eleonora
    APPLICATIONS OF MATHEMATICS IN ENGINEERING AND ECONOMICS (AMEE20), 2021, 2333
  • [33] Eluding ML-based Adblockers With Actionable Adversarial Examples
    Zhu, Shitong
    Wang, Zhongjie
    Chen, Xun
    Li, Shasha
    Man, Keyu
    Iqbal, Umar
    Qian, Zhiyun
    Chan, Kevin S.
    Krishnamurthy, Srikanth V.
    Shafiq, Zubair
    Hao, Yu
    Li, Guoren
    Zhang, Zheng
    Zou, Xiaochen
    37TH ANNUAL COMPUTER SECURITY APPLICATIONS CONFERENCE, ACSAC 2021, 2021, : 541 - 553
  • [34] The promises and perils of "scientifically based" research for urban schools
    Shealey, MW
    URBAN EDUCATION, 2006, 41 (01) : 5 - 19
  • [35] The promises and perils of participation on site-based councils
    Malen, B
    THEORY INTO PRACTICE, 1999, 38 (04) : 209 - 216
  • [36] ML-based Cross-Platform Query Optimization
    Kaoudi, Zoi
    Quiane-Ruiz, Jorge-Arnulfo
    Contreras-Rojas, Bertty
    Pardo-Meza, Rodrigo
    Troudi, Anis
    Chawla, Sanjay
    2020 IEEE 36TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2020), 2020, : 1489 - 1500
  • [37] JANES: A NAS Framework for ML-based EDA Applications
    Selg, Hardi
    Jenihhin, Maksim
    Ellervee, Peeter
    34TH IEEE INTERNATIONAL SYMPOSIUM ON DEFECT AND FAULT TOLERANCE IN VLSI AND NANOTECHNOLOGY SYSTEMS (DFT 2021), 2021,
  • [38] AI/ML-Based Medical Image Processing and Analysis
    Alghazo, Jaafar
    Latif, Ghazanfar
    DIAGNOSTICS, 2023, 13 (24)
  • [39] Interpretable ML-Based Forecasting of CMEs Associated with Flares
    Hemapriya Raju
    Saurabh Das
    Solar Physics, 2023, 298
  • [40] ML-Based Selection Relay with Transmission Power Constraint
    Pan, Lider
    Cheng, Hon-Chi
    An, John F.
    2012 12TH INTERNATIONAL CONFERENCE ON ITS TELECOMMUNICATIONS (ITST-2012), 2012, : 236 - 241