On ML-Based Program Translation: Perils and Promises

Cited by: 1
Authors
Malyala, Aniketh [1]
Zhou, Katelyn [1]
Ray, Baishakhi [2]
Chakraborty, Saikat [3]
Affiliations
[1] Silver Creek High Sch, San Jose, CA 95121 USA
[2] Columbia Univ, New York, NY USA
[3] Microsoft Res, Redmond, WA USA
Keywords
Code generation; code translation; program transformation
DOI
10.1109/ICSE-NIER58687.2023.00017
Chinese Library Classification (CLC) number
TP31 [Computer software]
Discipline classification code
081202; 0835
Abstract
With the advent of new and advanced programming languages, it becomes imperative to migrate legacy software to new programming languages. Unsupervised Machine Learning-based Program Translation could play an essential role in such migration, even without a sufficiently large and reliable corpus of parallel source code. However, these translators are far from perfect due to their statistical nature. This work investigates unsupervised program translators and where and why they fail. Through in-depth error analysis of such failures, we have identified that the cases where such translators fail follow a few particular patterns. With this insight, we develop a rule-based program mutation engine, which pre-processes the input code if the input follows specific patterns and post-processes the output if the output follows certain patterns. We show that our code processing tool, in conjunction with the program translator, can form a hybrid program translator and significantly improve on the state of the art. In the future, we envision an end-to-end program translation tool where programming domain knowledge can be embedded into an ML-based translation pipeline using pre- and post-processing steps.
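The hybrid design described in the abstract (pattern-triggered pre-processing, an ML translator, then pattern-triggered post-processing) can be sketched roughly as below. This is a minimal illustration under stated assumptions, not the authors' implementation: the `Rule` class, `hybrid_translate`, both example rules, and the stand-in `toy_translator` are hypothetical names invented here, and the abstract does not say which patterns the real engine handles.

```python
import re
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Rule:
    """Hypothetical rewrite rule: it changes the code only where `pattern` matches."""
    name: str
    pattern: str
    replacement: str

    def apply(self, code: str) -> str:
        # re.sub leaves the code untouched when the pattern does not occur.
        return re.sub(self.pattern, self.replacement, code)


def run_rules(code: str, rules: List[Rule]) -> str:
    """Apply each rule in order."""
    for rule in rules:
        code = rule.apply(code)
    return code


def hybrid_translate(
    source: str,
    translator: Callable[[str], str],
    pre_rules: List[Rule],
    post_rules: List[Rule],
) -> str:
    """Pre-process the input, call the translator, then post-process its output."""
    prepared = run_rules(source, pre_rules)
    raw_output = translator(prepared)
    return run_rules(raw_output, post_rules)


# Illustrative rules only (assumptions, not taken from the paper).
PRE_RULES = [Rule("expand-increment", r"(\w+)\+\+", r"\1 += 1")]
POST_RULES = [Rule("null-to-None", r"\bnull\b", "None")]


def toy_translator(code: str) -> str:
    """Stand-in for an ML translator: it only strips semicolons."""
    return code.replace(";", "")


if __name__ == "__main__":
    print(hybrid_translate("x++;", toy_translator, PRE_RULES, POST_RULES))
    print(hybrid_translate("y = null;", toy_translator, PRE_RULES, POST_RULES))
```

Keeping the rules as data separate from the translator call means a learned model could replace `toy_translator` without touching the pre- and post-processing steps.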
Pages: 60-65
Number of pages: 6