On ML-Based Program Translation: Perils and Promises

被引:1
|
作者
Malyala, Aniketh [1 ]
Zhou, Katelyn [1 ]
Ray, Baishakhi [2 ]
Chakraborty, Saikat [3 ]
机构
[1] Silver Creek High Sch, San Jose, CA 95121 USA
[2] Columbia Univ, New York, NY USA
[3] Microsoft Res, Redmond, WA USA
关键词
Code generation; code translation; program transformation;
D O I
10.1109/ICSE-NIER58687.2023.00017
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
With the advent of new and advanced programming languages, it becomes imperative to migrate legacy software to new programming languages. Unsupervised Machine Learning-based Program Translation could play an essential role in such migration, even without a sufficiently sizeable reliable corpus of parallel source code. However, these translators are far from perfect due to their statistical nature. This work investigates unsupervised program translators and where and why they fail. With in-depth error analysis of such failures, we have identified that the cases where such translators fail follow a few particular patterns. With this insight, we develop a rule-based program mutation engine, which pre-processes the input code if the input follows specific patterns and post-process the output if the output follows certain patterns. We show that our code processing tool, in conjunction with the program translator, can form a hybrid program translator and significantly improve the state-of-the-art. In the future, we envision an end-to-end program translation tool where programming domain knowledge can be embedded into an ML-based translation pipeline using pre- and post-processing steps.
引用
收藏
页码:60 / 65
页数:6
相关论文
共 50 条
  • [1] ML-based Translation Methods for Protocols and Data Formats
    Tothfalusi, Tamas
    Varga, Eszter
    Csiszar, Zoltan
    Varga, Pal
    2023 19TH INTERNATIONAL CONFERENCE ON NETWORK AND SERVICE MANAGEMENT, CNSM, 2023,
  • [2] Assessing Robustness of ML-Based Program Analysis Tools using Metamorphic Program Transformations
    Applis, Leonhard
    Panichella, Annibale
    van Deursen, Arie
    2021 36TH IEEE/ACM INTERNATIONAL CONFERENCE ON AUTOMATED SOFTWARE ENGINEERING ASE 2021, 2021, : 1377 - 1381
  • [3] THE PROMISES AND PERILS OF A PRODUCE PRESCRIPTION PROGRAM: A QUALITATIVE EXPLORATION
    Schlosser, Allison V.
    Joshi, Kakul
    Smith, Samantha
    Thornton, Anna B.
    Trapl, Erika
    Bolen, Shari
    JOURNAL OF GENERAL INTERNAL MEDICINE, 2017, 32 : S349 - S349
  • [4] A New ML-based AFIB Detector
    Tudjarski, Stojancho
    Ignjatov, Tomislav
    Gusev, Marjan
    2021 29TH TELECOMMUNICATIONS FORUM (TELFOR), 2021,
  • [5] ML-Based Wildfire Prediction and Detection
    Joshi, Chiragee C.
    Payyavula, Jaya S. S. K.
    Patel, Soham
    Alginahi, Yasser M.
    2024 IEEE 3RD INTERNATIONAL CONFERENCE ON COMPUTING AND MACHINE INTELLIGENCE, ICMI 2024, 2024,
  • [6] The Challenges in ML-based Security for SDN
    Nguyen, Tam N.
    2018 2ND CYBER SECURITY IN NETWORKING CONFERENCE (CSNET), 2018,
  • [7] Promises and Perils
    DiChristina, Marietta
    SCIENTIFIC AMERICAN, 2015, 312 (06) : 4 - 4
  • [8] ML-based Demand Forecast with External Factors
    Hellmers López D.
    Julia Kramer K.
    Schmidt M.
    ZWF Zeitschrift fuer Wirtschaftlichen Fabrikbetrieb, 2023, 118 (05): : 324 - 329
  • [9] ML-Based Teaching Systems: A Conceptual Framework
    Spitzer P.
    Kühl N.
    Heinz D.
    Satzger G.
    Proceedings of the ACM on Human-Computer Interaction, 2023, 7 (CSCW2)
  • [10] ML-based Expert Products Scoring System
    Mendori, Patryk
    Pelc, Mariusz
    Kawala-Sterniuk, Aleksandra
    Gola, Mariusz
    2024 PROGRESS IN APPLIED ELECTRICAL ENGINEERING, PAEE 2024, 2024,