adaptNMT: an open-source, language-agnostic development environment for neural machine translation

被引:0
|
作者
Séamus Lankford
Haithem Afli
Andy Way
机构
[1] Dublin City University,ADAPT Centre
[2] Munster Technological University,ADAPT Centre
来源
关键词
Neural machine translation; Language technology; NMT; Natural language processing; Green NLP;
D O I
暂无
中图分类号
学科分类号
摘要
adaptNMT streamlines all processes involved in the development and deployment of RNN and Transformer neural translation models. As an open-source application, it is designed for both technical and non-technical users who work in the field of machine translation. Built upon the widely-adopted OpenNMT ecosystem, the application is particularly useful for new entrants to the field since the setup of the development environment and creation of train, validation and test splits is greatly simplified. Graphing, embedded within the application, illustrates the progress of model training, and SentencePiece is used for creating subword segmentation models. Hyperparameter customization is facilitated through an intuitive user interface, and a single-click model development approach has been implemented. Models developed by adaptNMT can be evaluated using a range of metrics, and deployed as a translation service within the application. To support eco-friendly research in the NLP space, a green report also flags the power consumption and kgCO2\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$_{2}$$\end{document} emissions generated during model development. The application is freely available (http://github.com/adaptNMT).
引用
收藏
页码:1671 / 1696
页数:25
相关论文
共 50 条
  • [1] adaptNMT: an open-source, language-agnostic development environment for neural machine translation
    Lankford, Seamus
    Afli, Haithem
    Way, Andy
    [J]. LANGUAGE RESOURCES AND EVALUATION, 2023, 57 (04) : 1671 - 1696
  • [2] DeepRelease: Language-agnostic Release Notes Generation from Pull Requests of Open-source Software
    Jiang, Huaxi
    Zhu, Jie
    Yang, Li
    Liang, Geng
    Zuo, Chun
    [J]. 2021 28TH ASIA-PACIFIC SOFTWARE ENGINEERING CONFERENCE (APSEC 2021), 2021, : 101 - 110
  • [3] A Convolutional Neural Network for Language-Agnostic Source Code Summarization
    Moore, Jessica
    Ben Gelman
    Slater, David
    [J]. PROCEEDINGS OF THE 14TH INTERNATIONAL CONFERENCE ON EVALUATION OF NOVEL APPROACHES TO SOFTWARE ENGINEERING (ENASE), 2019, : 15 - 26
  • [4] OpenNMT: Open-Source Toolkit for Neural Machine Translation
    Klein, Guillaume
    Kim, Yoon
    Deng, Yuntian
    Senellart, Jean
    Rush, Alexander M.
    [J]. PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017): SYSTEM DEMONSTRATIONS, 2017, : 67 - 72
  • [5] An Evaluation of Language-Agnostic Inner-Attention-Based Representations in Machine Translation
    Raganato, Alessandro
    Vazquez, Raul
    Creutz, Mathias
    Tiedemann, Jorg
    [J]. 4TH WORKSHOP ON REPRESENTATION LEARNING FOR NLP (REPL4NLP-2019), 2019, : 27 - 32
  • [6] Deep open-source machine translation
    Bond, Francis
    Oepen, Stephan
    Nichols, Eric
    Flickinger, Dan
    Velldal, Erik
    Haugereid, Petter
    [J]. MACHINE TRANSLATION, 2011, 25 (02) : 87 - 105
  • [7] Dolos: Language-agnostic plagiarism detection in source code
    Maertens, Rien
    Van Petegem, Charlotte
    Strijbol, Niko
    Baeyens, Toon
    Jacobs, Arne Carla
    Dawyndt, Peter
    Mesuere, Bart
    [J]. JOURNAL OF COMPUTER ASSISTED LEARNING, 2022, 38 (04) : 1046 - 1061
  • [8] Free/open-source machine translation: preface
    Sanchez-Martinez, Felipe
    Forcada, Mikel L.
    [J]. MACHINE TRANSLATION, 2011, 25 (02) : 83 - 86
  • [9] LAMASSU: A Streaming Language-Agnostic Multilingual Speech Recognition and Translation Model Using Neural Transducers
    Wang, Peidong
    Sun, Eric
    Xue, Jian
    Wu, Yu
    Zhou, Long
    Gaur, Yashesh
    Liu, Shujie
    Li, Jinyu
    [J]. INTERSPEECH 2023, 2023, : 57 - 61
  • [10] Open-source Portuguese-Spanish machine translation
    Armentano-Oller, Carme
    Carrasco, Rafael C.
    Corbi-Bellot, Antonio M.
    Forcada, Mikel L.
    Ginesti-Rosell, Mireia
    Ortiz-Rojas, Sergio
    Perez-Ortiz, Juan Antonio
    Ramirez-Sanchez, Gema
    Sanchez-Martinez, Felipe
    Scalco, Miriam A.
    [J]. COMPUTATIONAL PROCESSING OF THE PORTUGUESE LANGUAGE, PROCEEDINGS, 2006, 3960 : 50 - 59