NATURALCC: An Open-Source Toolkit for Code Intelligence

被引:0
|
作者
Wan, Yao [1 ]
He, Yang [2 ]
Bi, Zhangqian [1 ]
Zhang, Jianguo [3 ]
Sui, Yulei [2 ]
Zhang, Hongyu [4 ]
Hashimoto, Kazuma [5 ]
Jin, Hai [1 ]
Xu, Guandong [2 ]
Xiong, Caiming [6 ]
Yu, Philip S. [3 ]
机构
[1] Huazhong Univ Sci & Technol, Natl Engn Res Ctr Big Data Technol & Syst, Cluster & Grid Comp Lab, Serv Comp Technol & Syst Lab,Sch Comp Sci & Techn, Wuhan, Peoples R China
[2] Univ Technol Sydney, Sydney, NSW, Australia
[3] Univ Illinois, Chicago, IL 60680 USA
[4] Univ Newcastle, Callaghan, NSW, Australia
[5] Google Res, Mountain View, CA USA
[6] Salesforce Res, Palo Alto, CA USA
基金
中国国家自然科学基金;
关键词
Code intelligence; deep learning; code representation; code embedding; open source; toolkit; benchmark;
D O I
10.1145/3510454.3516863
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
We present NATURALCC, an efficient and extensible open-source toolkit for machine-learning-based source code analysis (i.e., code intelligence). Using NATURALCC, researchers can conduct rapid prototyping, reproduce state-of-the-art models, and/or exercise their own algorithms. NATURALCC is built upon Fairseq and PyTorch, providing (1) a collection of code corpus with preprocessing scripts, (2) a modular and extensible framework that makes it easy to reproduce and implement a code intelligence model, and (3) a benchmark of state-of-the-art models. Furthermore, we demonstrate the usability of our toolkit over a variety of tasks (e.g., code summarization, code retrieval, and code completion) through a graphical user interface. The website of this project is http://xcodemind.github.io, where the source code and demonstration video can be found.
引用
收藏
页码:149 / 153
页数:5
相关论文
共 50 条
  • [41] SymCog: An open-source toolkit for assessing human symbolic cognition
    Maurice Flurie
    Alexandra Kelly
    Ingrid R. Olson
    Jamie Reilly
    [J]. Behavior Research Methods, 2023, 55 : 807 - 823
  • [42] Vulnerability modellers toolkit, an open-source platform for vulnerability analysis
    Luís Martins
    Vítor Silva
    Helen Crowley
    Francesco Cavalieri
    [J]. Bulletin of Earthquake Engineering, 2021, 19 : 5691 - 5709
  • [43] An open-source phase correction toolkit for transcranial focused ultrasound
    Changzhu Jin
    David Moore
    John Snell
    Dong-Guk Paeng
    [J]. BMC Biomedical Engineering, 2 (1):
  • [44] CRSLab: An Open-Source Toolkit for Building Conversational Recommender System
    Zhou, Kun
    Wang, Xiaolei
    Zhou, Yuanhang
    Shang, Chenzhan
    Cheng, Yuan
    Zhao, Wayne Xin
    Li, Yaliang
    Wen, Ji-Ronge
    [J]. ACL-IJCNLP 2021: THE JOINT CONFERENCE OF THE 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING: PROCEEDINGS OF THE SYSTEM DEMONSTRATIONS, 2021, : 185 - 193
  • [45] Development of the open-source dose calculation and optimization toolkit matRad
    Wieser, Hans-Peter
    Cisternas, Eduardo
    Wahl, Niklas
    Ulrich, Silke
    Stadler, Alexander
    Mescher, Henning
    Mueller, Lucas-Raphael
    Klinge, Thomas
    Gabrys, Hubert
    Burigo, Lucas
    Mairani, Andrea
    Ecker, Swantje
    Ackermann, Benjamin
    Ellerbrock, Malte
    Parodi, Katia
    Jaekel, Oliver
    Bangert, Mark
    [J]. MEDICAL PHYSICS, 2017, 44 (06) : 2556 - 2568
  • [46] Methods and open-source toolkit for analyzing and visualizing challenge results
    Manuel Wiesenfarth
    Annika Reinke
    Bennett A. Landman
    Matthias Eisenmann
    Laura Aguilera Saiz
    M. Jorge Cardoso
    Lena Maier-Hein
    Annette Kopp-Schneider
    [J]. Scientific Reports, 11 (1)
  • [47] An open-source toolkit to assist authors and collaborators during manuscript preparation: AuthorAndCollaborator toolkit
    Matava, Clyde T.
    Williams, R. J.
    Simpao, Allan F.
    [J]. CANADIAN JOURNAL OF ANESTHESIA-JOURNAL CANADIEN D ANESTHESIE, 2022, 69 (05): : 680 - 681
  • [48] An open-source toolkit to assist authors and collaborators during manuscript preparation: AuthorAndCollaborator toolkit
    Clyde T. Matava
    R. J. Williams
    Allan F. Simpao
    [J]. Canadian Journal of Anesthesia/Journal canadien d'anesthésie, 2022, 69 : 680 - 681
  • [49] OpenHGNN: An Open-Source Toolkit for Heterogeneous Graph Neural Networks
    Han, Hui
    Zhao, Tianyu
    Yang, Cheng
    Zhang, Hongyi
    Liu, Yaoqi
    Wang, Xiao
    Shi, Chuan
    [J]. PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022, 2022, : 3993 - 3997
  • [50] Motmot, an open-source toolkit for realtime video acquisition and analysis
    Straw, Andrew D.
    Dickinson, Michael H.
    [J]. SOURCE CODE FOR BIOLOGY AND MEDICINE, 2009, 4 (01):