Concept Extraction: A Modular Approach to Extraction of Source Code Concepts

Cited by: 1
Authors
Chaturvedi, Ritu [1]
Brar, Veerpal [2]
Geelal, Jai [2]
Kong, Kelvin [2]
Affiliations
[1] Univ Guelph, Guelph, ON, Canada
[2] Univ Toronto, Toronto, ON, Canada
DOI
10.1109/Cybermatics_2018.2018.00309
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Code examples have long been among the most sought-after resources for understanding and mastering programming concepts. Research shows that extracting knowledge from such examples in an online tutoring system is a challenging task. Current methods rely either on regular expressions that must be tailor-made for the input language or on generating an abstract syntax tree (AST) for the input program. In this paper, we extend existing implementations in code recommendation software with a novel keyword-based search tree (k-BST) method. k-BST extracts keywords from the input, matches them with relevant coding examples by k-means clustering, and recommends those examples back to the user. k-BSTs also address several major issues that modern knowledge extraction software often runs into, such as ease of use, extensibility to other domains, and run time. To that end, k-BSTs support ease of use through popular, recognizable file formats such as CSV, while keeping the run time of keyword extraction extremely low compared with the more popular AST-based method.
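
The general idea of keyword-driven concept extraction without an AST can be illustrated with a minimal Python sketch. It is not the paper's k-BST implementation: the CSV layout (keyword,concept), the sample keyword table, and the function names load_keywords, extract_concepts, and recommend are all hypothetical, and the matching step uses plain cosine similarity as a stand-in for the k-means clustering named in the abstract.

```python
import csv
import io
import re
from collections import Counter

# Hypothetical keyword table; the (keyword, concept) CSV layout is an assumption.
KEYWORD_CSV = """keyword,concept
for,loop
while,loop
if,conditional
else,conditional
def,function
return,function
"""


def load_keywords(csv_text):
    """Read keyword -> concept pairs from CSV text."""
    reader = csv.DictReader(io.StringIO(csv_text))
    return {row["keyword"]: row["concept"] for row in reader}


def extract_concepts(source, keywords):
    """Count concepts by scanning identifier-like tokens; no AST is built."""
    tokens = re.findall(r"[A-Za-z_]\w*", source)
    return Counter(keywords[t] for t in tokens if t in keywords)


def cosine(a, b):
    """Cosine similarity between two concept-count vectors."""
    dot = sum(a[k] * b[k] for k in a)
    norm = (sum(v * v for v in a.values()) * sum(v * v for v in b.values())) ** 0.5
    return dot / norm if norm else 0.0


def recommend(query, examples, keywords):
    """Return the name of the stored example most similar to the query.

    Stand-in for the clustering step: nearest match by cosine similarity.
    """
    q = extract_concepts(query, keywords)
    return max(
        examples,
        key=lambda name: cosine(q, extract_concepts(examples[name], keywords)),
    )
```

Because the keyword table lives in CSV, supporting another language would, under these assumptions, only require a different table. A plain token scan also counts keywords that appear inside comments and string literals, which a fuller implementation would need to handle.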
Pages: 1860-1866
Page count: 7