Concept extraction from business documents for software engineering projects

被引:0
|
作者
Pierre André Ménard
Sylvie Ratté
机构
[1] École de technologie supérieure,
来源
关键词
Automated extraction; Conceptual model; Domain model; Relevance evaluation; Software project; Knowledge modeling;
D O I
暂无
中图分类号
学科分类号
摘要
Acquiring relevant business concepts is a crucial first step for any software project for which the software experts are not domain experts. The wealth of information buried within an organization’s written documentation is a precious source of concepts, relationships and attributes which can be used to model the enterprise’s domain. The lack of targeted extraction tools can make perusing through this type of resource a lengthy and costly process. We propose a domain model focused extraction process aimed at the rapid discovery of knowledge relevant to the software expert. To avoid undesirable noise from high-level linguistic tools, the process is mainly composed of positive and negative base filters that are less error prone and more robust. The extracted candidates are then reordered using a weight propagation algorithm based on structural hints from source documents. When tested on French text corpora from public organizations, our process performs 2.7 times better than a statistical baseline for relevant concept discovery. A new metric to assess the performance discovery speed of relevant concepts is introduced. The annotation of a gold standard definition of software engineering oriented concepts for knowledge extraction tasks is also presented.
引用
收藏
页码:649 / 686
页数:37
相关论文
共 50 条
  • [1] Concept extraction from business documents for software engineering projects
    Menard, Pierre Andre
    Ratte, Sylvie
    AUTOMATED SOFTWARE ENGINEERING, 2016, 23 (04) : 649 - 686
  • [3] MODELING SOFTWARE ENGINEERING PROJECTS AS A BUSINESS: A BUSINESS INTELLIGENCE PERSPECTIVE
    Hans, Robert T.
    Mnkandla, Ernest
    AFRICON, 2013, 2013, : 1172 - 1176
  • [4] Feature Extraction from Japanese Natural Language Requirements Documents for Software Product Line Engineering
    Hisazumi, Kenji
    Xiao, Yuedong
    Fukuda, Akira
    2019 COMPANION OF THE 19TH IEEE INTERNATIONAL CONFERENCE ON SOFTWARE QUALITY, RELIABILITY AND SECURITY (QRS-C 2019), 2019, : 322 - 329
  • [5] Information extraction from documents for automating software testing
    Lutsky, P
    ARTIFICIAL INTELLIGENCE IN ENGINEERING, 2000, 14 (01): : 63 - 69
  • [6] Logical Structure Extraction from Software Requirements Documents
    Rauf, Rehan
    Antkiewicz, Michal
    Czarnecki, Krzysztof
    2011 19TH IEEE INTERNATIONAL REQUIREMENTS ENGINEERING CONFERENCE (RE), 2011, : 101 - 110
  • [7] Concept Extraction from Medical Documents A Contextual Approach
    Szenasi, Gyorgy
    Lemnaru, Camelia
    Barbantan, Ioana
    2015 IEEE 11TH INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTER COMMUNICATION AND PROCESSING (ICCP), 2015, : 13 - 17
  • [8] On Software Projects in Academia and Industry from a Perspective of Software Engineering Education
    Kamthan, Pankaj
    Shahmir, Nazlie
    2019 6TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE (CSCI 2019), 2019, : 34 - 39
  • [9] Software engineering as a business
    Miller, A
    Ebert, C
    IEEE SOFTWARE, 2002, 19 (06) : 18 - 20
  • [10] SOFTWARE ENGINEERING IN BUSINESS
    DENERT, E
    INFORMATION PROCESSING '94, VOL III: LINKAGE AND DEVELOPING COUNTRIES, 1994, 53 : 318 - 325