A text categorisation tool for open source communities based on semantic analysis

被引:12
|
作者
Martinez-Torres, M. R. [1 ]
Toral, S. L. [2 ]
Barrero, F. J. [2 ]
Gregor, D. [2 ]
机构
[1] Univ Seville, Dept Adm Empresas & Comercializac & Invest Mercad, Seville, Spain
[2] Univ Seville, Dept Ingn Elect, Seville, Spain
关键词
semantic analysis; text categorisation; open source; virtual communities; OPEN SOURCE PROJECTS; VIRTUAL COMMUNITIES; ONLINE COMMUNITIES; KNOWLEDGE; MODEL; DETERMINANTS; TECHNOLOGY; INTERNET; SUCCESS;
D O I
10.1080/0144929X.2011.624634
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Open source software (OSS) projects are supported by communities interacting through software repositories and mailing lists. Thousands of contributors participate in the development of the projects although they rarely meet each other. The result is a huge archived repository with thousands of questions, answers and contributions usually difficult to explore. We propose a tool based on semantic analysis for both performing an automatic knowledge discovery and a categorisation of the content of mailing lists repositories. Semantic analysis is a practical method for extracting and inferring relations of words in passages of discourse, producing measures of relations among words or passages that are well correlated with semantic similarity. The objective of this article is two-fold: (1) to develop a text categorisation tool based on indexing terms and semantic annotation, and (2) to apply the developed tool to extract the main dimensions related to knowledge sharing activities in virtual communities. Debian Linux ports to embedded processors are used as a case study to accomplish the proposed double objective.
引用
收藏
页码:532 / 544
页数:13
相关论文
共 50 条
  • [1] LAGOON: An Analysis Tool for Open Source Communities
    Dey, Sourya
    Woods, Walt
    [J]. 2022 MINING SOFTWARE REPOSITORIES CONFERENCE (MSR 2022), 2022, : 717 - 721
  • [2] Public attitudes on open source communities in China: A text mining analysis
    Hou, Shengjie
    Zhang, Xiang
    Yi, Biyi
    Tang, Yi
    [J]. TECHNOLOGY IN SOCIETY, 2022, 71
  • [3] TACIT: An open-source text analysis, crawling, and interpretation tool
    Morteza Dehghani
    Kate M. Johnson
    Justin Garten
    Reihane Boghrati
    Joe Hoover
    Vijayan Balasubramanian
    Anurag Singh
    Yuvarani Shankar
    Linda Pulickal
    Aswin Rajkumar
    Niki Jitendra Parmar
    [J]. Behavior Research Methods, 2017, 49 : 538 - 547
  • [4] TACIT: An open-source text analysis, crawling, and interpretation tool
    Dehghani, Morteza
    Johnson, Kate M.
    Garten, Justin
    Boghrati, Reihane
    Hoover, Joe
    Balasubramanian, Vijayan
    Singh, Anurag
    Shankar, Yuvarani
    Pulickal, Linda
    Rajkumar, Aswin
    Parmar, Niki Jitendra
    [J]. BEHAVIOR RESEARCH METHODS, 2017, 49 (02) : 538 - 547
  • [5] Social network analysis of open source software: A review and categorisation
    McClean, Kelvin
    Greer, Des
    Jurek-Loughrey, Anna
    [J]. INFORMATION AND SOFTWARE TECHNOLOGY, 2021, 130
  • [6] Text Similarity Based on Semantic Analysis
    Wang, Junli
    Zhou, Qing
    Sun, Guobao
    [J]. PROCEEDINGS OF THE 2016 2ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND INDUSTRIAL ENGINEERING (AIIE 2016), 2016, 133 : 303 - 307
  • [7] Text manifold based on semantic analysis
    Yang, Zhen
    Fan, Ke-Feng
    Lei, Jian-Jun
    Guo, Jun
    [J]. Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2009, 37 (03): : 557 - 561
  • [8] ABSA Toolkit: An Open Source Tool for Aspect Based Sentiment Analysis
    Nasim, Zarmeen
    Haider, Sajjad
    [J]. INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2017, 26 (06)
  • [9] MedXN: an open source medication extraction and normalization tool for clinical text
    Sohn, Sunghwan
    Clark, Cheryl
    Halgrim, Scott R.
    Murphy, Sean P.
    Chute, Christopher G.
    Liu, Hongfang
    [J]. JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2014, 21 (05) : 858 - 865
  • [10] SciBrowser: A Computational Ethnography Tool to Explore Open Source Science Communities
    Arnold, Michael
    Shenviwagle, Damodar
    Yilmaz, Levent
    [J]. PROCEEDINGS OF THE 48TH ANNUAL SOUTHEAST REGIONAL CONFERENCE (ACM SE 10), 2010, : 135 - 140