Textual data mining for industrial knowledge management and text classification: A business oriented approach

Cited by: 88
Authors
Ur-Rahman, N. [1]
Harding, J. A. [1]
Affiliations
[1] Univ Loughborough, Wolfson Sch Mech & Mfg Engn, Loughborough LE11 3TU, Leics, England
Keywords
Textual data mining; Text mining; Post Project Reviews
DOI
10.1016/j.eswa.2011.09.124
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Textual databases are useful sources of information and knowledge, and if these are well utilised then issues related to future project management and product or service quality improvement may be resolved. A large part of corporate information, approximately 80%, is available in textual data formats. Text classification techniques are well known for managing on-line sources of digital documents. The identification of key issues discussed within textual data and their classification into two different classes could help decision makers or knowledge workers to manage their future activities better. This research is relevant for most text-based documents and is demonstrated on Post Project Reviews (PPRs), which are a valuable source of information and knowledge. The application of textual data mining techniques for discovering useful knowledge and classifying textual data into different classes is a relatively new area of research. The research work presented in this paper focuses on hybrid applications of text mining (textual data mining) techniques to classify textual data into two different classes. The research applies clustering techniques at the first stage and Apriori Association Rule Mining at the second stage. Apriori Association Rule Mining is applied to generate Multiple Key Term Phrasal Knowledge Sequences (MKTPKS), which are later used for classification. Additionally, studies were made to improve the classification accuracies of the classifiers, i.e., C4.5, K-NN, Naive Bayes and Support Vector Machines (SVMs). The classification accuracies were measured and the results compared with those of a single-term-based classification model. The proposed methodology could be used to analyse any free formatted textual data, and in the current research it has been demonstrated on an industrial dataset consisting of Post Project Reviews (PPRs) collected from the construction industry.
The data or information available in these reviews is codified in multiple formats, but in the current research scenario only free formatted text documents are examined. Experiments showed that the performance of the classifiers improved through adopting the proposed methodology. © 2011 Elsevier Ltd. All rights reserved.
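The second stage described in the abstract, Apriori mining of terms that frequently occur together, can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `apriori_frequent_termsets`, the toy documents, and the support threshold are all assumptions made for illustration, and the clustering and classifier stages are omitted. The multi-term sets it returns are only loosely analogous to the MKTPKS features named above.

```python
from itertools import combinations


def apriori_frequent_termsets(transactions, min_support):
    """Level-wise Apriori: return {frozenset(terms): support_count} for every
    term set that appears in at least `min_support` documents.
    (Hypothetical helper, written for illustration only.)"""
    transactions = [frozenset(t) for t in transactions]

    # Level 1: count how many documents contain each single term.
    counts = {}
    for doc in transactions:
        for term in doc:
            key = frozenset([term])
            counts[key] = counts.get(key, 0) + 1
    current = {s: c for s, c in counts.items() if c >= min_support}

    frequent = dict(current)
    k = 2
    while current:
        # Join step: merge frequent (k-1)-sets into candidate k-sets.
        candidates = set()
        for a, b in combinations(list(current), 2):
            union = a | b
            if len(union) != k:
                continue
            # Prune step: every (k-1)-subset of a candidate must be frequent.
            if all(frozenset(sub) in current
                   for sub in combinations(union, k - 1)):
                candidates.add(union)

        # Count step: keep candidates that meet the support threshold.
        current = {}
        for cand in candidates:
            support = sum(1 for doc in transactions if cand <= doc)
            if support >= min_support:
                current[cand] = support
        frequent.update(current)
        k += 1
    return frequent


# Toy "documents", each reduced to a set of key terms (invented data).
toy_docs = [
    {"cost", "overrun", "delay"},
    {"cost", "overrun", "client"},
    {"cost", "delay", "client"},
    {"cost", "overrun", "delay", "client"},
]

termsets = apriori_frequent_termsets(toy_docs, min_support=3)

# Keep only multi-term sets, which could serve as candidate phrase features.
multi_term = {s: c for s, c in termsets.items() if len(s) > 1}
for terms, support in sorted(multi_term.items(), key=lambda kv: sorted(kv[0])):
    print(sorted(terms), support)
```

In a pipeline of the kind the abstract outlines, each document could then be encoded by which of these multi-term sets it contains, and that encoding passed to a classifier in place of single-term features.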
Pages: 4729 - 4739
Number of pages: 11
Related papers
  • [1] A knowledge management approach to data mining process for business intelligence
    Wang, Hai
    Wang, Shouhong
    [J]. INDUSTRIAL MANAGEMENT & DATA SYSTEMS, 2008, 108 (5-6) : 622 - 634
  • [2] Data and text mining: A business applications approach.
    Ford, JM
    [J]. PERSONNEL PSYCHOLOGY, 2005, 58 (01) : 267 - 271
  • [3] BUSINESS DEMANDS FOR PROCESSING UNSTRUCTURED TEXTUAL DATA - TEXT MINING TECHNIQUES FOR COMPANIES TO IMPLEMENT
    Zhecheva, Denitsa
    Nenkov, Nayden
    [J]. ACCESS-ACCESS TO SCIENCE BUSINESS INNOVATION IN THE DIGITAL ECONOMY, 2022, 3 (02) : 107 - 120
  • [4] Business environmental analysis for textual data using data mining and sentence-level classification
    Kim, Yoon-Sung
    Rim, Hae-Chang
    Lee, Do-Gil
    [J]. INDUSTRIAL MANAGEMENT & DATA SYSTEMS, 2019, 119 (01) : 69 - 88
  • [5] Text Associative Classification Approach for Mining Arabic Data Set
    Ghareb, Abdullah S.
    Hamdan, Abdul Razak
    Abu Bakar, Azuraliza
    [J]. 2012 4TH CONFERENCE ON DATA MINING AND OPTIMIZATION (DMO), 2012, : 114 - 120
  • [6] A Novel Data Mining Approach for Multi Variant Text Classification
    Dsouza, Kevin Joy
    Ansari, Zaheed Ahmed
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON CLOUD COMPUTING IN EMERGING MARKETS (CCEM), 2016, : 68 - 73
  • [7] Mining causality knowledge from textual data
    Pechsiri, C
    Kawtrakul, A
    Piriyakul, R
    [J]. PROCEEDINGS OF THE IASTED INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND APPLICATIONS, 2006, : 85 - +
  • [8] Mining explanation knowledge from textual data
    Pechsiri, Chaveevan
    Kawtrakul, Asance
    Piriyakul, Rapepun
    [J]. PROCEEDINGS OF THE IASTED INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTER SCIENCE AND TECHNOLOGY, 2006, : 322 - +
  • [9] A technology of text classification of data mining
    Yang, Bin
    Meng, Zhi-qing
    [J]. Xiangtan Daxue Ziran Kexue Xuebao, 2001, 23 (04) : 34 - 37
  • [10] Knowledge Management And Data Mining: Emerging Business Intelligence Research Subspecialties
    Eom, Sean
    [J]. DSS 2.0 - SUPPORTING DECISION MAKING WITH NEW TECHNOLOGIES, 2014, 261 : 353 - 362