Concept-based text mining technique for semantic classification of manufacturing suppliers

被引:1
|
作者
Shotorbani P.Y. [1 ]
Ameri F. [1 ]
机构
[1] Engineering Informatics Research Group, Texas State Univ., San Marcos, 78666, TX
来源
Ameri, F. (ameri@txstate.edu) | 2017年 / ASTM International卷 / 01期
关键词
Manufacture - Semantics - Data mining - International trade - Natural language processing systems - Text processing;
D O I
10.1520/SSMS20160005
中图分类号
学科分类号
摘要
Small-to-medium sized enterprises (SMEs) in the manufacturing sector are increasingly strengthening their web presence in order to improve their visibility and remain competitive in the global market. With the explosive growth of unstructured content on the Web, more advanced methods for information organization and retrieval are needed to improve the intelligence and efficiency of the supplier discovery and evaluation process. In this paper, a technique for automated characterization and classification of manufacturing suppliers based on their textual portfolios was presented. A probabilistic technique that adopts Naïve Bayes method was used as the underlying mathematical model of the proposed text classifier. To improve the semantic relevance of the results, classification was conducted at the conceptual level rather than at the term level that is typically used by conventional text classifiers. The necessary steps for training data preparation and representation related to manufacturing supplier classification problem are delineated. The proposed classifier is capable of forming both simple and complex classes of manufacturing SMEs based on their advertised capabilities. The performance of the proposed classifier wass evaluated experimentally based on the standard metrics used in information retrieval such as precision, recall, and F-measure. It was concluded that the proposed concept-based classification technique outperforms the traditional term-based methods with respect to accuracy, robustness, and cost. Copyright © 2017 by ASTM International, 100 Barr Harbor Drive, P.O. Box C700, West Conshohocken, PA 19428-2959
引用
收藏
相关论文
共 50 条
  • [41] C-HTS: A Concept-based Hierarchical Text Segmentation approach
    Bayomi, Mostafa
    Lawless, Seamus
    PROCEEDINGS OF THE ELEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2018), 2018, : 1519 - 1528
  • [42] Predicting software defect type using concept-based classification
    Sangameshwar Patil
    B. Ravindran
    Empirical Software Engineering, 2020, 25 : 1341 - 1378
  • [43] Concept-Based Document Classification Using Wikipedia and Value Function
    Malo, Pekka
    Sinha, Ankur
    Wallenius, Jyrki
    Korhonen, Pekka
    JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2011, 62 (12): : 2496 - 2511
  • [44] An efficient concept-based retrieval model for enhancing text retrieval quality
    Shady Shehata
    Fakhri Karray
    Mohamed S. Kamel
    Knowledge and Information Systems, 2013, 35 : 411 - 434
  • [45] Microsoft Concept Graph: Mining Semantic Concepts for Short Text Understanding
    Ji, Lei
    Wang, Yujing
    Shi, Botian
    Zhang, Dawei
    Wang, Zhongyuan
    Yan, Jun
    DATA INTELLIGENCE, 2019, 1 (03) : 238 - 270
  • [46] Microsoft Concept Graph: Mining Semantic Concepts for Short Text Understanding
    Lei Ji
    Yujing Wang
    Botian Shi
    Dawei Zhang
    Zhongyuan Wang
    Jun Yan
    Data Intelligence, 2019, 1 (03) : 262 - 294
  • [47] Predicting software defect type using concept-based classification
    Patil, Sangameshwar
    Ravindran, B.
    EMPIRICAL SOFTWARE ENGINEERING, 2020, 25 (02) : 1341 - 1378
  • [48] Mining patterns of transitional growth using multivariate concept-based models
    Bartak J.
    Jastrzębska A.
    Quality & Quantity, 2022, 56 (6) : 4395 - 4419
  • [49] A technique for the concept-based detection of functional modules in an interaction network
    Park, Jong-Min
    Yang, Hyung-Jeong
    Yang, Jae-Dong
    Choi, Dong-Hoon
    INFORMATION PROCESSING LETTERS, 2016, 116 (10) : 611 - 617
  • [50] Enhancing information retrieval through concept-based language modeling and semantic smoothing
    Lhadj, Lynda Said
    Boughanem, Mohand
    Amrouche, Karima
    JOURNAL OF THE ASSOCIATION FOR INFORMATION SCIENCE AND TECHNOLOGY, 2016, 67 (12) : 2909 - 2927