Natural language processing for the Turkish Academic texts in the engineering field and development of a decision support system: the case of TUBITAK project proposals

被引:2
|
作者
Kat, Bora [1 ]
机构
[1] Sci & Technol Res Council Turkiye TUBITAK, TR-06530 Ankara, Turkiye
关键词
Key term extraction; Feature extraction; Natural language processing; Supervised machine learning; Na?ve Bayes classifier; Conceptual similarity; Decision support system; IMPACT;
D O I
10.17341/gazimmfd.1132053
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Purpose:This study proposes a decision support system (as illustrated in Figure A) based on NLP applications and machine learning algorithm. Three modules (key term extraction, similarity detection and subfield assignment) are developed that would automatically index academic engineering documents, calculate their conceptual similarities and assign them to the most appropriate subfield over 31 subfields. Theory and Methods:Tailored preprocessing procedures are applied to the texts and the initial key terms are extracted. After a post-processing step, final versions of the term-frequency vectors are obtained. These vectors are used in the proposed similarity detection algorithm and as an input to the Naive Bayes classifiers.Results: The proposals submitted to TUBITAK Academic Research Funding Program Directorate (ARDEB) are analyzed as a case study. The results indicate that the proposed similarity algorithm correctly detects almost all of the revised proposals while the accuracy of the Naive Bayes classifier is more than 80% over a sample of 1255 proposals. The accuracy level exceeds 95% based on the best three predictions.Conclusion: NLP studies conducted in this study and the proposed algorithms are the first attempt to classify Turkish academic texts. Current study focuses on engineering; further studies on classifying other disciplines are needed. Moreover, the success of the machine learning in classification would pave the way for other applications such as reviewer identification.
引用
收藏
页码:1879 / 1892
页数:14
相关论文
共 14 条
  • [1] Natural Language Processing for the Turkish Academic Texts in the Engineering Field: Key-Term Extraction, Similarity Detection, Subject/Topic Assignment
    Kat, Bora
    ARTIFICIAL INTELLIGENCE APPLICATIONS AND INNOVATIONS, AIAI 2023, PT II, 2023, 676 : 411 - 424
  • [2] A decision support system for agriculture using natural language processing (ADSS)
    Prasad, J. R.
    Prasad, R. S.
    Kulkarni, U. V.
    IMECS 2008: INTERNATIONAL MULTICONFERENCE OF ENGINEERS AND COMPUTER SCIENTISTS, VOLS I AND II, 2008, : 365 - +
  • [3] Earthquake management: a decision support system based on natural language processing
    Fersini, E.
    Messina, E.
    Pozzi, F. A.
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2017, 8 (01) : 37 - 45
  • [4] Earthquake management: a decision support system based on natural language processing
    E. Fersini
    E. Messina
    F. A. Pozzi
    Journal of Ambient Intelligence and Humanized Computing, 2017, 8 : 37 - 45
  • [5] TEMPORAL SEMANTICS AND NATURAL-LANGUAGE PROCESSING IN A DECISION SUPPORT SYSTEM
    DE, SJ
    PAN, SS
    WHINSTON, A
    INFORMATION SYSTEMS, 1987, 12 (01) : 29 - 47
  • [6] Natural Language Processing in a Clinical Decision Support System for the Identification of Venous Thromboembolism: Algorithm Development and Validation
    Jin, Zhi-Geng
    Zhang, Hui
    Tai, Mei-Hui
    Yang, Ying
    Yao, Yuan
    Guo, Yu-Tao
    JOURNAL OF MEDICAL INTERNET RESEARCH, 2023, 25
  • [7] A NATURAL-LANGUAGE PROCESSING BASED GROUP DECISION-SUPPORT SYSTEM
    CONLON, SP
    REITHEL, BJ
    AIKEN, MW
    SHIRANI, AI
    DECISION SUPPORT SYSTEMS, 1994, 12 (03) : 181 - 188
  • [8] Intelligent decision support system for CV evaluation based on natural language processing
    Alfawareh, Hejab
    Jusoh, Shaidah
    INTERNATIONAL JOURNAL OF ADVANCED AND APPLIED SCIENCES, 2019, 6 (04): : 1 - 8
  • [9] A Language Independent Decision Support System for Diagnosis and Treatment by Using Natural Language Processing Techniques
    Gokgol, Merve Kevser
    Orhan, Zeynep
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON MEDICAL AND BIOLOGICAL ENGINEERING, CMBEBIH 2019, 2020, 73 : 721 - 728
  • [10] Enhancing Optimized Personalized Therapy in Clinical Decision Support System using Natural Language Processing
    Hiremath, Basavaraj N.
    Patil, Malini M.
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2022, 34 (06) : 2840 - 2848