Text Mining and Analysis of Treatise on Febrile Diseases Based on Natural Language Processing

Authors
Kai Zhao [1]
Na Shi [1]
Zhen Sa [1]
Hua-Xing Wang [1]
Chun-Hua Lu [2]
Xiao-Ying Xu [1]
Affiliations
[1] School of Traditional Chinese Medicine, Beijing University of Chinese Medicine
[2] School of Life Science, Beijing University of Chinese Medicine
Keywords
Knowledge discovery; natural language processing; text mining; traditional Chinese medicine literature; Treatise on Febrile Diseases
DOI
Not available
Chinese Library Classification (CLC) number
R441.3 [Fever]; TP391.1 [Text information processing]
Discipline classification codes
081203; 0835; 100208
Abstract
Objective: This paper applies natural language processing (NLP) to the text of the "Treatise on Febrile Diseases" (TFDs) to extract important information, as an attempt to bring NLP to text mining of traditional Chinese medicine (TCM) literature. Materials and Methods: The experiment was implemented in Python using NLP toolkits including Jieba, nltk, gensim, and sklearn, supplemented by Excel and Word. The text of TFDs was cleaned, segmented, and stripped of stop words in sequence; word frequency statistics and analysis, keyword extraction, named entity recognition (NER), and other operations were then performed; finally, text similarity was calculated. Results: Jieba accurately identified the herbal names in TFDs. Word frequency statistics based on the segmentation indicated that "warm therapy" is an important treatment in TFDs and that Guizhi decoction is the main prescription; five core decoctions were identified. Keyword extraction based on the term frequency-inverse document frequency (TF-IDF) algorithm performed well. The accuracy of NER on TFDs was about 86%. With similarity calculated by a latent semantic indexing (LSI) model, "Understanding of Synopsis of Golden Chamber (SGC)" was much more similar to "SGC" than to TFDs. The results met expectations. Conclusions: This work lays a research foundation for applying NLP to text mining of unstructured TCM literature. Combined with deep learning, NLP, as an important branch of artificial intelligence, will have broader application prospects in text mining of TCM literature, the construction of TCM knowledge graphs, and TCM knowledge services.
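The steps named in the abstract (stop-word removal, word frequency statistics, TF-IDF keyword extraction, and text similarity) can be sketched roughly as follows. This is a minimal standard-library illustration, not the authors' code: the documents are toy, pre-segmented token lists invented for the example (the paper segments the Chinese text with Jieba), the stop-word list is hypothetical, and similarity is computed as plain cosine similarity over TF-IDF vectors rather than with the LSI model reported in the paper.

```python
import math
from collections import Counter

# Toy, already-segmented documents (illustrative only, NOT taken from TFDs).
# In the paper, segmentation is done with Jieba on the Chinese source text.
DOCS = {
    "doc_a": ["guizhi", "decoction", "warm", "therapy", "the", "guizhi"],
    "doc_b": ["guizhi", "decoction", "sweat", "the", "cold"],
    "doc_c": ["pulse", "floating", "the", "cold", "cold"],
}
STOP_WORDS = {"the"}  # hypothetical stop-word list


def remove_stop_words(tokens):
    """Drop tokens that appear in the stop-word list."""
    return [t for t in tokens if t not in STOP_WORDS]


def word_frequencies(docs):
    """Corpus-wide token counts after stop-word removal."""
    counts = Counter()
    for tokens in docs.values():
        counts.update(remove_stop_words(tokens))
    return counts


def tfidf(docs):
    """Return {doc_id: {term: weight}} with weight = tf * log(N / df)."""
    cleaned = {d: remove_stop_words(t) for d, t in docs.items()}
    n_docs = len(cleaned)
    df = Counter()  # number of documents containing each term
    for tokens in cleaned.values():
        df.update(set(tokens))
    weights = {}
    for doc_id, tokens in cleaned.items():
        tf = Counter(tokens)
        total = len(tokens)
        weights[doc_id] = {
            term: (count / total) * math.log(n_docs / df[term])
            for term, count in tf.items()
        }
    return weights


def top_keywords(weights, doc_id, k=2):
    """Top-k terms of one document by TF-IDF weight (ties broken by term)."""
    ranked = sorted(weights[doc_id].items(), key=lambda kv: (-kv[1], kv[0]))
    return [term for term, _ in ranked[:k]]


def cosine(u, v):
    """Cosine similarity between two sparse {term: weight} vectors."""
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    norm_u = math.sqrt(sum(x * x for x in u.values()))
    norm_v = math.sqrt(sum(x * x for x in v.values()))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


if __name__ == "__main__":
    w = tfidf(DOCS)
    print(word_frequencies(DOCS).most_common(3))
    print(top_keywords(w, "doc_a"))
    print(cosine(w["doc_a"], w["doc_b"]), cosine(w["doc_a"], w["doc_c"]))
```

In this toy corpus, doc_a and doc_b share prescription terms while doc_a and doc_c share none, so the first similarity is larger than the second. An LSI model such as the one the abstract mentions would additionally project these vectors into a lower-dimensional latent space before comparing them.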
Pages: 67-73
Number of pages: 7