Text Mining and Analysis of Treatise on Febrile Diseases Based on Natural Language Processing

Authors
Kai Zhao [1]
Na Shi [1]
Zhen Sa [1]
Hua-Xing Wang [1]
Chun-Hua Lu [2]
Xiao-Ying Xu [1]
Affiliations
[1] School of Traditional Chinese Medicine, Beijing University of Chinese Medicine
[2] School of Life Science, Beijing University of Chinese Medicine
Keywords
Knowledge discovery; natural language processing; text mining; traditional Chinese medicine literature; Treatise on Febrile Diseases
DOI
Not available
Chinese Library Classification (CLC) number
R441.3 [Fever]; TP391.1 [Text information processing]
Subject classification code
081203; 0835; 100208
Abstract
Objective: This paper applies natural language processing (NLP) to text mining of traditional Chinese medicine (TCM) literature, analyzing and processing the text of "Treatise on Febrile Diseases" (TFDs) to find important information.
Materials and Methods: The experiment was implemented in Python using NLP toolkits including Jieba, nltk, gensim, and sklearn, supplemented by Excel and Word. The text of "TFDs" was cleaned, segmented, and stripped of stop words in sequence. Word frequency statistics and analysis, keyword extraction, named entity recognition (NER), and other operations were then performed, and finally text similarity was calculated.
Results: Jieba accurately identified the herbal names in "TFDs." Word frequency statistics based on the word segmentation found that "warm therapy" is an important treatment in "TFDs." Guizhi decoction is the main prescription, and five core decoctions were identified. Keyword extraction based on the term frequency-inverse document frequency (TF-IDF) algorithm performed well. The accuracy of NER on "TFDs" was about 86%. Similarity calculated with a latent semantic indexing (LSI) model showed that "Understanding of Synopsis of Golden Chamber (SGC)" is much more similar to "SGC" than to "TFDs." The results met expectations.
Conclusions: This work lays a research foundation for applying NLP to text mining of unstructured TCM literature. Combined with deep learning, NLP, as an important branch of artificial intelligence, will have broader application prospects in text mining of TCM literature, construction of TCM knowledge graphs, and TCM knowledge services.
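The abstract names word frequency statistics, TF-IDF keyword extraction, and text similarity as steps in the pipeline. The following is a minimal standard-library sketch of those three steps, not the authors' implementation. It assumes the text has already been segmented (the study used Jieba for this), and the token lists, stop words, and function names are hypothetical and invented for illustration. It also compares raw TF-IDF vectors by cosine similarity, which is a simpler stand-in for the LSI model the study used.

```python
import math
from collections import Counter


def remove_stop_words(tokens, stop_words):
    """Drop tokens that appear in the stop-word set."""
    return [t for t in tokens if t not in stop_words]


def word_frequencies(tokens):
    """Count how often each token occurs."""
    return Counter(tokens)


def tf_idf(docs):
    """Return {doc_id: {term: weight}}.

    tf = term count / document length, idf = log(N / document frequency).
    """
    n_docs = len(docs)
    doc_freq = Counter()
    for tokens in docs.values():
        doc_freq.update(set(tokens))
    weights = {}
    for doc_id, tokens in docs.items():
        counts = Counter(tokens)
        total = len(tokens)
        weights[doc_id] = {
            term: (count / total) * math.log(n_docs / doc_freq[term])
            for term, count in counts.items()
        }
    return weights


def top_keywords(term_weights, k=3):
    """Return the k (term, weight) pairs with the highest weight."""
    ranked = sorted(term_weights.items(), key=lambda kv: kv[1], reverse=True)
    return ranked[:k]


def cosine_similarity(vec_a, vec_b):
    """Cosine similarity of two sparse {term: weight} vectors."""
    dot = sum(w * vec_b.get(t, 0.0) for t, w in vec_a.items())
    norm_a = math.sqrt(sum(w * w for w in vec_a.values()))
    norm_b = math.sqrt(sum(w * w for w in vec_b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


# Hypothetical pre-segmented toy texts (romanized tokens, invented for illustration).
STOP_WORDS = {"zhi", "zhe"}
raw_docs = {
    "text_a": ["guizhi", "tang", "zhi", "guizhi", "fare"],
    "text_b": ["guizhi", "tang", "zhe", "shaoyao"],
    "text_c": ["mahuang", "tang", "zhi", "fare"],
}
docs = {name: remove_stop_words(toks, STOP_WORDS) for name, toks in raw_docs.items()}
weights = tf_idf(docs)

if __name__ == "__main__":
    print(word_frequencies(docs["text_a"]).most_common(2))
    print(top_keywords(weights["text_a"], k=2))
    print(cosine_similarity(weights["text_a"], weights["text_b"]))
```

A term that occurs in every document gets an IDF of zero here, so it never ranks as a keyword. That is the property that makes TF-IDF favor terms that are distinctive for one text.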
Pages: 67-73
Page count: 7
Source: World Journal of Traditional Chinese Medicine, 2020, 6 (01): 67-73