Compound Classification and Consideration of Correlation with Chemical Descriptors from Articles on Antioxidant Capacity Using Natural Language Processing

被引:0
|
作者
Matsumoto, Yuto [1 ]
Gotoh, Hiroaki [1 ]
机构
[1] Yokohama Natl Univ, Dept Chem & Life Sci, Yokohama 2408501, Japan
关键词
SOAC VALUES; EXTRACTION; QUERCETIN;
D O I
10.1021/acs.jcim.3c01826
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
In recent times, there has been a substantial increase in the number of articles focusing on antioxidants. However, the development of a comprehensive estimator for antioxidant capacity remains elusive due to the challenge of integrating information from these articles. Furthermore, the complexity of the antioxidant mechanism, which involves a multitude of factors, makes it difficult to establish a simple equation or correlation. Hence, there is a pressing need for a model that can effectively interpret the collective knowledge from these articles, especially from a chemistry perspective. In this research, we employed natural language processing techniques, specifically Word2Vec, to analyze articles related to antioxidant capacity. We extracted representation vectors of compound names from these documents and organized them into 10 distinct clusters. In our investigation of two of these clusters, we unveiled that the majority of the compounds in question were flavonoids and flavonoid glycosides. To establish a link between the descriptors and clusters, we utilized kernel density estimation and generated scatter plots to visualize their similarity. These visualizations clearly indicated a strong relationship between the descriptors and clusters, affirming that a tangible connection exists between word vectors and compound descriptors through a document analysis conducted with natural language processing techniques. This study represents a pioneering approach that utilizes document analysis to shed light on the field of antioxidant capacity research, marking a significant advancement in this domain.
引用
收藏
页码:119 / 127
页数:9
相关论文
共 50 条
  • [1] Using natural language processing to improve suicide classification requires consideration of race
    Rahman, Nusrat
    Mozer, Reagan
    McHugh, R. Kathryn
    Rockett, Ian R. H.
    Chow, Clifton M.
    Vaughan, Gregory
    SUICIDE AND LIFE-THREATENING BEHAVIOR, 2022, 52 (04) : 782 - 791
  • [2] Multi-label Text Classification of Economic Concepts from Economic News Articles using Natural Language Processing
    Kim, Soojeong
    Lee, Minhyeok
    Seok, Junhee
    2022 THIRTEENTH INTERNATIONAL CONFERENCE ON UBIQUITOUS AND FUTURE NETWORKS (ICUFN), 2022, : 417 - 420
  • [3] Classification of Poverty Condition Using Natural Language Processing
    Muneton-Santa, Guberney
    Escobar-Grisales, Daniel
    Orlando Lopez-Pabon, Felipe
    Perez-Toro, Paula Andrea
    Rafael Orozco-Arroyave, Juan
    SOCIAL INDICATORS RESEARCH, 2022, 162 (03) : 1413 - 1435
  • [4] Classification of Poverty Condition Using Natural Language Processing
    Guberney Muñetón-Santa
    Daniel Escobar-Grisales
    Felipe Orlando López-Pabón
    Paula Andrea Pérez-Toro
    Juan Rafael Orozco-Arroyave
    Social Indicators Research, 2022, 162 : 1413 - 1435
  • [5] Classification of Notices to Airmen using Natural Language Processing
    Szeto, Aiden C.
    Das, Aditya
    AIAA SCITECH 2024 FORUM, 2024,
  • [6] Classification of neurologic outcomes from medical notes using natural language processing
    Fernandes, Marta B.
    Valizadeh, Navid
    Alabsi, Haitham S.
    Quadri, Syed A.
    Tesh, Ryan A.
    Bucklin, Abigail A.
    Sun, Haoqi
    Jain, Aayushee
    Brenner, Laura N.
    Ye, Elissa
    Ge, Wendong
    Collens, Sarah, I
    Lin, Stacie
    Das, Sudeshna
    Robbins, Gregory K.
    Zafar, Sahar F.
    Mukerji, Shibani S.
    Westover, M. Brandon
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 214
  • [7] Automated classification of lay health articles using natural language processing: a case study on pregnancy health and postpartum depression
    Patra, Braja Gopal
    Sun, Zhaoyi
    Cheng, Zilin
    Kumar, Praneet Kasi Reddy Jagadeesh
    Altammami, Abdullah
    Liu, Yiyang
    Joly, Rochelle
    Jedlicka, Caroline
    Delgado, Diana
    Pathak, Jyotishman
    Peng, Yifan
    Zhang, Yiye
    FRONTIERS IN PSYCHIATRY, 2023, 14
  • [8] Recategorizing Interdisciplinary Articles Using Natural Language Processing and Machine/Deep Learning
    Tanaka, Kazuya
    Arakawa, Riku
    Kameoka, Yasuaki
    Sakai, Ichiro
    2018 PORTLAND INTERNATIONAL CONFERENCE ON MANAGEMENT OF ENGINEERING AND TECHNOLOGY (PICMET '18): MANAGING TECHNOLOGICAL ENTREPRENEURSHIP: THE ENGINE FOR ECONOMIC GROWTH, 2018,
  • [9] ENRICHING PSYCHOTIC DISORDER CLASSIFICATION USING NATURAL LANGUAGE PROCESSING
    Patel, Rashmi
    Jackson, Richard
    Stewart, Robert
    McGuire, Philip
    SCHIZOPHRENIA BULLETIN, 2018, 44 : S154 - S155
  • [10] Real and Fake News Classification Using Natural Language Processing
    Kumar, Shivam
    Krishnan, C. Santhana
    Ramya, M.
    JOURNAL OF PHARMACEUTICAL NEGATIVE RESULTS, 2022, 13 : 1535 - 1540