Text mining-based identification of promising miRNA biomarkers for diabetes mellitus

被引:3
|
作者
Li, Xin [1 ]
Dai, Andrea [2 ]
Tran, Richard [3 ]
Wang, Jie [4 ,5 ]
机构
[1] Shandong First Med Univ, Ophthalmol Dept, Cent Hosp, Jinan, Shandong, Peoples R China
[2] Oakland Univ, William Beaumont Sch Med, Rochester, MI USA
[3] Univ Chicago, Masters Program Comp Sci, Chicago, IL USA
[4] Syracuse Univ, Appl Data Sci Program, Syracuse, NY 13244 USA
[5] MDSight LLC, Brookeville, MD 20833 USA
来源
关键词
microRNA; text mining; machine learning; diabetes; miR-146; MICRORNA BIOGENESIS; MIR-146A; EXPRESSION; DIAGNOSIS; RISK;
D O I
10.3389/fendo.2023.1195145
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
IntroductionMicroRNAs (miRNAs) are small, non-coding RNAs that play a critical role in diabetes development. While individual studies investigating the mechanisms of miRNA in diabetes provide valuable insights, their narrow focus limits their ability to provide a comprehensive understanding of miRNAs' role in diabetes pathogenesis and complications. MethodsTo reduce potential bias from individual studies, we employed a text mining-based approach to identify the role of miRNAs in diabetes and their potential as biomarker candidates. Abstracts of publications were tokenized, and biomedical terms were extracted for topic modeling. Four machine learning algorithms, including Naive Bayes, Decision Tree, Random Forest, and Support Vector Machines (SVM), were employed for diabetes classification. Feature importance was assessed to construct miRNA-diabetes networks. ResultsOur analysis identified 13 distinct topics of miRNA studies in the context of diabetes, and miRNAs exhibited a topic-specific pattern. SVM achieved a promising prediction for diabetes with an accuracy score greater than 60%. Notably, miR-146 emerged as one of the critical biomarkers for diabetes prediction, targeting multiple genes and signal pathways implicated in diabetic inflammation and neuropathy. ConclusionThis comprehensive approach yields generalizable insights into the network miRNAs-diabetes network and supports miRNAs' potential as a biomarker for diabetes.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] Text Mining-Based Drug Discovery in Osteoarthritis
    Yu, Rong-Guo
    Zhang, Jia-Yu
    Liu, Zhen-Tao
    Zhuo, You-Guang
    Wang, Hai-Yang
    Ye, Jie
    Liu, Nannan
    Zhang, Yi-Yuan
    [J]. JOURNAL OF HEALTHCARE ENGINEERING, 2021, 2021
  • [2] Mining of protein based biomarkers for type 2 diabetes mellitus
    Khan, Abdul Rehman
    Awan, Fazli Rabbi
    [J]. PAKISTAN JOURNAL OF PHARMACEUTICAL SCIENCES, 2012, 25 (04) : 889 - 901
  • [3] Athena: Mining-based interactive management of text databases
    Agrawal, R
    Bayardo, R
    Srikant, R
    [J]. ADVANCES IN DATABSE TECHNOLOGY-EDBT 2000, PROCEEDINGS, 2000, 1777 : 365 - 379
  • [4] Text Classification using Graph Mining-based Feature Extraction
    Jiang, Chuntao
    Coenen, Frans
    Sanderson, Robert
    Zito, Michele
    [J]. RESEARCH AND DEVELOPMENT IN INTELLIGENT SYSTEMS XXVI: INCORPORATING APPLICATIONS AND INNOVATIONS IN INTELLIGENT SYSTEMS XVII, 2010, : 21 - 34
  • [5] A text mining-based approach for the evaluation of patenting trends on nanomaterials
    Milanez, Douglas Henrique
    Lopes de Faria, Leandro Innocentini
    Leiva, Daniel Rodrigo
    [J]. JOURNAL OF NANOPARTICLE RESEARCH, 2021, 23 (09)
  • [6] TEXT MINING-BASED PATENT ANALYSIS OF BIM APPLICATION IN CONSTRUCTION
    Pan, Xing
    Zhong, Botao
    Wang, Xiaobo
    Xiang, Ran
    [J]. JOURNAL OF CIVIL ENGINEERING AND MANAGEMENT, 2021, 27 (05) : 303 - 315
  • [7] Text classification using graph mining-based feature extraction
    Jiang, Chuntao
    Coenen, Frans
    Sanderson, Robert
    Zito, Michele
    [J]. KNOWLEDGE-BASED SYSTEMS, 2010, 23 (04) : 302 - 308
  • [8] A text mining-based approach for the evaluation of patenting trends on nanomaterials
    Douglas Henrique Milanez
    Leandro Innocentini Lopes de Faria
    Daniel Rodrigo Leiva
    [J]. Journal of Nanoparticle Research, 2021, 23
  • [9] Sustainability trends in the process industries: A text mining-based analysis
    Liew, Wan Te
    Adhitya, Arief
    Srinivasan, Rajagopalan
    [J]. COMPUTERS IN INDUSTRY, 2014, 65 (03) : 393 - 400
  • [10] Identifying miRNA biomarkers of polycystic ovary syndrome through text mining
    Wang Yan
    Yuan Xiao-Jing
    Zhao Bin
    [J]. 生殖与发育医学(英文), 2023, 07 (02)