Integrating text mining, data mining, and network analysis for identifying genetic breast cancer trends

Cited by: 21
Authors
Jurca G. [1]
Addam O. [1]
Aksac A. [1]
Gao S. [2]
Özyer T. [3]
Demetrick D. [4]
Alhajj R. [1, 5]
Affiliations
[1] Department of Computer Science, University of Calgary, Calgary, AB
[2] College of Computer Science and Technology, Jilin University, Changchun
[3] Department of Computer Engineering, TOBB University, Ankara
[4] Departments of Pathology, Oncology and Biochemistry and Molecular Biology, University of Calgary, Calgary, AB
[5] Department of Computer Science, Global University, Beirut
Keywords
Breast cancer; Data mining; Network analysis; Text mining
DOI
10.1186/s13104-016-2023-5
Abstract
Background: Breast cancer is a serious disease which affects many women and may lead to death. It has received considerable attention from the research community, and biomedical researchers aim to find genetic biomarkers indicative of the disease. Novel biomarkers can be elucidated from the existing literature. However, the vast number of scientific publications on breast cancer makes this a daunting task. This paper presents a framework which investigates existing literature data for informative discoveries. It integrates text mining and social network analysis in order to identify new potential biomarkers for breast cancer.
Results: We utilized PubMed for the testing. We investigated gene-gene interactions, as well as novel interactions such as gene-year, gene-country, and abstract-country, to find out how the discoveries varied over time and how much the discoveries and interests of research groups in different countries overlap or diverge.
Conclusions: Interesting trends have been identified and discussed, e.g., different genes are highlighted in relation to different countries, though the various genes were found to share functionality. Some text-analysis-based results have been validated against results from other tools that predict gene-gene relations and gene functions.
© 2016 Jurca et al.
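As a rough illustration of the kind of relations the abstract names (gene-gene, gene-year, gene-country), the sketch below counts co-occurrences over a handful of toy records. It is not the authors' framework: the record layout, the `cooccurrence_edges` helper, and the gene, year, and country values are all hypothetical, and gene mentions are assumed to have been extracted from each abstract already.

```python
from collections import Counter
from itertools import combinations

# Hypothetical toy records (not data from the paper): each one stands for an
# abstract whose gene mentions, publication year, and country are known.
records = [
    {"genes": {"BRCA1", "BRCA2"}, "year": 2010, "country": "Canada"},
    {"genes": {"BRCA1", "TP53"}, "year": 2012, "country": "Turkey"},
    {"genes": {"BRCA1", "BRCA2", "TP53"}, "year": 2012, "country": "Canada"},
]


def cooccurrence_edges(records):
    """Count weighted edges for three illustrative relation types."""
    gene_gene = Counter()     # (gene_a, gene_b) -> abstracts mentioning both
    gene_year = Counter()     # (gene, year)     -> abstracts in that year
    gene_country = Counter()  # (gene, country)  -> abstracts from that country
    for rec in records:
        genes = sorted(rec["genes"])  # sort so each pair has one canonical order
        for a, b in combinations(genes, 2):
            gene_gene[(a, b)] += 1
        for g in genes:
            gene_year[(g, rec["year"])] += 1
            gene_country[(g, rec["country"])] += 1
    return gene_gene, gene_year, gene_country


gene_gene, gene_year, gene_country = cooccurrence_edges(records)
```

Each counter can be read as a weighted edge list, which is the usual input for network analysis; how the paper itself weights or filters such edges is not stated in the abstract.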