Experiments in text-based mining and analysis of biological information from MEDLINE on functionally-related genes

Cited by: 1
Authors
Moon, N [1]
Singh, R [1]
Affiliations
[1] San Francisco State Univ, Dept Comp Sci, San Francisco, CA 94132 USA
Keywords
EXPRESSION;
DOI
10.1109/ICSENG.2005.41
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Technological advancements such as microarrays have enabled biologists to generate unprecedented quantities of data about biological entities. This has led to the development of a large number of algorithms for processing and analysis of biological data. Challenges remain, however; for instance, genes that function cooperatively need not have similar expression patterns. This suggests the use of non-numerical sources of information to explore the underlying biology. We experimentally study various factors that are inherent in algorithmic methodologies for text analysis. The proposed method accesses MEDLINE dynamically to account for the latest research, using the available literature on the genes under analysis to develop lists of keywords. Natural language processing (NLP) techniques such as stop-word filtering and stemming are then applied to the lists, and keyword frequencies are weighted using the term frequency-inverse document frequency (TFIDF) scheme. The results are input to a hierarchical clustering algorithm to derive groupings of genes by functionality. The process is repeated using z-score weighting and latent semantic analysis (LSA) to determine which yields the most accurate clustering. The study presented examines the importance of these steps and their influence on the overall efficacy of the system. We believe that the analysis conducted as part of this research will be invaluable to the development and fine-tuning of text mining methodologies for biological literature.
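The pipeline outlined in the abstract (stop-word filtering, stemming, TFIDF weighting, hierarchical clustering) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the gene names and texts are invented placeholders, the stop-word list is a tiny hand-picked set, the `stem` function is a crude suffix stripper standing in for a real stemmer, and average-linkage clustering on cosine distance is an assumed choice, since the record does not state which linkage or distance the paper uses. MEDLINE retrieval, z-score weighting, and LSA are omitted.

```python
import math
import re
from collections import Counter

# Assumption: a tiny hand-picked stop-word list, for illustration only.
STOP_WORDS = {"the", "a", "an", "of", "in", "on", "and", "is", "to", "for", "with", "by"}

def stem(word):
    # Crude suffix stripping; a stand-in for a real stemmer (e.g. Porter's).
    for suffix in ("ing", "s"):
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word

def keywords(text):
    # Lowercase, tokenize on letters, drop stop words, then stem.
    tokens = re.findall(r"[a-z]+", text.lower())
    return [stem(t) for t in tokens if t not in STOP_WORDS]

def tfidf(docs):
    # docs maps each gene to its keyword list; returns sparse TF-IDF vectors.
    n = len(docs)
    df = Counter(term for kws in docs.values() for term in set(kws))
    vectors = {}
    for gene, kws in docs.items():
        tf = Counter(kws)
        vectors[gene] = {t: (c / len(kws)) * math.log(n / df[t]) for t, c in tf.items()}
    return vectors

def cosine_distance(u, v):
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    nu = math.sqrt(sum(w * w for w in u.values()))
    nv = math.sqrt(sum(w * w for w in v.values()))
    if nu == 0 or nv == 0:
        return 1.0
    return 1.0 - dot / (nu * nv)

def cluster(vectors, k):
    # Agglomerative clustering with average linkage, stopped at k clusters.
    clusters = [[g] for g in vectors]

    def linkage(a, b):
        total = sum(cosine_distance(vectors[x], vectors[y]) for x in a for y in b)
        return total / (len(a) * len(b))

    while len(clusters) > k:
        pairs = ((i, j) for i in range(len(clusters)) for j in range(i + 1, len(clusters)))
        i, j = min(pairs, key=lambda p: linkage(clusters[p[0]], clusters[p[1]]))
        clusters[i] = clusters[i] + clusters[j]
        del clusters[j]
    return clusters

# Invented toy texts standing in for literature retrieved per gene.
texts = {
    "geneA": "Kinase signaling regulates cell cycle checkpoints",
    "geneB": "Cell cycle checkpoints depend on kinase signaling",
    "geneC": "Membrane transport of glucose in the liver",
    "geneD": "Glucose transport across the liver membrane",
}
docs = {gene: keywords(text) for gene, text in texts.items()}
groups = cluster(tfidf(docs), k=2)
print(groups)
```

On this toy input, genes whose texts share vocabulary end up in the same group. Swapping the weighting function (for example, for a z-score) while keeping the clustering step fixed is one way to compare weighting schemes in the spirit of the study.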
Pages: 326-331
Page count: 6