A language modeling text mining approach to the annotation of protein community

被引:0
|
作者
Zhang, Xiaodan [1 ]
Wu, Daniel D. [1 ]
Zhou, Xiaohua [1 ]
Hu, Xiaohua [1 ]
机构
[1] Drexel Univ, Coll Informat Sci & Techno, 3141 Chestnut, Philadelphia, PA 19104 USA
关键词
D O I
暂无
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
This paper discusses an ontology based language modeling text mining approach to the annotation of protein community. Communities appear to play an important role in the functional properties of complex networks. Being able to annotate the identified the community structure in a biological network can help us to understand better the structure and dynamics of biological systems. Traditional method such as Gene Ontology (GO) provides information about the functionality of gene products, but they are not enough to annotate community as for only limited number of proteins in the database, limited protein properties available for annotation and the inability to annotate a group of gene products as a whole. Thus, we present an ontology based mixture language model approach to annotate protein community. Compared to traditional method, we have the following three advantages. First, biomedical literature mining brings much richer information than existed gene databases. Second, the mixture language model can help "purify" the document by eliminating some background noise. Third, using domain ontology, we extract biological concept and concept pairs from abstracts. Biological concept is more meaningful than word or multi-word phrases. Moreover, using concept pairs can deliver much more information and serve as evidence of annotation results. We test our approach on four communities SAGA-SRB, CCR-NOT, RFC and ARP2/3, detected from dataset of interactions for Saccharomyces cerevisae from the General Repository for Interaction Datasets (GRID). Annotation results provide a very coherent indication of functionality of each community.
引用
收藏
页码:12 / +
页数:2
相关论文
共 50 条
  • [31] A Hybrid Knowledge Mining Approach to Develop a System Framework for Odia Language Text Processing
    Mishra, Brojo Kishore
    Sahoo, Rekhanjali
    MATERIALS TODAY-PROCEEDINGS, 2018, 5 (01) : 1335 - 1340
  • [32] Text Annotation Graphs: Annotating Complex Natural Language Phenomena
    Forbes, Angus G.
    Lee, Kristine
    Hahn-Powell, Gus
    Valenzuela-Escarcega, Marco A.
    Surdeanu, Mihai
    PROCEEDINGS OF THE ELEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2018), 2018, : 1047 - 1052
  • [33] Text mining and natural language processing in construction
    Shamshiri, Alireza
    Ryu, Kyeong Rok
    Park, June Young
    AUTOMATION IN CONSTRUCTION, 2024, 158
  • [34] Text Mining of Medical Documents in Spanish: Semantic Annotation and Detection of Recommendations
    Telleria, Carlos
    Ilarri, Sergio
    Sanchez, Carlos
    PROCEEDINGS OF THE 16TH INTERNATIONAL CONFERENCE ON WEB INFORMATION SYSTEMS AND TECHNOLOGIES (WEBIST), 2020, : 197 - 208
  • [35] Analyzing Empowerment Processes Among Cancer Patients in an Online Community: A Text Mining Approach
    Verberne, Suzan
    Batenburg, Anika
    Sanders, Remco
    van Eenbergen, Mies
    Das, Enny
    Lambooij, Mattijs S.
    JMIR CANCER, 2019, 5 (01):
  • [36] Text Semantic Annotation: A Distributed Methodology Based on Community Coherence
    Makris, Christos
    Pispirigos, Georgios
    Simos, Michael Angelos
    ALGORITHMS, 2020, 13 (07)
  • [37] Modeling Social Annotation: A Bayesian Approach
    Plangprasopchok, Anon
    Lerman, Kristina
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2010, 5 (01)
  • [38] Understanding text mining: A pragmatic approach
    Bolasco, S
    Canzonetti, A
    Capo, FM
    della Ratta-Rinaldi, F
    Singh, BK
    KNOWLEDGE MINING, 2005, 185 : 31 - 50
  • [39] Call Center Text Mining Approach
    Yigit, Ibrahim Onuralp
    Ates, Ahmet Feyzi
    Guvercin, Mehmet
    Ferhatosmanoglu, Hakan
    Gedik, Bugra
    2017 25TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2017,
  • [40] A Comprehensive Study of Text Mining Approach
    Kaushik, Abhishek
    Naithani, Sudhanshu
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2016, 16 (02): : 69 - 76