Classification of Protein-Protein Interaction Full-Text Documents Using Text and Citation Network Features

被引:21
|
作者
Kolchinsky, Artemy [1 ,2 ]
Abi-Haidar, Alaa [1 ,2 ]
Kaur, Jasleen [1 ]
Hamed, Ahmed Abdeen [1 ]
Rocha, Luis M. [1 ,2 ]
机构
[1] Indiana Univ, Sch Informat & Comp, Bloomington, IN 47408 USA
[2] FLAD Computat Biol Collaboratorium, Inst Gulbenkian Ciencia, P-2780156 Oeiras, Portugal
关键词
Text mining; literature mining; binary classification; protein-protein interaction; citation network; INFORMATION; GENES;
D O I
10.1109/TCBB.2010.55
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
We participated ( as Team 9) in the Article Classification Task of the Biocreative II.5 Challenge: binary classification of full-text documents relevant for protein-protein interaction. We used two distinct classifiers for the online and offline challenges: 1) the lightweight Variable Trigonometric Threshold (VTT) linear classifier we successfully introduced in BioCreative 2 for binary classification of abstracts and 2) a novel Naive Bayes classifier using features from the citation network of the relevant literature. We supplemented the supplied training data with full-text documents from the MIPS database. The lightweight VTT classifier was very competitive in this new full-text scenario: it was a top-performing submission in this task, taking into account the rank product of the Area Under the interpolated precision and recall Curve, Accuracy, Balanced F-Score, and Matthew's Correlation Coefficient performance measures. The novel citation network classifier for the biomedical text mining domain, while not a top performing classifier in the challenge, performed above the central tendency of all submissions, and therefore indicates a promising new avenue to investigate further in bibliome informatics.
引用
收藏
页码:400 / 411
页数:12
相关论文
共 50 条
  • [31] Protein features fusion using attributed network embedding for predicting protein-protein interaction
    Cao, Mei-Yuan
    Zainudin, Suhaila
    Daud, Kauthar Mohd
    BMC GENOMICS, 2024, 25 (01):
  • [32] Full-text articles: faculty perceptions, student use, and citation abuse
    Imler, Bonnie
    Hall, Russell A.
    REFERENCE SERVICES REVIEW, 2009, 37 (01) : 65 - +
  • [33] Full-text citation analysis: A new method to enhance scholarly networks
    Liu, Xiaozhong
    Zhang, Jinsong
    Guo, Chun
    JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2013, 64 (09): : 1852 - 1863
  • [34] Text-Mining Protein-Protein Interaction Corpus Using Concept Clustering to Identify Intermittency
    Peterson, Leif E.
    Coleman, Matthew A.
    2008 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-8, 2008, : 3634 - +
  • [35] Full-text Search Using Database Index
    Chaitanya, B. Sri Sai Krishna
    Reddy, D. Ajay Kumar
    Chandra, B. Pavan Sai Eshwar
    Krishna, A. Bala
    Menon, Remya R. K.
    2019 5TH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION, CONTROL AND AUTOMATION (ICCUBEA), 2019,
  • [36] Protein-protein interaction network constructing based on text mining and reinforcement learning with application to prostate cancer
    Zhu, Fei
    Liu, Quan
    Zhang, Xiaofang
    Shen, Bairong
    2014 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2014,
  • [37] Extracting Protein-Protein Interaction from Biomedical Text Using Additional Shallow Parsing Information
    Yu, Huanhuan
    Qian, Longhua
    Zhou, Guodong
    Zhu, Qiaoming
    PROCEEDINGS OF THE 2009 2ND INTERNATIONAL CONFERENCE ON BIOMEDICAL ENGINEERING AND INFORMATICS, VOLS 1-4, 2009, : 1601 - 1605
  • [38] Full-Text Search Engine using MySQL
    Gyorodi, C.
    Gyorodi, R.
    Pecherle, G.
    Cornea, G. M.
    INTERNATIONAL JOURNAL OF COMPUTERS COMMUNICATIONS & CONTROL, 2010, 5 (05) : 735 - 743
  • [39] Research on full-text indexing technology for documents based on COM components
    Wu Wanzhi
    Wu Shunxiang
    ICCSE 2008: PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE & EDUCATION: ADVANCED COMPUTER TECHNOLOGY, NEW EDUCATION, 2008, : 578 - 581
  • [40] Full-text and structural indexing of XML documents on B+-tree
    Shimizu, T
    Yoshikawa, M
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2006, E89D (01): : 237 - 247