The Text-mining based PubChem Bioassay neighboring analysis

被引:21
|
作者
Han, Lianyi [1 ]
Suzek, Tugba O. [1 ]
Wang, Yanli [1 ]
Bryant, Steve H. [1 ]
机构
[1] US Natl Lib Med, Natl Ctr Biotechnol Informat, Bethesda, MD 20894 USA
来源
BMC BIOINFORMATICS | 2010年 / 11卷
关键词
PROTEIN-PROTEIN INTERACTIONS; BIOMEDICAL LITERATURE; GENE-EXPRESSION; INFORMATION; NETWORK; NAMES;
D O I
10.1186/1471-2105-11-549
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: In recent years, the number of High Throughput Screening (HTS) assays deposited in PubChem has grown quickly. As a result, the volume of both the structured information (i.e. molecular structure, bioactivities) and the unstructured information (such as descriptions of bioassay experiments), has been increasing exponentially. As a result, it has become even more demanding and challenging to efficiently assemble the bioactivity data by mining the huge amount of information to identify and interpret the relationships among the diversified bioassay experiments. In this work, we propose a text-mining based approach for bioassay neighboring analysis from the unstructured text descriptions contained in the PubChem BioAssay database. Results: The neighboring analysis is achieved by evaluating the cosine scores of each bioassay pair and fraction of overlaps among the human-curated neighbors. Our results from the cosine score distribution analysis and assay neighbor clustering analysis on all PubChem bioassays suggest that strong correlations among the bioassays can be identified from their conceptual relevance. A comparison with other existing assay neighboring methods suggests that the text-mining based bioassay neighboring approach provides meaningful linkages among the PubChem bioassays, and complements the existing methods by identifying additional relationships among the bioassay entries. Conclusions: The text-mining based bioassay neighboring analysis is efficient for correlating bioassays and studying different aspects of a biological process, which are otherwise difficult to achieve by existing neighboring procedures due to the lack of specific annotations and structured information. It is suggested that the text-mining based bioassay neighboring analysis can be used as a standalone or as a complementary tool for the PubChem bioassay neighboring process to enable efficient integration of assay results and generate hypotheses for the discovery of bioactivities of the tested reagents.
引用
收藏
页数:9
相关论文
共 50 条
  • [31] A Text-Mining Approach to Explain Unwanted Behaviours
    Chen, Wei
    Aspinall, David
    Gordon, Andrew D.
    Sutton, Charles
    Muttik, Igor
    [J]. PROCEEDINGS OF THE 9TH EUROPEAN WORKSHOP ON SYSTEM SECURITY, (EUROSEC 2016), 2016, : 19 - 24
  • [32] @Minter: automated text-mining of microbial interactions
    Lim, Kun Ming Kenneth
    Li, Chenhao
    Chng, Kern Rei
    Nagarajan, Niranjan
    [J]. BIOINFORMATICS, 2016, 32 (19) : 2981 - 2987
  • [33] Text-mining Approach for Estimating Vulnerability Score
    Miyamoto, Daisuke
    Yamamoto, Yasuhiro
    Nakayama, Masaya
    [J]. 2015 4TH INTERNATIONAL WORKSHOP ON BUILDING ANALYSIS DATASETS AND GATHERING EXPERIENCE RETURNS FOR SECURITY (BADGERS), 2015, : 67 - 73
  • [34] An assessment of blockchain academia and news developments: a bibliometric and text-mining analysis
    Shen, Chien-Wen
    Tran, Phung Phi
    [J]. LIBRARY HI TECH, 2023,
  • [35] Analysis of patterns in meteorological research and development using a text-mining algorithm
    Park, Hongju
    Kim, Habin
    Park, Taeyoung
    Lee, Yung-Seop
    [J]. KOREAN JOURNAL OF APPLIED STATISTICS, 2016, 29 (05) : 935 - 947
  • [36] A Text-Mining Analysis of Research Trends in Animal-Assisted Therapy
    Lee, Shin-Ja
    Kim, Geun-Hyeon
    Moon, Yea-Hwang
    Lee, Sung-Sill
    [J]. ANIMALS, 2023, 13 (19):
  • [37] The research on gene-disease association based on text-mining of PubMed
    Zhou, Jie
    Fu, Bo-quan
    [J]. BMC BIOINFORMATICS, 2018, 19
  • [38] Text-Mining Based Risk Source Identification Model for Transportation Safety
    Luo, Wenhui
    Cai, Fengtian
    Wu, Chuna
    Xia, Hongwen
    Meng, Xingkai
    [J]. Xinan Jiaotong Daxue Xuebao/Journal of Southwest Jiaotong University, 2021, 56 (01): : 147 - 152
  • [39] Methodical Approaches to Forecasting Dynamics of the Stock Market based on "Text-Mining"
    Malyshenko, Kostyantyn Anatolievich
    Malyshenko, Vadim Anatolievich
    Anashkina, Marina Viktorovna
    [J]. VISION 2020: SUSTAINABLE ECONOMIC DEVELOPMENT AND APPLICATION OF INNOVATION MANAGEMENT, 2018, : 1030 - 1044
  • [40] Current challenges in text-mining for chemical information
    Sayle, Roger
    Mayfield, John
    O'Boyle, Noel
    [J]. ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2019, 258