Content-based search of gene expression databases using binary fingerprints of differential expression profiles

被引:0
|
作者
Bell F. [1 ]
Sacan A. [1 ]
机构
[1] School of Biomedical Engineering, Drexel University, Philadelphia
关键词
Binary fingerprints; Content-based search; GEO; Microarray;
D O I
10.1007/s13721-015-0076-3
中图分类号
学科分类号
摘要
Availability and rapid growth of microarray databases have made an integrated analysis of these databases computationally challenging. We present a novel approach to content-based searching in microarray databases, using binary vector representations, that is inspired from the Chemoinformatics field. A benchmark compendium of microarray datasets is established for evaluation of content-based searching. Differential expression profiles from microarray experiments are represented either as floating point vectors or as concise binary vectors. The benchmark compendium is searched using several distance measures for determining similarity. We demonstrate that the use of binary vector representations achieves accuracies equivalent to or better than the use of floating point measures, while at the same time significantly reducing the time required to search a microarray database, owing to the fast bitwise operations and the reduction in memory requirements. Experiments on a large database of binary vector representations demonstrate that a modified Tanimoto distance measure is best suited for content-based search of differential microarray profiles. The search method is available as a web service at: http://sacan.biomed.drexel.edu/mageoindex/. © Springer-Verlag Wien 2015.
引用
收藏
页数:14
相关论文
共 50 条
  • [31] Author Profiles Prediction Using Syntactic and Content-Based Features
    Reddy, T. Raghunadha
    Srilatha, M.
    Sreenivas, M.
    Rajasekhar, N.
    DATA ENGINEERING AND COMMUNICATION TECHNOLOGY, ICDECT-2K19, 2020, 1079 : 265 - 273
  • [32] A Composite Mode Differential Gene Regulatory Architecture based on Temporal Expression Profiles
    Majumder, Aurpan
    Sarkar, Mrityunjay
    Sharma, Prolay
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2019, 16 (06) : 1785 - 1793
  • [33] Optimal Aggregation of Binary Classifiers for Multiclass Cancer Diagnosis Using Gene Expression Profiles
    Yukinawa, Naoto
    Oba, Shigeyuki
    Kato, Kikuya
    Ishii, Shin
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2009, 6 (02) : 333 - 343
  • [34] Modelling gene expression profiles related to prostate tumor progression using binary states
    Martinez, Emmanuel
    Trevino, Victor
    THEORETICAL BIOLOGY AND MEDICAL MODELLING, 2013, 10
  • [35] 3D content-based search using sketches
    Konstantinos Moustakas
    Georgios Nikolakis
    Dimitrios Tzovaras
    Sebastien Carbini
    Olivier Bernier
    Jean Emmanuel Viallet
    Personal and Ubiquitous Computing, 2009, 13 : 59 - 67
  • [36] 3D content-based search using sketches
    Moustakas, K.
    Nikolakis, G.
    Tzovaras, D.
    Carbini, S.
    Bernier, O.
    Viallet, J. E.
    ARTIFICIAL INTELLIGENCE APPLICATIONS AND INNOVATIONS, 2006, 204 : 361 - +
  • [37] Content-based search of video using color, texture, and motion
    Deng, Y
    Manjunath, BS
    INTERNATIONAL CONFERENCE ON IMAGE PROCESSING - PROCEEDINGS, VOL II, 1997, : 534 - 537
  • [38] 3D content-based search using sketches
    Moustakas, Konstantinos
    Nikolakis, Georgios
    Tzovaras, Dimitrios
    Carbini, Sebastien
    Bernier, Olivier
    Viallet, Jean Emmanuel
    PERSONAL AND UBIQUITOUS COMPUTING, 2009, 13 (01) : 59 - 67
  • [39] Differential compression and optimal caching methods for content-based image search systems
    Zhong, D
    Chang, SF
    Smith, JR
    MULTIMEDIA STORAGE AND ARCHIVING SYSTEMS IV, 1999, 3846 : 413 - 422
  • [40] Content-based search engines for construction image databases (vol 14, pg 537, 2005)
    Brilakis, I
    Soibelman, L
    AUTOMATION IN CONSTRUCTION, 2006, 15 (02) : 253 - 253