Constrained query of order-preserving submatrix in gene expression data

Cited by: 1
Authors
Jiang, Tao [1]
Li, Zhanhuai [1]
Shang, Xuequn [1]
Chen, Bolin [1]
Li, Weibang [1]
Yin, Zhilei [1]
Affiliation
[1] Northwestern Polytech Univ, Sch Comp Sci & Technol, Xian 710072, Peoples R China
Keywords
gene expression data; OPSM; constrained query; brute-force search; feature sequence; cIndex;
DOI
10.1007/s11704-016-5487-5
Chinese Library Classification
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
The order-preserving submatrix (OPSM) has become an important model for biologically meaningful subspace clusters, capturing the general tendency of gene expression across a subset of conditions. With advances in microarray and analysis techniques, large volumes of gene expression datasets and OPSM mining results are being produced. An OPSM query can efficiently retrieve relevant OPSMs from these large collections. However, improving the relevance of OPSM queries remains difficult in real-life exploratory data analysis. First, it is hard to capture subjective aspects of interestingness, e.g., the analyst's expectations given her/his domain knowledge. Second, even when these expectations can be specified declaratively, it is still challenging to use them during the computation of OPSM queries. To the best of our knowledge, existing methods mainly focus on batch OPSM mining, while few works address OPSM queries. To solve these problems, the paper proposes two constrained OPSM query methods, which exploit user-defined constraints to search for relevant results in two kinds of indices introduced in the paper. Extensive experiments on real datasets demonstrate that queries based on the multi-dimension index (cIndex) and the enumerating sequence index (esIndex) perform better than brute-force search.
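As a rough illustration of the brute-force baseline mentioned in the abstract, the sketch below scans every gene (row) of a small expression matrix and keeps those whose values strictly increase along a queried column order, which is the usual OPSM definition. The names `supports_order` and `brute_force_query`, the `required_genes` constraint, and the toy data are assumptions made for this example; they are not the paper's cIndex or esIndex methods, and the paper's actual constraint types may differ.

```python
from typing import List, Sequence


def supports_order(row: Sequence[float], order: Sequence[int]) -> bool:
    """Return True if the row's values strictly increase along `order`."""
    return all(row[a] < row[b] for a, b in zip(order, order[1:]))


def brute_force_query(
    matrix: Sequence[Sequence[float]],
    order: Sequence[int],
    required_genes: Sequence[int] = (),
) -> List[int]:
    """Scan all rows and return the indices of genes that follow `order`.

    `required_genes` is a hypothetical user constraint: if any listed gene
    does not follow the order, the query returns an empty result.
    """
    hits = [g for g, row in enumerate(matrix) if supports_order(row, order)]
    return hits if set(required_genes) <= set(hits) else []


# Toy expression matrix: rows are genes, columns are conditions.
expression = [
    [0.1, 0.9, 0.5, 0.3],
    [1.0, 4.0, 3.0, 2.0],
    [0.7, 0.2, 0.4, 0.9],
    [2.0, 8.0, 6.0, 3.0],
]

# Query: genes whose expression rises across conditions 0, 3, 2, 1.
order = [0, 3, 2, 1]
print(brute_force_query(expression, order))
print(brute_force_query(expression, order, required_genes=[2]))
```

Each query here touches every row of the matrix, which is the cost an index-based approach tries to avoid.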
Pages: 1052-1066
Page count: 15
Related papers
50 in total
  • [1] Constrained query of order-preserving submatrix in gene expression data
    Tao JIANG
    Zhanhuai LI
    Xuequn SHANG
    Bolin CHEN
    Weibang LI
    Zhilei YIN
    Frontiers of Computer Science, 2016, 10 (06) : 1052 - 1066
  • [2] Constrained query of order-preserving submatrix in gene expression data
    Tao Jiang
    Zhanhuai Li
    Xuequn Shang
    Bolin Chen
    Weibang Li
    Zhilei Yin
    Frontiers of Computer Science, 2016, 10 : 1052 - 1066
  • [3] Indexing and Search of Order-Preserving Submatrix for Gene Expression Data
    Jiang, Tao
    Chen, Bolin
    Li, Juntao
    Xu, Guoyu
    IEEE ACCESS, 2019, 7 : 184769 - 184785
  • [4] Discovering local structure in gene expression data: The order-preserving submatrix problem
    Ben-Dor, A
    Chor, B
    Karp, R
    Yakhini, Z
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2003, 10 (3-4) : 373 - 384
  • [5] Towards Order-Preserving SubMatrix Search and Indexing
    Jiang, Tao
    Li, Zhanhuai
    Chen, Qun
    Li, Kaiwen
    Wang, Zhong
    Pan, Wei
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, DASFAA 2015, PT II, 2015, 9050 : 309 - 326
  • [6] Economic Regionalization Based On Order-Preserving Submatrix
    1ST INTERNATIONAL CONFERENCE ON DATA SCIENCE, ICDS 2014, 2014, 30 : 39 - 49
  • [7] An Algorithm for Discovering Deep Order Preserving Submatrix in Gene Expression Data
    Kuang, Qiuhua
    Zhang, Meizhen
    Ma, Zhihao
    Ma, Bo
    Liu, Zhiwen
    Xue, Yun
    PROCEEDINGS 2015 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2015, : 1678 - 1683
  • [8] Order-preserving clustering and its application to gene expression data
    Syeda-Mahmood, T
    PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 4, 2004, : 637 - 640
  • [9] Mining Bucket Order-Preserving SubMatrices in Gene Expression Data
    Fang, Qiong
    Ng, Wilfred
    Feng, Jianlin
    Li, Yuliang
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2012, 24 (12) : 2218 - 2231
  • [10] On the Deep Order-Preserving Submatrix Problem: A Best Effort Approach
    Gao, Byron J.
    Griffith, Obi L.
    Ester, Martin
    Xiong, Hui
    Zhao, Qiang
    Jones, Steven J. M.
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2012, 24 (02) : 309 - 325