Discovery of deep order-preserving submatrix in DNA microarray data based on sequential pattern mining

被引:4
|
作者
Liu, Zhiwen [1 ]
Xue, Yun [2 ]
Li, Meihang [1 ]
Ma, Bo [1 ]
Zhang, Meizhen [1 ]
Chen, Xin [1 ]
Hu, Xiaohui [1 ]
机构
[1] South China Normal Univ, Sch Phys & Telecommun Engn, Guangzhou 510006, Guangdong, Peoples R China
[2] South China Normal Univ, Sch Phys & Telecommun Engn, Guangdong Prov Engn Technol Res Ctr Data Sci, Guangdong Prov Key Lab Quantum Engn & Quantum Mat, Guangzhou 510006, Guangdong, Peoples R China
基金
中国国家自然科学基金;
关键词
OPSM; frequent sequential pattern; all common subsequences; dynamic programming; GENE-EXPRESSION DATA; BICLUSTERING ALGORITHMS;
D O I
10.1504/IJDMB.2017.10006246
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
In recent years, order-preserving submatrix (OPSM) model has been widely used in gene expression data analysis. Since it focuses on the changes between the elements rather than the real value, it shows better robustness and statistical significance among results than other models do. Currently, many OPSM algorithms are heuristic. They cannot mine all OPSMs as well as the deep OPSMs which are of biological significance in gene expression data. In this paper, an exact algorithm is proposed to find OPSMs by using frequent sequential pattern mining method. Firstly, we find out all common subsequences (ACS) between any two rows through dynamic programming. Then, we store them into a suffix tree. After that, we can get all OPSMs in this suffix tree, including deep OPSMs. Verified by the real gene data and artificially synthesised data, it is proved that our algorithm is efficient and meaningful.
引用
收藏
页码:217 / 237
页数:21
相关论文
共 50 条
  • [1] A new approach for the deep order preserving submatrix problem based on sequential pattern mining
    Xue, Yun
    Li, Tiechen
    Liu, Zhiwen
    Pang, Chaoyi
    Li, Meihang
    Liao, Zhengling
    Hu, Xiaohui
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2018, 9 (02) : 263 - 279
  • [2] A new approach for the deep order preserving submatrix problem based on sequential pattern mining
    Yun Xue
    Tiechen Li
    Zhiwen Liu
    Chaoyi Pang
    Meihang Li
    Zhengling Liao
    Xiaohui Hu
    International Journal of Machine Learning and Cybernetics, 2018, 9 : 263 - 279
  • [3] Mining Order-Preserving Submatrices Based on Frequent Sequential Pattern Mining
    Xue, Yun
    Li, Yuting
    Deng, Weijun
    Li, Jiejin
    Tang, Jianxiong
    Liao, Zhengling
    Li, Tiechen
    HEALTH INFORMATION SCIENCE, HIS 2014, 2014, 8423 : 184 - 193
  • [4] Economic Regionalization Based On Order-Preserving Submatrix
    1ST INTERNATIONAL CONFERENCE ON DATA SCIENCE, ICDS 2014, 2014, 30 : 39 - 49
  • [5] OMEGA: An Order-Preserving SubMatrix Mining, Indexing and Search Tool
    Jiang, Tao
    Li, Zhanhuai
    Chen, Qun
    Wang, Zhong
    Li, Kaiwen
    Pan, Wei
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, PT III, 2015, 9286 : 303 - 307
  • [6] On the Deep Order-Preserving Submatrix Problem: A Best Effort Approach
    Gao, Byron J.
    Griffith, Obi L.
    Ester, Martin
    Xiong, Hui
    Zhao, Qiang
    Jones, Steven J. M.
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2012, 24 (02) : 309 - 325
  • [7] Constrained query of order-preserving submatrix in gene expression data
    Tao JIANG
    Zhanhuai LI
    Xuequn SHANG
    Bolin CHEN
    Weibang LI
    Zhilei YIN
    Frontiers of Computer Science, 2016, 10 (06) : 1052 - 1066
  • [8] Indexing and Search of Order-Preserving Submatrix for Gene Expression Data
    Jiang, Tao
    Chen, Bolin
    Li, Juntao
    Xu, Guoyu
    IEEE ACCESS, 2019, 7 : 184769 - 184785
  • [9] Constrained query of order-preserving submatrix in gene expression data
    Jiang, Tao
    Li, Zhanhuai
    Shang, Xuequn
    Chen, Bolin
    Li, Weibang
    Yin, Zhilei
    FRONTIERS OF COMPUTER SCIENCE, 2016, 10 (06) : 1052 - 1066
  • [10] Constrained query of order-preserving submatrix in gene expression data
    Tao Jiang
    Zhanhuai Li
    Xuequn Shang
    Bolin Chen
    Weibang Li
    Zhilei Yin
    Frontiers of Computer Science, 2016, 10 : 1052 - 1066