An optimal DNA segmentation based on the MDL principle

Cited by: 6
Authors
Szpankowski, W [1]
Ren, WH [1]
Szpankowski, L [1]
Affiliation
[1] Purdue Univ, Dept Comp Sci, W Lafayette, IN 47907 USA
DOI
10.1109/CSB.2003.1227402
Chinese Library Classification (CLC) number
Q [Biological Sciences];
Discipline classification code
07 ; 0710 ; 09 ;
Abstract
The biological world is highly stochastic as well as inhomogeneous in its behavior. The transitions between homogeneous and inhomogeneous regions of DNA, also known as change points, carry important biological information. Our goal is to employ rigorous methods of information theory to quantify structural properties of DNA sequences. In particular, we adopt the Stein-Ziv lemma to find an asymptotically optimal discriminant function that determines whether two DNA segments are generated by the same source while ensuring an exponentially small probability of false positives. Then we apply the Minimum Description Length (MDL) principle to select the parameters of our segmentation algorithm. Finally, we perform extensive experimental work on human chromosome 9. After grouping A and G (purines) and T and C (pyrimidines), we discover change points between coding and noncoding regions as well as the beginning of a CpG island.
Pages: 541-546
Page count: 6
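
The purine/pyrimidine grouping and the two-segment comparison described in the abstract can be illustrated with a small sketch. This is not the authors' algorithm: the record does not give their discriminant function, so the Jensen-Shannon divergence between symbol frequencies is swapped in here as a simple stand-in, and the window length is left to the caller rather than selected by MDL. All function names (`to_purine_pyrimidine`, `js_divergence`, `best_split`) are hypothetical.

```python
import math
from collections import Counter

# Grouping named in the abstract: A, G (purines) -> R; T, C (pyrimidines) -> Y.
GROUP = {"A": "R", "G": "R", "T": "Y", "C": "Y"}


def to_purine_pyrimidine(seq):
    """Map a DNA string to the alphabet {R, Y}; other symbols (e.g. N) are skipped."""
    return "".join(GROUP[b] for b in seq.upper() if b in GROUP)


def empirical_dist(seq, alphabet="RY"):
    """Relative frequency of each alphabet symbol in a non-empty sequence."""
    counts = Counter(seq)
    return [counts[s] / len(seq) for s in alphabet]


def kl_bits(p, q):
    """Kullback-Leibler divergence D(p || q) in bits; terms with p_i = 0 contribute 0."""
    return sum(pi * math.log2(pi / qi) for pi, qi in zip(p, q) if pi > 0)


def js_divergence(left, right):
    """Jensen-Shannon divergence (bits) between the symbol frequencies of two segments.

    Assumption: an illustrative stand-in, not the paper's discriminant function.
    """
    p, q = empirical_dist(left), empirical_dist(right)
    m = [(a + b) / 2 for a, b in zip(p, q)]
    return 0.5 * kl_bits(p, m) + 0.5 * kl_bits(q, m)


def best_split(seq, window):
    """Return (position, score) where adjacent windows of the R/Y sequence differ most."""
    ry = to_purine_pyrimidine(seq)
    if window < 1 or len(ry) < 2 * window:
        raise ValueError("sequence too short for the requested window")
    scores = [
        (js_divergence(ry[i - window:i], ry[i:i + window]), i)
        for i in range(window, len(ry) - window + 1)
    ]
    score, pos = max(scores)
    return pos, score
```

For example, `best_split("AG" * 20 + "TC" * 20, window=10)` places the candidate change point at the boundary between the purine-only and pyrimidine-only halves. A practical segmenter would also need a decision threshold and a principled window choice, which the abstract says the authors obtain from the MDL principle.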