An optimal DNA segmentation based on the MDL principle

被引:6
|
作者
Szpankowski, W [1 ]
Ren, WH [1 ]
Szpankowski, L [1 ]
机构
[1] Purdue Univ, Dept Comp Sci, W Lafayette, IN 47907 USA
关键词
D O I
10.1109/CSB.2003.1227402
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
The biological world is highly stochastic as well as inhomogeneous in its behavior The transition between homogeneous and inhomogeneous regions of DNA, known also as change points, carry important biological information. Our goal is to employ rigorous methods of information theory to quantify structural properties of DNA sequences. In particular, we adopt the Stein-Ziv lemma to find asymptotically optimal discriminant function that determines whether two DNA segments are generated by the same source and assuring exponentially small false positives. Then we apply the Minimum Description Length (MDL) principle to select parameters of our segmentation algorithm. Finally, we perform extensive experimental work on human chromosome 9. After grouping A and G (purines) and Tand C (pyrimidines) we discover change points between coding and noncoding regions as well as the beginning of a CpG island.
引用
收藏
页码:541 / 546
页数:6
相关论文
共 50 条
  • [21] Covariance Matrix Estimation with Multi-Regularization Parameters based on MDL Principle
    Xiuling Zhou
    Ping Guo
    C. L. Philip Chen
    Neural Processing Letters, 2013, 38 : 227 - 238
  • [22] Covariance Matrix Estimation with Multi-Regularization Parameters based on MDL Principle
    Zhou, Xiuling
    Guo, Ping
    Chen, C. L. Philip
    NEURAL PROCESSING LETTERS, 2013, 38 (02) : 227 - 238
  • [23] New paradigm of learnable computer vision algorithms based on the representational MDL principle
    Potapov, Alexey S.
    Malyshev, Igor A.
    Puysha, Alexander E.
    Averkin, Anton N.
    AUTOMATIC TARGET RECOGNITION XX; ACQUISITION, TRACKING, POINTING, AND LASER SYSTEMS TECHNOLOGIES XXIV; AND OPTICAL PATTERN RECOGNITION XXI, 2010, 7696
  • [24] Botnet Detection Based on Non-negative Matrix Factorization and the MDL Principle
    Yamauchi, Sayaka
    Kawakita, Masanori
    Takeuchi, Jun'ichi
    NEURAL INFORMATION PROCESSING, ICONIP 2012, PT V, 2012, 7667 : 400 - 409
  • [25] Is My Neural Net Driven by the MDL Principle?
    Brandao, Eduardo
    Duffner, Stefan
    Emonet, Rémi
    Habrard, Amaury
    Jacquenet, François
    Sebban, Marc
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2023, 14170 LNAI : 173 - 189
  • [26] Alternating projection algorithm for detecting the number of coherent signals based on the MDL principle
    Suzuki, M
    Sanada, H
    Nagai, N
    2000 IEEE ASIA-PACIFIC CONFERENCE ON CIRCUITS AND SYSTEMS: ELECTRONIC COMMUNICATION SYSTEMS, 2000, : 699 - 702
  • [27] Is My Neural Net Driven by the MDL Principle?
    Brandao, Eduardo
    Duffner, Stefan
    Emonet, Remi
    Habrard, Amaury
    Jacquenet, Francois
    Sebban, Marc
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: RESEARCH TRACK, ECML PKDD 2023, PT II, 2023, 14170 : 173 - 189
  • [28] DETECTION OF THE NUMBER OF COHERENT SIGNALS BY THE MDL PRINCIPLE
    WAX, M
    ZISKIND, I
    IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1989, 37 (08): : 1190 - 1196
  • [29] NONPARAMETRIC MDL SEGMENTATION OF INHOMOGENEOUS IMAGES BASED ON QUADRATIC LOCAL BINARY FITTING
    Liu, Siwei
    Galland, Frederic
    Bertaux, Nicolas
    2014 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2014, : 6071 - 6075
  • [30] Modeling multisource remote sensing image classifier based on the MDL principle: Theoretical aspects
    Yin, Qian
    Guo, Ping
    Yuan, Zhi-Yong
    Wei, Zu-Kuan
    Zeng, Wen-Yi
    PROCEEDINGS OF 2008 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2008, : 3497 - +