From typical sequences to typical genotypes

被引:1
|
作者
Tal, Omri [1 ]
Tran, Tat Dat [1 ]
Portegies, Jacobus [1 ]
机构
[1] Max Planck Inst Math Sci, Inselstr 22, D-04103 Leipzig, Germany
关键词
Typical sequences; Typical genotypes; Population entropy rate; Population cross entropy rate; Classification; INFORMATION-THEORY; GENETIC-MARKERS;
D O I
10.1016/j.jtbi.2017.02.010
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
We demonstrate an application of a core notion of information theory, typical sequences and their related properties, to analysis of population genetic data. Based on the asymptotic equipartition property (AEP) for nonstationary discrete-time sources producing independent symbols, we introduce the concepts of typical genotypes and population entropy and cross entropy rate. We analyze three perspectives on typical genotypes: a set perspective on the interplay of typical sets of genotypes from two populations, a geometric perspective on their structure in high dimensional space, and a statistical learning perspective on the prospects of constructing typical-set based classifiers. In particular, we show that such classifiers have a surprising resilience to noise originating from small population samples, and highlight the potential for further links between inference and communication.
引用
下载
收藏
页码:159 / 183
页数:25
相关论文
共 50 条
  • [1] From Typical Sequences to Typical Genotype
    Tal, O.
    HUMAN HEREDITY, 2015, 80 (03) : 122 - 122
  • [2] Empirical Processes and Typical Sequences
    Raginsky, Maxim
    2010 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY, 2010, : 1458 - 1462
  • [3] Typical sequences extraction and recognition
    Ma, GY
    Lin, XY
    COMPUTER VISION IN HUMAN-COMPUTER INTERACTION, PROCEEDINGS, 2004, 3058 : 60 - 71
  • [4] Mining Typical Order Sequences from EHR for Building Clinical Pathways
    Hirano, Shoji
    Tsumoto, Shusaku
    TRENDS AND APPLICATIONS IN KNOWLEDGE DISCOVERY AND DATA MINING, 2014, 8643 : 39 - 49
  • [5] Memory for sequences of events impaired in typical aging
    Allen, Timothy A.
    Morris, Andrea M.
    Stark, Shauna M.
    Fortin, Norbert J.
    Stark, Craig E. L.
    LEARNING & MEMORY, 2015, 22 (03) : 138 - 148
  • [6] On a Markov Lemma and Typical Sequences for Polish Alphabets
    Mitran, Patrick
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2015, 61 (10) : 5342 - 5356
  • [7] Typical duration of good seeing sequences at Concordia
    Fossat, E.
    Aristidi, E.
    Agabi, A.
    Bondoux, E.
    Challita, Z.
    Jeanneaux, F.
    Mekarnia, D.
    ASTRONOMY & ASTROPHYSICS, 2010, 517
  • [8] Integrating Words That Refer to Typical Sequences of Events
    Khalkhali, Saman
    Wammes, Jeffrey
    McRae, Ken
    CANADIAN JOURNAL OF EXPERIMENTAL PSYCHOLOGY-REVUE CANADIENNE DE PSYCHOLOGIE EXPERIMENTALE, 2012, 66 (02): : 106 - 114
  • [9] Typical Peak Sidelobe Level of Binary Sequences
    Litsyn, Simon
    Shpunt, Alexander
    2008 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY PROCEEDINGS, VOLS 1-6, 2008, : 1755 - +
  • [10] Typical Peak Sidelobe Level of Binary Sequences
    Alon, Noga
    Litsyn, Simon
    Shpunt, Alexander
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2010, 56 (01) : 545 - 554