A MULTIVARIATE REPRESENTATION AND ANALYSIS OF DNA-SEQUENCE DATA

被引:17
|
作者
JONSSON, J
ERIKSSON, L
HELLBERG, S
LINDGREN, F
SJOSTROM, M
WOLD, S
机构
[1] Research Group for Chemometrics, Department of Organic Chemistry, University of Umeå, Umeå
来源
ACTA CHEMICA SCANDINAVICA | 1991年 / 45卷 / 02期
关键词
D O I
10.3891/acta.chem.scand.45-0186
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
A new way to represent and analyze DNA sequence data is described. This approach complements methods currently used, in that it allows the systematic part of the variation between different sequences to be modeled. This can prove as informative as absence of variation (homology), which is the most widely used criterion for comparing sequence data. A multivariate sequence-activity model (SAM), for DNA-promoter sequences is presented, by which the relative promoter strength is modeled in terms of the primary DNA-sequence. The model is shown to have a good predictive capability. The coefficients from the model are interpreted, and used to design new structures predicted to be strong promoters in the system investigated. The approach described is also applicable to other kinds of sequence data, e.g. RNAs, proteins or peptides.
引用
收藏
页码:186 / 192
页数:7
相关论文
共 50 条
  • [1] ANALYSIS OF DNA-SEQUENCE DATA - PHYLOGENETIC INFERENCE
    HILLIS, DM
    ALLARD, MW
    MIYAMOTO, MM
    [J]. MOLECULAR EVOLUTION: PRODUCING THE BIOCHEMICAL DATA, 1993, 224 : 456 - 487
  • [2] DNA-SEQUENCE ANALYSIS
    WU, R
    [J]. ANNUAL REVIEW OF BIOCHEMISTRY, 1978, 47 : 607 - 634
  • [3] PHYLOGENY OF METARHIZIUM - ANALYSIS OF RIBOSOMAL DNA-SEQUENCE DATA
    CURRAN, J
    DRIVER, F
    BALLARD, JWO
    MILNER, RJ
    [J]. MYCOLOGICAL RESEARCH, 1994, 98 : 547 - 552
  • [4] THE ANALYSIS OF POPULATION SURVEY DATA ON DNA-SEQUENCE VARIATION
    LYNCH, M
    CREASE, TJ
    [J]. MOLECULAR BIOLOGY AND EVOLUTION, 1990, 7 (04) : 377 - 394
  • [5] SAMPLING PROPERTIES OF DNA-SEQUENCE DATA IN PHYLOGENETIC ANALYSIS
    CUMMINGS, MP
    OTTO, SP
    WAKELEY, J
    [J]. MOLECULAR BIOLOGY AND EVOLUTION, 1995, 12 (05) : 814 - 822
  • [6] AUTOMATED DNA-SEQUENCE ANALYSIS
    SMITH, LM
    [J]. SCIENCE, 1987, 235 : G89 - G89
  • [7] DNA-SEQUENCE ANALYSIS AND THE COMPUTER
    JONES, MD
    [J]. BIOCHEMICAL SOCIETY TRANSACTIONS, 1984, 12 (06) : 1018 - 1020
  • [8] COMPUTATIONAL DNA-SEQUENCE ANALYSIS
    KARLIN, S
    CARDON, LR
    [J]. ANNUAL REVIEW OF MICROBIOLOGY, 1994, 48 : 619 - 654
  • [9] AUTOMATED DNA-SEQUENCE ANALYSIS
    CONNELL, C
    FUNG, S
    HEINER, C
    BRIDGHAM, J
    CHAKERIAN, V
    HERON, E
    JONES, B
    MENCHEN, S
    MORDAN, W
    RAFF, M
    RECKNOR, M
    SMITH, L
    SPRINGER, J
    WOO, S
    HUNKAPILLER, M
    [J]. BIOTECHNIQUES, 1987, 5 (04) : 342 - &
  • [10] RAPID DNA-SEQUENCE ANALYSIS
    AIR, GM
    [J]. CRC CRITICAL REVIEWS IN BIOCHEMISTRY, 1979, 6 (01): : 1 - 33