P-value based visualization of codon usage data

被引:2
|
作者
Meinicke, Peter [1 ]
Brodag, Thomas [2 ]
Fricke, Wolfgang Florian [3 ]
Waack, Stephan [2 ]
机构
[1] Univ Gottingen, Abt Bioinform, Inst Mikrobiol & Genet, D-37077 Gottingen, Germany
[2] Univ Gottingen, Inst Numer & Angew Math, D-37083 Gottingen, Germany
[3] Univ Gottingen, Gottingen Genom Lab, D-37077 Gottingen, Germany
关键词
D O I
10.1186/1748-7188-1-10
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Two important and not yet solved problems in bacterial genome research are the identification of horizontally transferred genes and the prediction of gene expression levels. Both problems can be addressed by multivariate analysis of codon usage data. In particular dimensionality reduction methods for visualization of multivariate data have shown to be effective tools for codon usage analysis. We here propose a multidimensional scaling approach using a novel similarity measure for codon usage tables. Our probabilistic similarity measure is based on P-values derived from the well-known chi-square test for comparison of two distributions. Experimental results on four microbial genomes indicate that the new method is well-suited for the analysis of horizontal gene transfer and translational selection. As compared with the widely-used correspondence analysis, our method did not suffer from outlier sensitivity and showed a better clustering of putative alien genes in most cases.
引用
收藏
页数:7
相关论文
共 50 条
  • [41] Significance level, p-value
    Sallat, Stephan
    SPRACHE-STIMME-GEHOR, 2024, 48 (01): : 13 - 13
  • [42] Commentary:: The P-value, devalued
    Goodman, S
    INTERNATIONAL JOURNAL OF EPIDEMIOLOGY, 2003, 32 (05) : 699 - 702
  • [43] On P-value Combination Procedures
    Meng, Zhen
    Shi, Yu Ke
    Lin, Jin Yi
    Li, Qi Zhai
    ACTA MATHEMATICA SINICA-ENGLISH SERIES, 2025, 41 (02) : 569 - 587
  • [44] p-Value: Villain or Scapegoat?
    Gachabayov, Mahir
    Fingerhut, Abraham
    SURGICAL TECHNOLOGY INTERNATIONAL-INTERNATIONAL DEVELOPMENTS IN SURGERY AND SURGICAL RESEARCH, 2019, 35
  • [45] On the Model-Based Bootstrap With Missing Data: Obtaining a P-Value for a Test of Exact Fit
    Savalei, Victoria
    Yuan, Ke-Hai
    MULTIVARIATE BEHAVIORAL RESEARCH, 2009, 44 (06) : 741 - 763
  • [46] DASS:: efficient discovery and p-value calculation of substructures in unordered data
    Hollunder, Jens
    Friedel, Maik
    Beyer, Andreas
    Workman, Christopher T.
    Wilhelm, Thomas
    BIOINFORMATICS, 2007, 23 (01) : 77 - 83
  • [47] On dependence assumption in p-value based multiple test procedures
    Gou, Jiangtao
    JOURNAL OF BIOPHARMACEUTICAL STATISTICS, 2023, 33 (05) : 596 - 610
  • [48] Efficient p-value evaluation for resampling-based tests
    Yu, Kai
    Liang, Faming
    Ciampa, Julia
    Chatterjee, Nilanjan
    BIOSTATISTICS, 2011, 12 (03) : 582 - 593
  • [49] Asymptotics for p-value based threshold estimation in regression settings
    Mallik, Atul
    Banerjee, Moulinath
    Sen, Bodhisattva
    ELECTRONIC JOURNAL OF STATISTICS, 2013, 7 : 2477 - 2515
  • [50] Gumbel based p-value approximations for spatial scan statistics
    Allyson M Abrams
    Ken Kleinman
    Martin Kulldorff
    International Journal of Health Geographics, 9