Log-ratio lasso: Scalable, sparse estimation for log-ratio models

被引:18
|
作者
Bates, Stephen [1 ]
Tibshirani, Robert [1 ,2 ]
机构
[1] Stanford Univ, Dept Stat, Stanford, CA 94305 USA
[2] Stanford Univ, Dept Biomed Data Sci, Stanford, CA 94305 USA
关键词
compositional data; lasso; log-ratio; mass spectrometry; variable selection; POST-SELECTION INFERENCE; VARIABLE SELECTION; REGRESSION;
D O I
10.1111/biom.12995
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Positive-valued signal data is common in the biological and medical sciences, due to the prevalence of mass spectrometry other imaging techniques. With such data, only the relative intensities of the raw measurements are meaningful. It is desirable to consider models consisting of the log-ratios of all pairs of the raw features, since log-ratios are the simplest meaningful derived features. In this case, however, the dimensionality of the predictor space becomes large, and computationally efficient estimation procedures are required. In this work, we introduce an embedding of the log-ratio parameter space into a space of much lower dimension and use this representation to develop an efficient penalized fitting procedure. This procedure serves as the foundation for a two-step fitting procedure that combines a convex filtering step with a second non-convex pruning step to yield highly sparse solutions. On a cancer proteomics data set, the proposed method fits a highly sparse model consisting of features of known biological relevance while greatly improving upon the predictive accuracy of less interpretable methods.
引用
收藏
页码:613 / 624
页数:12
相关论文
共 50 条
  • [1] The log-ratio beam position monitor
    Kalinin, A
    [J]. BEAM INSTRUMENTATION WORKSHOP 2002, 2002, 648 : 384 - 392
  • [2] Log-ratio methods in mixture models for compositional data sets
    Comas-Cufi, M.
    Martin-Fernandez, J. A.
    Mateu-Figueras, G.
    [J]. SORT-STATISTICS AND OPERATIONS RESEARCH TRANSACTIONS, 2016, 40 (02) : 349 - 374
  • [3] Milne's Argument for the Log-Ratio Measure
    Huber, Franz
    [J]. PHILOSOPHY OF SCIENCE, 2008, 75 (04) : 413 - 420
  • [4] LOG-RATIO MODULE MEASURES HIGH RESISTANCES
    CROWLEY, B
    [J]. ELECTRONICS, 1976, 49 (18): : 109 - &
  • [5] Log-ratio compositional data analysis in archaeometry
    Baxter, M. J.
    Freestone, I. C.
    [J]. ARCHAEOMETRY, 2006, 48 : 511 - 531
  • [6] Numerical Stability Analysis of the Centered Log-Ratio Transformation
    Galletti, Ardelio
    Maratea, Antonio
    [J]. 2016 12TH INTERNATIONAL CONFERENCE ON SIGNAL-IMAGE TECHNOLOGY & INTERNET-BASED SYSTEMS (SITIS), 2016, : 713 - 716
  • [7] A LOG-RATIO PAIR APPROACH TO ENDOSCOPIC IMAGE MATCHING
    Karim, Rohana Abdul
    Mustafa, Mohd Marzuki
    Zulkifley, Mohd Asyraf
    [J]. 2014 IEEE WORKSHOP ON STATISTICAL SIGNAL PROCESSING (SSP), 2014, : 185 - 188
  • [8] Log-Ratio Analysis Is a Limiting Case of Correspondence Analysis
    Greenacre, Michael
    [J]. MATHEMATICAL GEOSCIENCES, 2010, 42 (01) : 129 - 134
  • [9] Log-Ratio Analysis Is a Limiting Case of Correspondence Analysis
    Michael Greenacre
    [J]. Mathematical Geosciences, 2010, 42 : 129 - 134
  • [10] LOG-RATIO TECHNIQUE FOR BEAM POSITION MONITOR SYSTEMS
    AIELLO, GR
    MILLS, MR
    [J]. NUCLEAR INSTRUMENTS & METHODS IN PHYSICS RESEARCH SECTION A-ACCELERATORS SPECTROMETERS DETECTORS AND ASSOCIATED EQUIPMENT, 1994, 346 (03): : 426 - 432