Distributed optimization for penalized regression in massive compositional data

被引:0
|
作者
Chao, Yue [1 ,2 ]
Huang, Lei [3 ]
Ma, Xuejun [1 ]
机构
[1] Soochow Univ, Sch Math Sci, Dept Stat, Suzhou, Peoples R China
[2] Xiamen Univ, MOE Key Lab Econometr, WISE, Xiamen, Peoples R China
[3] Southwest Jiaotong Univ, Sch Math, Dept Stat, Chengdu, Peoples R China
基金
中国国家自然科学基金;
关键词
Massive compositional data; Distributed optimization; Augmented Lagrangian; Coordinate-wise descent; Variable selection; Medical insurance; QUANTILE REGRESSION; VARIABLE SELECTION; LIKELIHOOD; ALGORITHMS;
D O I
10.1016/j.apm.2025.115950
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Compositional data have been widely used in various fields to analyze parts of a whole, providing insights into proportional relationships. With the increasing availability of extraordinarily large compositional datasets, addressing the challenges of distributed statistical methodologies and computations has become essential in the era of big data. This paper focuses on the optimization methodology and practical application of the distributed sparse penalized linear log- contrast model for massive compositional data, specifically in the context of medical insurance reimbursement ratio prediction. We propose two distributed optimization techniques tailored for centralized and decentralized topologies to effectively tackle the constrained convex optimization problems that arise in this application. Our algorithms are rooted in the frameworks of the alternating direction method of multipliers and the coordinate descent method of multipliers, making them available for distributed data scenarios. Notably, in the decentralized topology, we introduce a distributed coordinate-wise descent algorithm that employs a group alternating direction method of multipliers to achieve efficient distributed regularized estimation. We rigorously present convergence analysis for our decentralized algorithm, ensuring its reliability for practical applications. Through numerical experiments on both simulated datasets and a real- world medical insurance dataset, we evaluate the performance of our proposed algorithms.
引用
收藏
页数:23
相关论文
共 50 条
  • [41] LASSO type penalized spline regression for binary data
    Muhammad Abu Shadeque Mullah
    James A. Hanley
    Andrea Benedetti
    BMC Medical Research Methodology, 21
  • [42] Robust penalized quantile regression estimation for panel data
    Lamarche, Carlos
    JOURNAL OF ECONOMETRICS, 2010, 157 (02) : 396 - 408
  • [43] Recent Advances on Penalized Regression Models for Biological Data
    Wang, Pei
    Chen, Shunjie
    Yang, Sijia
    MATHEMATICS, 2022, 10 (19)
  • [44] KERNEL-PENALIZED REGRESSION FOR ANALYSIS OF MICROBIOME DATA
    Randolph, Timothy W.
    Zhao, Sen
    Copeland, Wade
    Hullar, Meredith
    Shojaie, Ali
    ANNALS OF APPLIED STATISTICS, 2018, 12 (01): : 540 - 566
  • [45] Distributed penalizing function criterion for local polynomial estimation in nonparametric regression with massive data
    Sun, Tianqi
    Li, Weiyu
    Lin, Lu
    STATISTICAL PAPERS, 2025, 66 (03)
  • [46] Classical and Robust Regression Analysis with Compositional Data
    K. G. van den Boogaart
    P. Filzmoser
    K. Hron
    M. Templ
    R. Tolosana-Delgado
    Mathematical Geosciences, 2021, 53 : 823 - 858
  • [47] Regression Models for Compositional Data: General Log-Contrast Formulations, Proximal Optimization, and Microbiome Data Applications
    Combettes, Patrick L.
    Mueller, Christian L.
    STATISTICS IN BIOSCIENCES, 2021, 13 (02) : 217 - 242
  • [48] Regression Models for Compositional Data: General Log-Contrast Formulations, Proximal Optimization, and Microbiome Data Applications
    Patrick L. Combettes
    Christian L. Müller
    Statistics in Biosciences, 2021, 13 : 217 - 242
  • [49] Classical and Robust Regression Analysis with Compositional Data
    van den Boogaart, K. G.
    Filzmoser, P.
    Hron, K.
    Templ, M.
    Tolosana-Delgado, R.
    MATHEMATICAL GEOSCIENCES, 2021, 53 (05) : 823 - 858
  • [50] A Dirichlet Regression Model for Compositional Data with Zeros
    Tsagris M.
    Stewart C.
    Lobachevskii Journal of Mathematics, 2018, 39 (3) : 398 - 412