Aggregation using input-output trade-off

Cited by: 4
Authors
Fischer, Aurelie [1]
Mougeot, Mathilde [1]
Affiliations
[1] Univ Paris Diderot, Lab Probabilites Stat & Modelisat, F-75013 Paris, France
Keywords
Classification; Regression estimation; Aggregation; Nonlinearity; Consistency; REGRESSION;
DOI
10.1016/j.jspi.2018.08.001
Chinese Library Classification
O21 [Probability theory and mathematical statistics]; C8 [Statistics]
Discipline classification codes
020208; 070103; 0714
Abstract
In this paper, we introduce a new learning strategy based on a seminal idea of Mojirsheibani (1999, 2000, 2002a, 2002b), who proposed a smart method for combining several classifiers, relying on a consensus notion. In many aggregation methods, the prediction for a new observation x is computed by building a linear or convex combination over a collection of basic estimators r_1(x), ..., r_m(x) previously calibrated using a training data set. Mojirsheibani proposes to compute the prediction associated with a new observation by combining selected outputs of the training examples. The output of a training example is selected if some kind of consensus is observed: the predictions computed for the training example with the different machines have to be "similar" to the prediction for the new observation. This approach has recently been extended to the context of regression in Biau et al. (2016). In the original scheme, the agreement condition is actually required to hold for all individual estimators, which appears inadequate if there is one bad initial estimator. In practice, a few disagreements are allowed; for establishing the theoretical results, the proportion of estimators satisfying the condition is required to tend to 1. In this paper, we propose an alternative procedure, mixing the previous consensus ideas on the predictions with the Euclidean distance computed between entries. This may be seen as an alternative approach that reduces the effect of a possibly bad estimator in the initial list, using a constraint on the inputs. We prove the consistency of our strategy in classification and in regression. We also provide some numerical experiments on simulated and real data to illustrate the benefits of this new aggregation method. On the whole, our practical study shows that our method may perform much better than the original combination technique, and, in particular, exhibit far less variance.
We also show on simulated examples that this procedure mixing inputs and outputs is still robust to high dimensional inputs. © 2018 Elsevier B.V. All rights reserved.
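The selection rule described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function name `aggregate`, the thresholds `eps` (output agreement) and `delta` (input distance), the proportion `min_agree`, and the `fallback` value are all hypothetical names chosen for illustration, and a plain average over the selected training outputs is assumed for the regression case. A training example is kept when a sufficient share of the machines give it a prediction close to the one they give for the new observation, and when its input lies within a Euclidean ball around the new input.

```python
import numpy as np

def aggregate(x, X_train, y_train, machines, eps, delta,
              min_agree=1.0, fallback=0.0):
    """Illustrative consensus aggregation mixing outputs and inputs.

    All parameter names are assumptions made for this sketch:
    machines  -- list of callables mapping an (n, d) array to n predictions
    eps       -- tolerance on |r_k(X_i) - r_k(x)| for machine k
    delta     -- tolerance on the Euclidean distance ||X_i - x||
    min_agree -- required proportion of machines in agreement (1.0 = all)
    fallback  -- value returned when no training example is selected
    """
    X_train = np.asarray(X_train, dtype=float)
    y_train = np.asarray(y_train, dtype=float)
    x = np.asarray(x, dtype=float)

    # Predictions of every machine on the training inputs, shape (m, n),
    # and on the new observation, shape (m,).
    preds_train = np.array([r(X_train) for r in machines])
    preds_x = np.array([r(x[None, :])[0] for r in machines])

    # Output condition: enough machines give "similar" predictions.
    agree = np.abs(preds_train - preds_x[:, None]) <= eps
    output_ok = agree.mean(axis=0) >= min_agree

    # Input condition: the training input is close to x in Euclidean distance.
    input_ok = np.linalg.norm(X_train - x, axis=1) <= delta

    selected = output_ok & input_ok
    if not selected.any():
        return fallback
    # Regression case: average the selected training outputs.
    return float(y_train[selected].mean())
```

With an uninformative machine (for instance one that predicts a constant), every training example passes the output condition, so the input constraint is the only thing that keeps distant examples out of the average. This is one way to read the abstract's remark about reducing the effect of a bad initial estimator.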
Pages: 1-19
Number of pages: 19