Interval-Valued Regression - Sensitivity to Data Set Features

被引:0
|
作者
Kabir, Shaily [1 ]
Wagner, Christian [1 ]
机构
[1] Univ Nottingham, Sch Comp Sci, Lab Uncertainty Data & Decis Making LUCID, Nottingham, England
基金
英国工程与自然科学研究理事会;
关键词
Intervals; regression; uncertainty; linear;
D O I
10.1109/FUZZ45933.2021.9494554
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Regression represents one of the most basic building blocks of data analysis and AI. Despite growing interest in interval-valued data across various fields, approaches to establish regression models for interval-valued data which address and handle the specific properties of given data sets are very limited. For broader use and adoption of regression for intervals, this paper conducts a sensitivity analysis of key extant linear regression approaches in respect to important features of interval-valued data sets, such as the mean and associated standard deviation of the range (size) of the intervals within the data set-a measure of overall size and size-diversity, and the dispersion of interval-centers-a measure of diversity in terms of interval position. Experiments with carefully designed synthetic exemplar data sets with these properties suggest that distant placement of intervals as well as higher standard deviation (uncertainty) of ranges increase estimation errors; that is, they result in lower linear regression model fitness for all regression methods as may intuitively be expected. However, these errors are lower for the best suited Parameterized model in comparison to the MinMax and Constrained Center and Range methods. This paper sheds light on the behaviour and 'expectable' performance of key linear regression models designed for interval-valued data and adds a further building block to supporting the broader adoption of intervals (and subsequently fuzzy sets) as a fundamental data type in AI.
引用
收藏
页数:7
相关论文
共 50 条
  • [1] Regression analysis for interval-valued data
    Billard, L
    Diday, E
    [J]. DATA ANALYSIS, CLASSIFICATION, AND RELATED METHODS, 2000, : 369 - 374
  • [2] Linear regression with interval-valued data
    Sun, Yan
    [J]. WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS, 2016, 8 (01): : 54 - 60
  • [3] Constrained Regression for Interval-Valued Data
    Gonzalez-Rivera, Gloria
    Lin, Wei
    [J]. JOURNAL OF BUSINESS & ECONOMIC STATISTICS, 2013, 31 (04) : 473 - 490
  • [4] Resistant Regression for Interval-Valued Data
    Renan, Jobson
    Silva, Jornandes Dias
    Galdino, Sergio
    [J]. 2013 1ST BRICS COUNTRIES CONGRESS ON COMPUTATIONAL INTELLIGENCE AND 11TH BRAZILIAN CONGRESS ON COMPUTATIONAL INTELLIGENCE (BRICS-CCI & CBIC), 2013, : 277 - 281
  • [5] Quantile Regression of Interval-Valued Data
    Fagundes, Roberta A. A.
    de Souza, Renata M. C. R.
    Soares, Yanne M. G.
    [J]. 2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 2586 - 2591
  • [6] Compositional Linear Regression on Interval-valued Data
    Pekaslan, Direnc
    Wagner, Christian
    [J]. 2021 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI 2021), 2021,
  • [7] Nonlinear regression applied to interval-valued data
    Eufrásio de A. Lima Neto
    Francisco de A. T. de Carvalho
    [J]. Pattern Analysis and Applications, 2017, 20 : 809 - 824
  • [8] Nonlinear regression applied to interval-valued data
    Lima Neto, Eufrasio de A.
    de Carvalho, Francisco de A. T.
    [J]. PATTERN ANALYSIS AND APPLICATIONS, 2017, 20 (03) : 809 - 824
  • [9] Multiple Linear Regression Models on Interval-valued Dengue Data with Interval-valued Climatic Variables
    Attanayake, A. M. C. H.
    Perera, S. S. N.
    Liyanage, U. P.
    [J]. INTERNATIONAL JOURNAL OF APPLIED MATHEMATICS & STATISTICS, 2020, 59 (03): : 49 - 60
  • [10] Linear regression analysis for interval-valued functional data
    Nasirzadeh, Roya
    Nasirzadeh, Fariba
    Mohammadi, Zohreh
    [J]. STAT, 2021, 10 (01):