Regression applied to symbolic interval-spatial data

被引:0
|
作者
Freitas, Wanessa W. L. [1 ]
de Souza, Renata M. C. R. [1 ]
Amaral, Getulio J. A. [2 ]
de Moraes, Ronei M. [3 ]
机构
[1] Univ Fed Pernambuco, Ctr Informat, Ave Jornalista Anibal Fernandes S-N,Cidade Univ, BR-50740560 Recife, PE, Brazil
[2] Univ Fed Pernambuco, Dept Estat, Ave Jornalista Anibal Fernandes S-N,Cidade Univ, BR-50740560 Recife, PE, Brazil
[3] Univ Fed Paraiba, Dept Estat, BR-58050585 Joao Pessoa, PB, Brazil
关键词
Symbolic data analysis; Interval data; Spatial analysis; Regression; MODEL;
D O I
10.1007/s10489-023-05051-z
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Symbolic data analysis is a research area related to machine learning and statistics, which provides tools to describe geo-objects, and enables several types of variables to be dealt with, including interval type variables. Moreover, despite the recent progress in understanding symbolic data, there are no studies in the literature that address this type of data in the context of spatial data analysis. Thus, in this paper, we propose two different approaches of the spatial regression model for symbolic interval-valued data. The first fits a linear regression model on the minimum and maximum values of the interval values and the second fits a linear regression model on the center and range values of the interval. In order to evaluate the performance of these approaches, we have performed Monte Carlo simulations in which we calculated the mean value of the performance metric of the models analyzed. Furthermore, we also analyzed two applications involving real data. In the first, we examined the performance of the models in the Brazilian State of Pernambuco. In the second application, we analyzed the performance of the models for the Brazilian Northeastern region. Both applications were related to socioeconomic variables. We observed that in areas with less spatial variability, the interval spatial regression model performs better when compared with a usual method. When considering areas with a higher spatial variability, both ways presented similar results.
引用
收藏
页码:1545 / 1565
页数:21
相关论文
共 50 条
  • [41] Regression analysis of spatial data
    Beale, Colin M.
    Lennon, Jack J.
    Yearsley, Jon M.
    Brewer, Mark J.
    Elston, David A.
    ECOLOGY LETTERS, 2010, 13 (02) : 246 - 264
  • [42] Symbolic Regression Driven by Training Data and Prior Knowledge
    Kubalik, Jiri
    Derner, Erik
    Babuska, Robert
    GECCO'20: PROCEEDINGS OF THE 2020 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, 2020, : 958 - 966
  • [43] Searching for Nonlinear Relationships in fMRI Data with Symbolic Regression
    Hughes, James Alexander
    Daley, Mark
    PROCEEDINGS OF THE 2017 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE (GECCO'17), 2017, : 1129 - 1136
  • [44] Data-driven discovery of formulas by symbolic regression
    Sun, Sheng
    Ouyang, Runhai
    Zhang, Bochao
    Zhang, Tong-Yi
    MRS BULLETIN, 2019, 44 (07) : 559 - 564
  • [45] Inference of faults through symbolic regression of system data
    Hale, William T.
    Safikou, Efi
    Bollas, George M.
    COMPUTERS & CHEMICAL ENGINEERING, 2022, 157
  • [46] Data-driven discovery of formulas by symbolic regression
    Sheng Sun
    Runhai Ouyang
    Bochao Zhang
    Tong-Yi Zhang
    MRS Bulletin, 2019, 44 : 559 - 564
  • [47] REGRESSION-ANALYSIS OF INEQUALITY MEASURES APPLIED TO INTERVAL SCALES
    ALLISON, PD
    QUALITY & QUANTITY, 1982, 16 (03) : 251 - 260
  • [48] Linear regression with interval-valued data
    Sun, Yan
    WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS, 2016, 8 (01): : 54 - 60
  • [49] Nonparametric regression with interval-censored data
    Wen Li Deng
    Zu Kang Zheng
    Ri Quan Zhang
    Acta Mathematica Sinica, English Series, 2014, 30 : 1422 - 1434
  • [50] Random forests regression for soft interval data
    Gaona-Partida, Paul
    Yeh, Chih-Ching
    Sun, Yan
    Cutler, Adele
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2024,