Data Selection Using Support Vector Regression

Authors
Michael B. RICHMAN [1]
Lance M. LESLIE [1]
Theodore B. TRAFALIS [2]
Hicham MANSOURI [3]
Affiliations
[1] School of Meteorology and Cooperative Institute for Mesoscale Meteorological Studies, University of Oklahoma, Norman, Oklahoma, 73072, USA
[2] School of Industrial and Systems Engineering, University of Oklahoma, Norman, Oklahoma, 73019, USA
Keywords
data selection; data thinning; machine learning; support vector regression; Voronoi tessellation; pipeline methods
DOI
Not available
Chinese Library Classification number
P413 [Data processing]
Discipline classification code
0706; 070601
Abstract
Geophysical data sets are growing at an ever-increasing rate, requiring computationally efficient data selection (thinning) methods to preserve essential information. Satellites, such as WindSat, provide large data sets for assessing the accuracy and computational efficiency of data selection techniques. A new data thinning technique, based on support vector regression (SVR), is developed and tested. To manage large online satellite data streams, observations from WindSat are formed into subsets by Voronoi tessellation and then each subset is thinned by SVR (TSVR). Three experiments are performed. The first confirms the viability of TSVR for a relatively small sample, comparing it to several commonly used data thinning methods (random selection, averaging and Barnes filtering), producing a 10% thinning rate (90% data reduction), low mean absolute errors (MAE) and large correlations with the original data. A second experiment, using a larger dataset, shows TSVR retrievals with MAE < 1 m s⁻¹ and correlations ≥ 0.98. TSVR was an order of magnitude faster than the commonly used thinning methods. A third experiment applies a two-stage pipeline to TSVR, to accommodate online data. The pipeline subsets reconstruct the wind field with the same accuracy as the second experiment and are an order of magnitude faster than the nonpipeline TSVR. Therefore, pipeline TSVR is two orders of magnitude faster than commonly used thinning methods that ingest the entire data set. This study demonstrates that TSVR pipeline thinning is an accurate and computationally efficient alternative to commonly used data selection techniques.
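The subsetting and scoring steps named in the abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: all function names are hypothetical, distances are planar Euclidean (satellite data on a sphere would more plausibly need great-circle distance), and the SVR fit is omitted, so the block only shows how observations might be grouped into Voronoi cells and how a thinning rate, MAE and correlation could be computed.

```python
import math


def assign_to_cells(points, seeds):
    """Group point indices by nearest seed (a discrete Voronoi tessellation).

    Assumption: planar Euclidean distance; the abstract does not state the metric.
    """
    cells = {i: [] for i in range(len(seeds))}
    for idx, (x, y) in enumerate(points):
        nearest = min(
            range(len(seeds)),
            key=lambda s: (x - seeds[s][0]) ** 2 + (y - seeds[s][1]) ** 2,
        )
        cells[nearest].append(idx)
    return cells


def thinning_rate(n_kept, n_total):
    """Fraction of observations retained; 0.10 corresponds to 90% data reduction."""
    return n_kept / n_total


def mean_absolute_error(original, reconstructed):
    """Mean absolute difference between original and reconstructed values."""
    return sum(abs(a - b) for a, b in zip(original, reconstructed)) / len(original)


def pearson_correlation(a, b):
    """Pearson correlation coefficient between two equal-length sequences."""
    mean_a = sum(a) / len(a)
    mean_b = sum(b) / len(b)
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    return cov / math.sqrt(var_a * var_b)
```

In a fuller sketch, each cell returned by `assign_to_cells` would be passed to an SVR fit, and the field reconstructed from the retained observations would be compared with the original using `mean_absolute_error` and `pearson_correlation`.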
Pages: 277-286
Number of pages: 10
Source
Richman, Michael B.; Leslie, Lance M.; Trafalis, Theodore B.; Mansouri, Hicham. Data selection using support vector regression [J]. Advances in Atmospheric Sciences, 2015, 32 (03): 277-286