A Clustering-Based Data Reduction for the Large Automotive Datasets

被引:0
|
作者
Siwek, Patryk [1 ]
Skruch, Pawel [2 ]
Dlugosz, Marek [2 ]
机构
[1] Aptiv Serv Poland SA, Krakow, Poland
[2] AGH Univ Sci & Technol, Krakow, Poland
关键词
large dataset; automotive; reduction; clustering; perception;
D O I
10.1109/MMAR58394.2023.10242489
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Large datasets used in automotive consist of a set of recorded sequences that represent possible road scenarios. Such scenarios are mainly utilized as test scenarios to verify developed driver assistance systems. Another application of the dataset is the training and verification of machine learning-based algorithms. As the number of possible road scenarios is, in fact, infinite, the process of selecting representative and meaningful sequences is a difficult and challenging task. This article presents an approach based on various clustering techniques for data reduction for large datasets that are used in the automotive industry to evaluate environmental perception algorithms. The approach is supported by the results obtained on representative datasets.
引用
收藏
页码:234 / 239
页数:6
相关论文
共 50 条
  • [1] A Clustering-Based Data Reduction for Very Large Spatio-Temporal Datasets
    Le-Khac, Nhien-An
    Bue, Martin
    Whelan, Michael
    Kechadi, M-Tahar
    ADVANCED DATA MINING AND APPLICATIONS (ADMA 2010), PT II, 2010, 6441 : 43 - 54
  • [2] A clustering-based hybrid approach for dual data reduction
    Ratnoo, Saroj
    Rathee, Seema
    Ahuja, Jyoti
    INTERNATIONAL JOURNAL OF INTELLIGENT ENGINEERING INFORMATICS, 2018, 6 (05) : 468 - 490
  • [3] Nonlinear clustering-based support vector machine for large data sets
    Wang, Yongqiao
    Zhang, Xun
    Wang, Souyang
    Lai, K. K.
    OPTIMIZATION METHODS & SOFTWARE, 2008, 23 (04): : 533 - 549
  • [4] Clustering-based nonlinear dimensionality reduction on manifold
    Wen, Guihua
    Jiang, Lijun
    Wen, Jun
    Shadbolt, Nigel R.
    PRICAI 2006: TRENDS IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2006, 4099 : 444 - 453
  • [5] Data summarization based fast hierarchical clustering method for large datasets
    Patra, Bidyut Kr.
    Nandi, Sukumar
    Viswanath, P.
    2009 INTERNATIONAL CONFERENCE ON INFORMATION MANAGEMENT AND ENGINEERING, PROCEEDINGS, 2009, : 278 - +
  • [6] Clustering Large Datasets Using Data Stream Clustering Techniques
    Bolanos, Matthew
    Forrest, John
    Hahsler, Michael
    DATA ANALYSIS, MACHINE LEARNING AND KNOWLEDGE DISCOVERY, 2014, : 135 - 143
  • [7] Clustering-based Partitioning for Large Web Graphs
    Kong, Deyu
    Xie, Xike
    Zhang, Zhuoxu
    2022 IEEE 38TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2022), 2022, : 593 - 606
  • [8] Clustering-based Data Transmission Algorithms for VANET
    Chai, Rong
    Yang, Bin
    Li, Lifan
    Sun, Xiao
    Chen, Qianbin
    2013 INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING (WCSP 2013), 2013,
  • [9] Clustering-based approaches to SAGE data mining
    Haiying Wang
    Huiru Zheng
    Francisco Azuaje
    BioData Mining, 1
  • [10] Clustering-based approach for medical data classification
    Kodabagi, Mallikarjun M.
    Tikotikar, Ahelam
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2019, 31 (14):