Incremental k-Nearest Neighbors Using Reservoir Sampling for Data Streams

被引:1
|
作者
Bahri, Maroua [1 ]
Bifet, Albert [1 ,2 ]
机构
[1] IP Paris, LTCI, Telecom Paris, Paris, France
[2] Univ Waikato, Hamilton, New Zealand
来源
DISCOVERY SCIENCE (DS 2021) | 2021年 / 12986卷
关键词
Data stream classification; K-nearest neighbors; Reservoir sampling; Sliding window;
D O I
10.1007/978-3-030-88942-5_10
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The online and potentially infinite nature of data streams leads to the inability to store the flow in its entirety and thus restricts the storage to a part of - and/or synopsis information from - the stream. To process these evolving data, we need efficient and accurate methodologies and systems, such as window models (e.g., sliding windows) and summarization techniques (e.g., sampling, sketching, dimensionality reduction). In this paper, we propose, RW-kNN, a k-Nearest Neighbors (kNN) algorithm that employs a practical way to store information about past instances using the biased reservoir sampling to sample the input instances along with a sliding window to maintain the most recent instances from the stream. We evaluate our proposal on a diverse set of synthetic and real datasets and compare against state-of-the-art algorithms in a traditional test-then-train evaluation. Results show how our proposed RW-kNN approach produces high-predictive performance for both real and synthetic datasets while using a feasible amount of resources.
引用
收藏
页码:122 / 137
页数:16
相关论文
共 50 条
  • [41] A NEW FUZZY K-NEAREST NEIGHBORS ALGORITHM
    Li, Chengjie
    Pei, Zheng
    Li, Bo
    Zhang, Zhen
    INTELLIGENT DECISION MAKING SYSTEMS, VOL. 2, 2010, : 246 - +
  • [42] Maximizing Reverse k-Nearest Neighbors for Trajectories
    Al Rahat, Tamjid
    Arman, Arif
    Ali, Mohammed Eunus
    DATABASES THEORY AND APPLICATIONS, ADC 2018, 2018, 10837 : 262 - 274
  • [43] The research on an adaptive k-nearest neighbors classifier
    Yu, Xiaopeng
    Yu, Xiaogao
    PROCEEDINGS OF THE FIFTH IEEE INTERNATIONAL CONFERENCE ON COGNITIVE INFORMATICS, VOLS 1 AND 2, 2006, : 535 - 540
  • [44] Hypersphere anchor loss for K-Nearest neighbors
    Xiang Ye
    Zihang He
    Heng Wang
    Yong Li
    Applied Intelligence, 2023, 53 : 30319 - 30328
  • [45] A FUZZY EXTENDED K-NEAREST NEIGHBORS RULE
    BEREAU, M
    DUBUISSON, B
    FUZZY SETS AND SYSTEMS, 1991, 44 (01) : 17 - 32
  • [46] A k-Nearest Neighbors Approach for COCOMO Calibration
    Le, Phu
    Vu Nguyen
    2017 4TH NAFOSTED CONFERENCE ON INFORMATION AND COMPUTER SCIENCE (NICS), 2017, : 219 - 224
  • [47] AutoML for Stream k-Nearest Neighbors Classification
    Bahri, Maroua
    Veloso, Bruno
    Bifet, Albert
    Gama, Joao
    2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, : 597 - 602
  • [48] An Interval Valued K-Nearest Neighbors Classifier
    Derrac, Joaquin
    Chiclana, Francisco
    Garcia, Salvador
    Herrera, Francisco
    PROCEEDINGS OF THE 2015 CONFERENCE OF THE INTERNATIONAL FUZZY SYSTEMS ASSOCIATION AND THE EUROPEAN SOCIETY FOR FUZZY LOGIC AND TECHNOLOGY, 2015, 89 : 378 - 384
  • [49] Ensembles of K-Nearest Neighbors and Dimensionality Reduction
    Okun, Oleg
    Priisalu, Helen
    2008 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-8, 2008, : 2032 - +
  • [50] Heuristics for Computing k-Nearest Neighbors Graphs
    Chavez, Edgar
    Luduena, Veronica
    Reyes, Nora
    COMPUTER SCIENCE - CACIC 2019, 2020, 1184 : 234 - 249