Incremental k-Nearest Neighbors Using Reservoir Sampling for Data Streams

被引:1
|
作者
Bahri, Maroua [1 ]
Bifet, Albert [1 ,2 ]
机构
[1] IP Paris, LTCI, Telecom Paris, Paris, France
[2] Univ Waikato, Hamilton, New Zealand
来源
DISCOVERY SCIENCE (DS 2021) | 2021年 / 12986卷
关键词
Data stream classification; K-nearest neighbors; Reservoir sampling; Sliding window;
D O I
10.1007/978-3-030-88942-5_10
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The online and potentially infinite nature of data streams leads to the inability to store the flow in its entirety and thus restricts the storage to a part of - and/or synopsis information from - the stream. To process these evolving data, we need efficient and accurate methodologies and systems, such as window models (e.g., sliding windows) and summarization techniques (e.g., sampling, sketching, dimensionality reduction). In this paper, we propose, RW-kNN, a k-Nearest Neighbors (kNN) algorithm that employs a practical way to store information about past instances using the biased reservoir sampling to sample the input instances along with a sliding window to maintain the most recent instances from the stream. We evaluate our proposal on a diverse set of synthetic and real datasets and compare against state-of-the-art algorithms in a traditional test-then-train evaluation. Results show how our proposed RW-kNN approach produces high-predictive performance for both real and synthetic datasets while using a feasible amount of resources.
引用
收藏
页码:122 / 137
页数:16
相关论文
共 50 条
  • [21] Clustering of Remote Sensing Data Based on K-Nearest Neighbors Sampling With Non-Evenly Division
    Liu, Lan
    Li, Cheng-Fan
    Sun, Xian-Kun
    Lei, Yong-Mei
    Si, Wen
    Lai, Ming-Shu
    IEEE ACCESS, 2019, 7 : 147292 - 147301
  • [22] Landmine Classification Using Possibilistic K-Nearest Neighbors with Wideband Electromagnetic Induction Data
    Dula, J.
    Zare, A.
    Ho, D.
    Gader, P.
    DETECTION AND SENSING OF MINES, EXPLOSIVE OBJECTS, AND OBSCURED TARGETS XVIII, 2013, 8709
  • [23] RSSI-based Localization Using K-Nearest Neighbors
    Achroufene, Achour
    AD HOC & SENSOR WIRELESS NETWORKS, 2023, 56 (1-2) : 105 - 135
  • [24] Movie Recommender System Using K-Nearest Neighbors Variants
    Sonu Airen
    Jitendra Agrawal
    National Academy Science Letters, 2022, 45 : 75 - 82
  • [25] Toward Predicting Medical Conditions Using k-Nearest Neighbors
    Tayeb, Shahab
    Pirouz, Matin
    Sun, Johann
    Hall, Kaylee
    Chang, Andrew
    Li, Jessica
    Song, Connor
    Chauhan, Apoorva
    Ferra, Michael
    Sager, Theresa
    Zhan, Justin
    Latifi, Shahram
    2017 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2017, : 3897 - 3903
  • [26] Fast agglomerative clustering using information of k-nearest neighbors
    Chang, Chih-Tang
    Lai, Jim Z. C.
    Jeng, M. D.
    PATTERN RECOGNITION, 2010, 43 (12) : 3958 - 3968
  • [27] Classifications of Motor Imagery Tasks Using K-Nearest Neighbors
    Aldea, Roxana
    Fira, Monica
    Lazar, Anca
    2014 12TH SYMPOSIUM ON NEURAL NETWORK APPLICATIONS IN ELECTRICAL ENGINEERING (NEUREL), 2014, : 115 - 119
  • [28] A Placement Prediction System Using K-Nearest Neighbors Classifier
    Giri, Animesh
    Bhagavath, M. Vignesh V.
    Pruthvi, Bysani
    Dubey, Naini
    2016 SECOND INTERNATIONAL CONFERENCE ON COGNITIVE COMPUTING AND INFORMATION PROCESSING (CCIP), 2016,
  • [29] Information theoretic clustering using a k-nearest neighbors approach
    Vikjord, Vidar V.
    Jenssen, Robert
    PATTERN RECOGNITION, 2014, 47 (09) : 3070 - 3081
  • [30] Classification using the local probabilistic centers of k-nearest neighbors
    Li, Bo Yu
    Chen, Yun Wen
    18TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 3, PROCEEDINGS, 2006, : 1220 - +