Streaming random forests

被引:0
|
作者
Abdulsalam, Hanady [1 ]
Skillicorn, David B. [1 ]
Martin, Patrick [1 ]
机构
[1] Queens Univ, Sch Comp, Kingston, ON K7L 3N6, Canada
关键词
data mining; classification; decision trees; data-stream classification; random forests;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Many recent applications deal with data streams, conceptually endless sequences of data records, often arriving at high flow rates. Standard data-mining techniques typically assume that records can be accessed multiple times and so do not naturally extend to streaming data. Algorithms for mining streams must be able to extract all necessary information from records with only one, or perhaps a few, passes over the data. We present the Streaming Random Forests algorithm, an online and incremental stream classification algorithm that extends Breiman's Random Forests algorithm. The Streaming Random Forests algorithm grows multiple decision trees, and classifies unlabelled records based on the plurality of tree votes. We evaluate the classification accuracy of the Streaming Random Forests algorithm on several datasets, and show that its accuracy is comparable to the standard Random Forest algorithm.
引用
收藏
页码:225 / 232
页数:8
相关论文
共 50 条
  • [21] On the asymptotics of random forests
    Scornet, Erwan
    JOURNAL OF MULTIVARIATE ANALYSIS, 2016, 146 : 72 - 83
  • [22] Critical random forests
    Martin, James B.
    Yeo, Dominic
    ALEA-LATIN AMERICAN JOURNAL OF PROBABILITY AND MATHEMATICAL STATISTICS, 2018, 15 (02): : 913 - 960
  • [23] Multivariate random forests
    Segal, Mark
    Xiao, Yuanyuan
    WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2011, 1 (01) : 80 - 87
  • [24] Random Tessellation Forests
    Ge, Shufei
    Wang, Shijia
    Teh, Yee Whye
    Wang, Liangliang
    Elliott, Lloyd T.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [25] Random Similarity Forests
    Piernik, Maciej
    Brzezinski, Dariusz
    Zawadzki, Pawel
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2022, PT V, 2023, 13717 : 53 - 69
  • [26] Random Forests in Chapel
    Albrecht, Benjamin
    2020 IEEE 34TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW 2020), 2020, : 676 - 676
  • [27] Random Survival Forests
    Taylor, Jeremy M. G.
    JOURNAL OF THORACIC ONCOLOGY, 2011, 6 (12) : 1974 - 1975
  • [28] Neural Random Forests
    Biau, Gerard
    Scornet, Erwan
    Welbl, Johannes
    SANKHYA-SERIES A-MATHEMATICAL STATISTICS AND PROBABILITY, 2019, 81 (02): : 347 - 386
  • [29] Dynamic Random Forests
    Bernard, Simon
    Adam, Sebastien
    Heutte, Laurent
    PATTERN RECOGNITION LETTERS, 2012, 33 (12) : 1580 - 1586
  • [30] Coalescent random forests
    Pitman, J
    JOURNAL OF COMBINATORIAL THEORY SERIES A, 1999, 85 (02) : 165 - 193