The SPMF Open-Source Data Mining Library Version 2

被引:344
|
作者
Fournier-Viger, Philippe [1 ]
Lin, Jerry Chun-Wei [2 ]
Gomariz, Antonio [3 ]
Gueniche, Ted [4 ]
Soltani, Azadeh [5 ]
Deng, Zhihong [6 ]
Hoang Thanh Lam [7 ]
机构
[1] Harbin Inst Technol, Shenzhen Grad Sch, Sch Nat Sci & Humanities, Shenzhen, Peoples R China
[2] Harbin Inst Technol, Shenzhen Grad Sch, Sch Comp Sci & Technol, Shenzhen, Peoples R China
[3] Univ Murcia, Dept Informat & Commun Engn, Murcia, Spain
[4] Univ Moncton, Dept Comp Sci, Moncton, NB, Canada
[5] Univ Bojnord, Dept Comp Engn, Bojnord, Iran
[6] Peking Univ, Sch Elect Engn & Comp Sci, Beijing, Peoples R China
[7] IBM Ireland Res Lab, Dublin, Ireland
关键词
Open-source library; Data mining; Frequent pattern mining;
D O I
10.1007/978-3-319-46131-1_8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
SPMF is an open-source data mining library, specialized in pattern mining, offering implementations of more than 120 data mining algorithms. It has been used in more than 310 research papers to solve applied problems in a wide range of domains from authorship attribution to restaurant recommendation. Its implementations are also commonly used as benchmarks in research papers, and it has also been integrated in several data analysis software programs. After three years of development, this paper introduces the second major revision of the library, named SPMF 2, which provides (1) more than 60 new algorithm implementations (including novel algorithms for sequence prediction), (2) an improved user interface with pattern visualization (3) a novel plug-in system, (4) improved performance, and (5) support for text mining.
引用
收藏
页码:36 / 40
页数:5
相关论文
共 50 条
  • [1] SPMF: A Java']Java Open-Source Pattern Mining Library
    Fournier-Viger, Philippe
    Gomariz, Antonio
    Gueniche, Ted
    Soltani, Azadeh
    Wu, Cheng-Wei
    Tseng, Vincent S.
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2014, 15 : 3389 - 3393
  • [2] Open-source tools for data mining
    Zupan, Blaz
    Demsar, Janez
    [J]. CLINICS IN LABORATORY MEDICINE, 2008, 28 (01) : 37 - +
  • [3] The evolution of an integrated data-visualization environment for mining with the VTK open-source library
    Drummond, ML
    Lightfoot, N
    Donovan, SJ
    Grodner, MW
    Schweitzer, JK
    Sellers, EJ
    [J]. COMPUTER APPLICATIONS IN THE MINERALS INDUSTRIES, 2001, : 373 - 377
  • [4] A Study of Open-Source Data Mining Tools for Forecasting
    Hasim, Nurdatillah
    Abu Haris, Norhaidah
    [J]. ACM IMCOM 2015, PROCEEDINGS, 2015,
  • [5] Generation of an open-source library of mouse knockout immunophenotyping data
    Abeler-Doerner, L.
    Speak, A. O.
    Clare, S.
    Melvin, D. G.
    White, J. K.
    Adams, D. J.
    Hayday, A. C.
    [J]. IMMUNOLOGY, 2013, 140 : 132 - 132
  • [6] Open-Source Shared Case Library
    Schwid, Howard A.
    [J]. MEDICINE MEETS VIRTUAL REALITY 16: PARALLEL, COMBINATORIAL, CONVERGENT: NEXTMED BY DESIGN, 2008, 132 : 442 - +
  • [7] InSilicoSpectro: An open-source proteomics library
    Colinge, J
    Masselot, A
    Carbonell, P
    Appel, RD
    [J]. JOURNAL OF PROTEOME RESEARCH, 2006, 5 (03) : 619 - 624
  • [8] OpenNFCSense: Open-Source Library for NFCSense
    Liang, Rong-Hao
    [J]. ADJUNCT PROCEEDINGS OF THE 34TH ANNUAL ACM SYMPOSIUM ON USER INTERFACE SOFTWARE AND TECHNOLOGY, UIST 2021, 2021, : 118 - 120
  • [9] Open-Source Syringe Pump Library
    Wijnen, Bas
    Hunt, Emily J.
    Anzalone, Gerald C.
    Pearce, Joshua M.
    [J]. PLOS ONE, 2014, 9 (09):
  • [10] An open-source toolkit for mining Wikipedia
    Milne, David
    Witten, Ian H.
    [J]. ARTIFICIAL INTELLIGENCE, 2013, 194 : 222 - 239