Distributed Randomized Algorithms for Low-Support Data Mining

被引:0
|
作者
Ferro, Alfredo [1 ]
Giugno, Rosalba [1 ]
Mongiovi, Misael [1 ]
Pulvirenti, Alfredo [1 ]
机构
[1] Univ Catania, Dept Math & Comp Sci, Catania, Italy
来源
2009 IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL & DISTRIBUTED PROCESSING, VOLS 1-5 | 2009年
关键词
ASSOCIATION RULES;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Data mining in distributed systems has been facilitated by using high-support association rules. Less attention has been paid to distributed low-support/high-correlation data mining. This has proved useful in several fields such as computational biology, wireless networks, web mining, security and rare events analysis in industrial plants. In this paper we present distributed versions of efficient algorithms for low-support/high-correlation data mining such as Min-Hashing, K-Min-Hashing and Locality-Sensitive-Hashing. Experimental results on real data concerning scalability, speed-up and network traffic are reported.
引用
收藏
页码:2503 / 2509
页数:7
相关论文
共 50 条
  • [21] Spatial Data Mining: Database Primitives, Algorithms and Efficient DBMS Support
    Martin Ester
    Alexander Frommelt
    Hans-Peter Kriegel
    Jöorg Sander
    Data Mining and Knowledge Discovery, 2000, 4 : 193 - 216
  • [22] Compiler and runtime support for shared memory parallelization of data mining algorithms
    Li, XG
    Jin, RM
    Agrawal, G
    LANGUAGES AND COMPILERS FOR PARALLEL COMPUTING, 2005, 2481 : 265 - 279
  • [23] Spatial data mining: Database primitives, algorithms and efficient DBMS support
    Ester, M
    Frommelt, A
    Kriegel, HP
    Sander, J
    DATA MINING AND KNOWLEDGE DISCOVERY, 2000, 4 (2-3) : 193 - 216
  • [24] A scheme of interactive data mining support system in parallel and distributed environment
    Liu, Z
    Kamohara, S
    Guo, MY
    PARALLEL AND DISTRIBUTED PROCESSING AND APPLICATIONS, PROCEEDINGS, 2003, 2745 : 263 - 272
  • [25] DISTRIBUTED DATA MINING
    Fiolet, Valerie
    Toursel, Bernard
    SCALABLE COMPUTING-PRACTICE AND EXPERIENCE, 2005, 6 (01): : 99 - 109
  • [26] Analysing randomized distributed algorithms
    Norman, G
    VALIDATION OF STOCHASTIC SYSTEMS: A GUIDE TO CURRENT RESEARCH, 2004, 2925 : 384 - 418
  • [27] Dynamic Distributed and Parallel Machine Learning algorithms for big data mining processing
    Djafri, Laouni
    DATA TECHNOLOGIES AND APPLICATIONS, 2022, 56 (04) : 558 - 601
  • [28] Communication-Efficient Adam-Type Algorithms for Distributed Data Mining
    Xian, Wenhan
    Huang, Feihu
    Huang, Heng
    2022 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2022, : 1245 - 1250
  • [29] INNOVATION AND CREATIVITY SUPPORT VIA CHANCE DISCOVERY, GENETIC ALGORITHMS, AND DATA MINING
    Llora, Xavier
    Goldberg, David E.
    Ohsawa, Yukio
    Matsumura, Naohiro
    Washida, Yuichi
    Tamura, Hiroshi
    Yoshikawa, Masataka
    Welge, Michael
    Auvil, Loretta
    Searsmith, Duane
    Ohnishi, Kei
    Chao, Chen-Ju
    NEW MATHEMATICS AND NATURAL COMPUTATION, 2006, 2 (01) : 85 - 100
  • [30] An architecture to support distributed data mining services in e-commerce environments
    Krishnaswamy, S
    Zaslavsky, A
    Loke, SW
    WECWIS 2000: SECOND INTERNATIONAL WORKSHOP ON ADVANCED ISSUES OF E-COMMERCE AND WEB-BASED INFORMATION SYSTEMS, PROCEEDING, 2000, : 239 - 246