APHID: An architecture for private, high-performance integrated data mining

被引:5
|
作者
Secretan, Jimmy [1 ]
Georgiopoulos, Michael [1 ]
Koufakou, Anna [1 ]
Cardona, Kel [2 ]
机构
[1] Univ Cent Florida, Sch Elect Engn & Comp Sci, Orlando, FL 32816 USA
[2] Univ Puerto Rico, Dept Comp Engn, San Juan, PR 00936 USA
基金
美国国家科学基金会;
关键词
Data mining; Privacy; Distributed architectures; SERVICES;
D O I
10.1016/j.future.2010.02.017
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
While the emerging field of privacy preserving data mining (PPDM) will enable many new data mining applications, it suffers from several practical difficulties. PPDM algorithms are challenging to develop and computationally intensive to execute. Developers need convenient abstractions to simplify the engineering of PPDM applications. The individual parties involved in the data mining process need a way to bring high-performance, parallel computers to bear on the computationally intensive parts of the PPDM tasks. This paper discusses APHID (Architecture for Private and High-performance Integrated Data mining), a practical architecture and software framework for developing and executing large scale PPDM applications. At one tier, the system supports simplified use of cluster and grid resources, and at another tier, the system abstracts communication for easy PPDM algorithm development. This paper offers a detailed analysis of the challenges in developing PPDM algorithms with existing frameworks, and motivates the design of a new infrastructure based on these challenges. (C) 2010 Elsevier B.V. All rights reserved.
引用
收藏
页码:891 / 904
页数:14
相关论文
共 50 条
  • [21] Evolution of a high-performance PC architecture data processing system
    Turri, M
    DASIA 99: DATA SYSTEMS IN AEROSPACE, 1999, 447 : 73 - 78
  • [22] FAST: A High-Performance Architecture for Heterogeneous Big Data Forensics
    Pungila, Ciprian
    Negru, Viorel
    INTERNATIONAL JOINT CONFERENCE SOCO'17- CISIS'17-ICEUTE'17 PROCEEDINGS, 2018, 649 : 618 - 627
  • [23] High-performance protocol architecture
    Dabbous, WS
    COMPUTER NETWORKS AND ISDN SYSTEMS, 1997, 29 (07): : 735 - 744
  • [24] HIGH-PERFORMANCE COMPUTER ARCHITECTURE
    BHUYAN, LN
    FUTURE GENERATION COMPUTER SYSTEMS, 1995, 11 (06) : 501 - 502
  • [25] HIGH-PERFORMANCE ARCHITECTURE ISSUES
    NICOLE, DA
    DECENTRALIZED AND DISTRIBUTED SYSTEMS, 1993, 39 : 23 - 30
  • [26] High-performance commercial data mining: A multistrategy machine learning application
    Hsu, WH
    Welge, M
    Redman, T
    Clutter, D
    DATA MINING AND KNOWLEDGE DISCOVERY, 2002, 6 (04) : 361 - 391
  • [27] High-Performance Commercial Data Mining: A Multistrategy Machine Learning Application
    William H. Hsu
    Michael Welge
    Tom Redman
    David Clutter
    Data Mining and Knowledge Discovery, 2002, 6 : 361 - 391
  • [28] Special issue on high-performance data mining - Guest Editors' introduction
    Kumar, V
    Ranka, S
    Singh, V
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2001, 61 (03) : 281 - 284
  • [29] Data mining middleware for wide-area high-performance networks
    Grossman, Robert L.
    Gu, Yunhong
    Hanley, David
    Sabala, Michal
    Mambretti, Joe
    Szalay, Alex
    Thakar, Ani
    Kumazoe, Kazumi
    Yuji, Oie
    Lee, Minsun
    Kwon, Yoonjoo
    Seok, Woojin
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2006, 22 (08): : 940 - 948
  • [30] A data mining integrated architecture for shop floor control
    Srinivas
    Harding, J. A.
    PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART B-JOURNAL OF ENGINEERING MANUFACTURE, 2008, 222 (05) : 605 - 624