APHID: An architecture for private, high-performance integrated data mining

被引:5
|
作者
Secretan, Jimmy [1 ]
Georgiopoulos, Michael [1 ]
Koufakou, Anna [1 ]
Cardona, Kel [2 ]
机构
[1] Univ Cent Florida, Sch Elect Engn & Comp Sci, Orlando, FL 32816 USA
[2] Univ Puerto Rico, Dept Comp Engn, San Juan, PR 00936 USA
基金
美国国家科学基金会;
关键词
Data mining; Privacy; Distributed architectures; SERVICES;
D O I
10.1016/j.future.2010.02.017
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
While the emerging field of privacy preserving data mining (PPDM) will enable many new data mining applications, it suffers from several practical difficulties. PPDM algorithms are challenging to develop and computationally intensive to execute. Developers need convenient abstractions to simplify the engineering of PPDM applications. The individual parties involved in the data mining process need a way to bring high-performance, parallel computers to bear on the computationally intensive parts of the PPDM tasks. This paper discusses APHID (Architecture for Private and High-performance Integrated Data mining), a practical architecture and software framework for developing and executing large scale PPDM applications. At one tier, the system supports simplified use of cluster and grid resources, and at another tier, the system abstracts communication for easy PPDM algorithm development. This paper offers a detailed analysis of the challenges in developing PPDM algorithms with existing frameworks, and motivates the design of a new infrastructure based on these challenges. (C) 2010 Elsevier B.V. All rights reserved.
引用
收藏
页码:891 / 904
页数:14
相关论文
共 50 条
  • [1] High-performance data mining
    IBM, United States
    IBM Data Manag. Mag., 2009, 3
  • [2] Anteater: A service-oriented architecture for high-performance data mining
    Guedes, Dorgival
    Meira, Wagner, Jr.
    Ferreira, Renato
    IEEE INTERNET COMPUTING, 2006, 10 (04) : 36 - 43
  • [3] High-performance data mining system
    Yaginuma, Y
    FUJITSU SCIENTIFIC & TECHNICAL JOURNAL, 2000, 36 (02): : 201 - 210
  • [4] An open multi-tier architecture for high-performance data mining using SOA
    Rahman, Muhammad Mushfiqur
    Maksud-Ul-Alam
    Rahman, S. M. Monzurur
    INTERNATIONAL JOURNAL OF DATA MINING MODELLING AND MANAGEMENT, 2015, 7 (01) : 60 - 82
  • [5] High-performance data mining with intelligent SSD
    Yong-Yeon Jo
    Sang-Wook Kim
    Sung-Woo Cho
    Duck-Ho Bae
    Hyunok Oh
    Cluster Computing, 2017, 20 : 1155 - 1166
  • [6] High-performance data mining with intelligent SSD
    Jo, Yong-Yeon
    Kim, Sang-Wook
    Cho, Sung-Woo
    Bae, Duck-Ho
    Oh, Hyunok
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2017, 20 (02): : 1155 - 1166
  • [7] Performance prediction of data streams on high-performance architecture
    Gautam, Bhaskar
    Basava, Annappa
    HUMAN-CENTRIC COMPUTING AND INFORMATION SCIENCES, 2019, 9 (01)
  • [8] Trident: The Acceleration Architecture for High-Performance Private Set Intersection
    Zhang, Jinkai
    Yang, Yinghao
    Zhou, Zhe
    Hu, Zhicheng
    Zhao, Xin
    Chang, Liang
    Lu, Hang
    Li, Xiaowei
    IEEE TRANSACTIONS ON COMPUTERS, 2025, 74 (04) : 1152 - 1167
  • [9] Exploring PIM Architecture for High-Performance Graph Pattern Mining
    Su, Jiya
    He, Linfeng
    Jiang, Peng
    Wang, Rujia
    IEEE COMPUTER ARCHITECTURE LETTERS, 2021, 20 (02) : 114 - 117
  • [10] A data mining toolset for distributed high-performance platforms
    Cannataro, M
    Congiusta, A
    Talia, D
    Trunfio, P
    DATA MINING III, 2002, 6 : 41 - 50