Efficient Overlapping Document Clustering Using GPUs and Multi-core Systems

被引:0
|
作者
Gonzalez Soler, Lazaro J. [1 ]
Perez-Suarez, Airel [1 ]
Chang, Leonardo [1 ]
机构
[1] Adv Technol Applicat Ctr CENATAV, Havana 12200, Cuba
关键词
Data Mining; Overlapping Clustering; Parallel Algorithms;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Overlapping clustering algorithms have been successfully applied in several contexts. Among the reported overlapping clustering algorithms, OClustR is the one showing the best trade-of between quality of the clusters and efficiency, in the task of document clustering; however, it has a quadratic computational complexity so it could be less useful in applications dealing with a very large number of documents. In this paper, we propose two parallel versions of the OClustR algorithm, specifically tailored for GPUs and multi-core CPUs, which enhance the efficiency of OClustR in problems dealing with a very large number of documents. The experimental evaluation over standard document collections showed the correctness and good performance of our proposals.
引用
收藏
页码:264 / 271
页数:8
相关论文
共 50 条
  • [31] High Performance Matrix Inversion on a Multi-core Platform with Several GPUs
    Ezzatti, Pablo
    Quintana-Orti, Enrique S.
    Remon, Alfredo
    PROCEEDINGS OF THE 19TH INTERNATIONAL EUROMICRO CONFERENCE ON PARALLEL, DISTRIBUTED, AND NETWORK-BASED PROCESSING, 2011, : 87 - 93
  • [32] An energy-efficient reconfigurable accelerators in multi-core systems using PULP-NN
    Tammireddy, Siva Sankara Phani
    Samson, Mamatha
    Reddy, P. Rahul
    Reddy, A. Kishore
    Panigrahy, Asisa Kumar
    Jayabalan, Sudharsan
    Prakash, M. Durga
    APPLIED NANOSCIENCE, 2021, 13 (3) : 1795 - 1795
  • [33] Ownership Passing: Efficient Distributed Memory Programming on Multi-core Systems
    Friedley, Andrew
    Hoefler, Torsten
    Bronevetsky, Greg
    Lumsdaine, Andrew
    Ma, Ching-Chen
    ACM SIGPLAN NOTICES, 2013, 48 (08) : 177 - 186
  • [34] An Efficient Unbounded Lock-Free Queue for Multi-core Systems
    Aldinucci, Marco
    Danelutto, Marco
    Kilpatrick, Peter
    Meneghin, Massimiliano
    Torquati, Massimo
    EURO-PAR 2012 PARALLEL PROCESSING, 2012, 7484 : 662 - 673
  • [35] LARGE CAPACITY TRANSMISSION SYSTEMS USING MULTI-CORE FIBERS
    Sano, Akihide
    Takara, Hidehiko
    Moyamoto, Yutaka
    2014 OPTOELECTRONICS AND COMMUNICATIONS CONFERENCE AND AUSTRALIAN CONFERENCE ON OPTICAL FIBRE TECHNOLOGY (OECC/ACOFT 2014), 2014, : 704 - 705
  • [36] WCET(m) Estimation in Multi-Core Systems using Single Core Equivalence
    Mancuso, Renato
    Pellizzoni, Rodolfo
    Caccamo, Marco
    Sha, Lui
    Yun, Heechul
    PROCEEDINGS OF THE 2015 27TH EUROMICRO CONFERENCE ON REAL-TIME SYSTEMS (ECRTS 2015), 2015, : 174 - 183
  • [37] Improving Efficiency of Link Clustering on Multi-Core Machines
    Yan, Guanhua
    2017 IEEE 37TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS 2017), 2017, : 2017 - 2024
  • [38] Multi-Core for K-Means Clustering on FPGA
    Canilho, Jose
    Vestias, Mario
    Neto, Horacio
    2016 26TH INTERNATIONAL CONFERENCE ON FIELD PROGRAMMABLE LOGIC AND APPLICATIONS (FPL), 2016,
  • [39] EXPLOITING MULTI-CORE AND MANY-CORE PARALLELISM FOR SUBSPACE CLUSTERING
    Datta, Amitava
    Kaur, Amardeep
    Lauer, Tobias
    Chabbouh, Sami
    INTERNATIONAL JOURNAL OF APPLIED MATHEMATICS AND COMPUTER SCIENCE, 2019, 29 (01) : 81 - 91
  • [40] Challenges and Opportunities of Obtaining Performance from Multi-Core CPUs and Many-Core GPUs
    Chen, Trista P.
    Chen, Yen-Kuang
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 613 - +