Efficient Overlapping Document Clustering Using GPUs and Multi-core Systems

被引:0
|
作者
Gonzalez Soler, Lazaro J. [1 ]
Perez-Suarez, Airel [1 ]
Chang, Leonardo [1 ]
机构
[1] Adv Technol Applicat Ctr CENATAV, Havana 12200, Cuba
关键词
Data Mining; Overlapping Clustering; Parallel Algorithms;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Overlapping clustering algorithms have been successfully applied in several contexts. Among the reported overlapping clustering algorithms, OClustR is the one showing the best trade-of between quality of the clusters and efficiency, in the task of document clustering; however, it has a quadratic computational complexity so it could be less useful in applications dealing with a very large number of documents. In this paper, we propose two parallel versions of the OClustR algorithm, specifically tailored for GPUs and multi-core CPUs, which enhance the efficiency of OClustR in problems dealing with a very large number of documents. The experimental evaluation over standard document collections showed the correctness and good performance of our proposals.
引用
收藏
页码:264 / 271
页数:8
相关论文
共 50 条
  • [1] Efficient tool path computation using multi-core GPUs
    Morell-Gimenez, Vicente
    Jimeno-Morenilla, Antonio
    Garcia-Rodriguez, Jose
    COMPUTERS IN INDUSTRY, 2013, 64 (01) : 50 - 56
  • [2] An Efficient Implementation of PSRS for Multi-core Systems
    He Songsong
    Gu Naijie
    Weng Yuping
    Ning Lanfang
    2011 INTERNATIONAL CONFERENCE ON ELECTRONICS, COMMUNICATIONS AND CONTROL (ICECC), 2011, : 136 - 139
  • [3] Multifrontal Computations on GPUs and Their Multi-core Hosts
    Lucas, Robert F.
    Wagenbreth, Gene
    Davis, Dan M.
    Grimes, Roger
    HIGH PERFORMANCE COMPUTING FOR COMPUTATIONAL SCIENCE - VECPAR 2010, 2011, 6449 : 71 - +
  • [4] Efficient parallelization of SPH algorithm on modern multi-core CPUs and massively parallel GPUs
    Jagtap, Pravin
    Nasre, Rupesh
    Sanapala, V. S.
    Patnaik, B. S., V
    INTERNATIONAL JOURNAL OF MODELING SIMULATION AND SCIENTIFIC COMPUTING, 2021, 12 (06)
  • [5] Hyperspectral Unmixing on GPUs and Multi-Core Processors: A Comparison
    Bernabe, Sergio
    Sanchez, Sergio
    Plaza, Antonio
    Lopez, Sebastian
    Benediktsson, Jon Atli
    Sarmiento, Roberto
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2013, 6 (03) : 1386 - 1398
  • [6] Efficient dynamic program monitoring on multi-core systems
    He, Guojin
    Zhai, Antonia
    JOURNAL OF SYSTEMS ARCHITECTURE, 2011, 57 (01) : 121 - 133
  • [7] PARALLEL SPN ON MULTI-CORE CPUS AND MANY-CORE GPUS
    Kirschenmann, W.
    Plagne, L.
    Poncot, A.
    Vialle, S.
    TRANSPORT THEORY AND STATISTICAL PHYSICS, 2010, 39 (2-4): : 255 - 281
  • [8] A highly efficient multi-core algorithm for clustering extremely large datasets
    Kraus, Johann M.
    Kestler, Hans A.
    BMC BIOINFORMATICS, 2010, 11
  • [9] A highly efficient multi-core algorithm for clustering extremely large datasets
    Johann M Kraus
    Hans A Kestler
    BMC Bioinformatics, 11
  • [10] Scalable Multi-coloring Preconditioning for Multi-core CPUs and GPUs
    Heuveline, Vincent
    Lukarski, Dimitar
    Weiss, Jan-Philipp
    EURO-PAR 2010 PARALLEL PROCESSING WORKSHOPS, 2011, 6586 : 389 - 397