Parallel nearest neighbour algorithms for text categorization

被引:0
|
作者
Gil-Garcia, Reynaldo [1 ]
Badia-Contelles, Jose Manuel [2 ]
Pons-Porrata, Aurora [1 ]
机构
[1] Univ Oriente, Ctr Pattern Recognit & Data Mining, Santiago De Cuba, Cuba
[2] Univ Jaume 1, Dept Comp Sci & Engn, Castellon de La Plana, Spain
关键词
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper we describe the parallelization of two nearest neighbour classification algorithms. Nearest neighbour methods are well-known machine learning techniques. They have been successfully applied to Text Categorization task. Based on standard parallel techniques we propose two versions of each algorithm on message passing architectures. We also include experimental results on a cluster of personal computers using a large text collection. Our algorithms attempt to balance the load among the processors, they are portable, and obtain very good speedups and scalability.
引用
收藏
页码:328 / +
页数:2
相关论文
共 50 条
  • [1] An empirical comparison of exact nearest neighbour algorithms
    Kibriya, Ashraf M.
    Frank, Eibe
    [J]. KNOWLEDGE DISCOVERY IN DATABASES: PKDD 2007, PROCEEDINGS, 2007, 4702 : 140 - +
  • [2] Modular k-nearest neighbor classification method for massively parallel text categorization
    Zhao, H
    Lu, BL
    [J]. COMPUTATIONAL AND INFORMATION SCIENCE, PROCEEDINGS, 2004, 3314 : 867 - 872
  • [3] Comparison of Text Categorization Algorithms
    SHI Yong-feng
    [J]. Wuhan University Journal of Natural Sciences, 2004, (05) : 798 - 804
  • [4] A new nearest neighbor rule for text categorization
    Gil-Garcia, Reynaldo
    Pons-Porrata, Aurora
    [J]. PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS AND APPLICATIONS, PROCEEDINGS, 2006, 4225 : 814 - 823
  • [5] Text Categorization with K-Nearest Neighbor Approach
    Manne, Suneetha
    Kotha, Sita Kumari
    Fatima, S. Sameen
    [J]. PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INFORMATION SYSTEMS DESIGN AND INTELLIGENT APPLICATIONS 2012 (INDIA 2012), 2012, 132 : 413 - +
  • [6] Binary k-nearest neighbor for text categorization
    Tan, SB
    [J]. ONLINE INFORMATION REVIEW, 2005, 29 (04) : 391 - 399
  • [7] New boosting algorithms for text categorization
    Diao, LL
    Lu, MY
    Hu, KY
    Lu, YC
    Shi, CY
    [J]. PROCEEDINGS OF THE 4TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION, VOLS 1-4, 2002, : 2326 - 2329
  • [8] MFCC and ARM Algorithms for Text Categorization
    Srinivas, M.
    Spreethi, K. P.
    Prasad, E. V.
    Kumari, S. Anitha
    [J]. ICCN: 2008 INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND NETWORKING, 2008, : 698 - +
  • [9] Some improvements in tree based nearest neighbour search algorithms
    Gómez-Ballester, E
    Micó, L
    Oncina, J
    [J]. PROGRESS IN PATTERN RECOGNITION, SPEECH AND IMAGE ANALYSIS, 2003, 2905 : 456 - 463
  • [10] Optimal Space Subdivision for Parallel Approximate Nearest Neighbour Determination
    Feng, Huan
    Mills, Steven
    Eyers, David
    Shen, Xiaolong
    Huang, Zhiyi
    [J]. 2015 INTERNATIONAL CONFERENCE ON IMAGE AND VISION COMPUTING NEW ZEALAND (IVCNZ), 2015,