FSCOALParallel simultaneous fuzzy co-clustering and learning

被引:0
|
作者
Biton, David [1 ]
Kalech, Meir [1 ]
Rokach, Lior [1 ]
机构
[1] Ben Gurion Univ Negev, Software & Informat Syst Engn Dept, Beer Sheva, Israel
关键词
distributed data mining; fuzzy co-clustering; predictive modeling;
D O I
10.1002/int.21967
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A model-based co-clustering divides the data based on two main axes and simultaneously trains a supervised model for each co-cluster using all other input features. For example, in the rating prediction task of recommender system, the main two axes are items and users. In each co-cluster, we train a regression model for predicting the rating based on other features such as user's characteristics (e.g., gender), item's characteristics (e.g., genre), contextual features (e.g., location), and so on. In reality, users and items do not necessarily belong to a single co-cluster, but rather can be associated with several co-clusters. We extend the model-based co-clustering to support fuzzy co-clustering. In this setting, each item-user pair is associated to every co-cluster with some membership grade. This grade indicates the level of relevance of the item-user pair to the co-cluster. Furthermore, we propose a distributed algorithm, based on a map-reduce approach, to handle big datasets. Evaluating the fuzzy co-clustering algorithm on three datasets shows a significant improvement comparing with a regular co-clustering algorithm. In addition, a map-reduce version of the fuzzy co-clustering algorithm significantly reduces the runtime.
引用
收藏
页码:1364 / 1380
页数:17
相关论文
共 50 条
  • [1] A Framework for Simultaneous Co-clustering and Learning from Complex Data
    Deodhar, Meghana
    Ghosh, Joydeep
    [J]. KDD-2007 PROCEEDINGS OF THE THIRTEENTH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2007, : 250 - 259
  • [2] Fuzzy co-clustering of web documents
    William-Chandra, T
    Chen, L
    [J]. 2005 INTERNATIONAL CONFERENCE ON CYBERWORLDS, PROCEEDINGS, 2005, : 545 - 551
  • [3] Robust fuzzy co-clustering algorithm
    Tjhi, William-Chandra
    Chen, Lihui
    [J]. 2007 6TH INTERNATIONAL CONFERENCE ON INFORMATION, COMMUNICATIONS & SIGNAL PROCESSING, VOLS 1-4, 2007, : 1591 - 1595
  • [4] Co-clustering of fuzzy lagged data
    Shaham, Eran
    Sarne, David
    Ben-Moshe, Boaz
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2015, 44 (01) : 217 - 252
  • [5] Fuzzy co-clustering of documents and keywords
    Kurnmamuru, K
    Dhawale, A
    Krishnapuram, R
    [J]. PROCEEDINGS OF THE 12TH IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOLS 1 AND 2, 2003, : 772 - 777
  • [6] Co-clustering of fuzzy lagged data
    Eran Shaham
    David Sarne
    Boaz Ben-Moshe
    [J]. Knowledge and Information Systems, 2015, 44 : 217 - 252
  • [7] SCOAL: A Framework for Simultaneous Co-Clustering and Learning from Complex Data
    Deodhar, Meghana
    Ghosh, Joydeep
    [J]. ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2010, 4 (03)
  • [8] Co-Adjustment Learning for Co-Clustering
    Ji Zhang
    Hongjun Wang
    Shudong Huang
    Tianrun Li
    Peng Jin
    Ping Deng
    Qigang Zhao
    [J]. Cognitive Computation, 2021, 13 : 504 - 517
  • [9] Fuzzy Co-clustering with Automated Variable Weighting
    Laclau, Charlotte
    de Carvalho, Francisco de A. T.
    Nadif, Mohamed
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE 2015), 2015,
  • [10] Fuzzy Co-Clustering and Application to Collaborative Filtering
    Honda, Katsuhiro
    [J]. INTEGRATED UNCERTAINTY IN KNOWLEDGE MODELLING AND DECISION MAKING, IUKM 2016, 2016, 9978 : 16 - 23