Iterative double clustering for unsupervised and semi-supervised learning

被引:0
|
作者
El-Yaniv, R [1 ]
Souroujon, O [1 ]
机构
[1] Technion Israel Inst Technol, Dept Comp Sci, IL-32000 Haifa, Israel
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a powerful meta-clustering technique called Iterative Double Clustering (IDC). The IDC method is a natural extension of the recent Double Clustering (DC) method of Slonim and Tishby that exhibited impressive performance on text categorization tasks [12]. Using synthetically generated data we empirically find that whenever the DC procedure is successful in recovering some of the structure hidden in the data, the extended IDC procedure can incrementally compute a significantly more accurate classification. IDC is especially advantageous when the data exhibits high attribute noise. Our simulation results also show the effectiveness of IDC in text categorization problems. Surprisingly, this unsupervised procedure can be competitive with a (supervised) SVM trained with a small training set. Finally, we propose a simple and natural extension of IDC for semi-supervised and transductive learning where we are given both labeled and unlabeled examples.
引用
下载
收藏
页码:1025 / 1032
页数:8
相关论文
共 50 条
  • [1] Temporal Ordered Clustering in Dynamic Networks: Unsupervised and Semi-Supervised Learning Algorithms
    Turowski, Krzysztof
    Sreedharan, Jithin K.
    Szpankowski, Wojciech
    IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2021, 8 (02): : 1426 - 1442
  • [2] Semi-Supervised and Unsupervised Extreme Learning Machines
    Huang, Gao
    Song, Shiji
    Gupta, Jatinder N. D.
    Wu, Cheng
    IEEE TRANSACTIONS ON CYBERNETICS, 2014, 44 (12) : 2405 - 2417
  • [3] Ensemble learning with trees and rules: Supervised, semi-supervised, unsupervised
    Akdemir, Deniz
    Jannink, Jean-Luc
    INTELLIGENT DATA ANALYSIS, 2014, 18 (05) : 857 - 872
  • [4] A Clustering Framework for Unsupervised and Semi-supervised New Intent Discovery
    Zhang H.
    Xu H.
    Wang X.
    Long F.
    Gao K.
    IEEE Transactions on Knowledge and Data Engineering, 2024, 36 (11) : 1 - 14
  • [5] Semi-supervised Clustering with Deep Metric Learning
    Li, Xiaocui
    Yin, Hongzhi
    Zhou, Ke
    Chen, Hongxu
    Sadiq, Shazia
    Zhou, Xiaofang
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, 2019, 11448 : 383 - 386
  • [6] Active Learning of Constraints for Semi-Supervised Clustering
    Xiong, Sicheng
    Azimi, Javad
    Fern, Xiaoli Z.
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2014, 26 (01) : 43 - 54
  • [7] Disfluency Correction using Unsupervised and Semi-supervised Learning
    Saini, Nikhil
    Trivedi, Drumil
    Khare, Shreya
    Dhamecha, Tejas, I
    Jyothi, Preethi
    Bharadwaj, Samarth
    Bhattacharyya, Pushpak
    16TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EACL 2021), 2021, : 3421 - 3427
  • [8] Unsupervised identification of points of interest for semi-supervised learning
    Frigui, H
    FUZZ-IEEE 2005: Proceedings of the IEEE International Conference on Fuzzy Systems: BIGGEST LITTLE CONFERENCE IN THE WORLD, 2005, : 91 - 96
  • [9] Semi-Supervised and Unsupervised Deep Visual Learning: A Survey
    Chen, Yanbei
    Mancini, Massimiliano
    Zhu, Xiatian
    Akata, Zeynep
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (03) : 1327 - 1347
  • [10] Federated Learning in Healthcare with Unsupervised and Semi-Supervised Methods
    Panos-Basterra, Juan
    Dolores Ruiz, M.
    Martin-Bautista, Maria J.
    FLEXIBLE QUERY ANSWERING SYSTEMS, FQAS 2023, 2023, 14113 : 182 - 193