Learning aspect models with partially labeled data

被引:3
|
作者
Krithara, Anastasia [1 ,2 ]
Amini, Massih R. [2 ,3 ]
Goutte, Cyril [3 ]
Renders, Jean-Michel [1 ]
机构
[1] Xerox Res Ctr Europe, F-38240 Meylan, France
[2] Univ Paris 06, F-75252 Paris 05, France
[3] Natl Res Council Canada, Hull, PQ J8X 3X7, Canada
关键词
Semi supervised learning; Aspect models; Document categorization; DISCRIMINANT-ANALYSIS; INITIAL SAMPLES;
D O I
10.1016/j.patrec.2010.09.004
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we address the problem of learning aspect models with partially labeled data for the task of document categorization The motivation of this work is to take advantage of the amount of available unlabeled data together with the set of labeled examples to learn latent models whose structure and underlying hypotheses take more accurately Into account the document generation process compared to other mixture-based generative models We present one semi-supervised variant of the Probabilistic Latent Semantic Analysis (PLSA) model (Hofmann 2001) In our approach we try to capture the possible data mislabeling errors which occur during the training of our model This is done by iteratively assigning class labels to unlabeled examples using the current aspect model and re-estimating the probabilities of the mislabeling errors We perform experiments over the 20Newsgroups WebKB and Reuters document collections as well as over a real world dataset coming from a Business Group of Xerox and show the effectiveness of our approach compared to a semi-supervised version of Naive Bayes another semi-supervised version of PLSA and to transductive Support Vector Machines (C) 2010 Elsevier B V All rights reserved
引用
收藏
页码:297 / 304
页数:8
相关论文
共 50 条
  • [1] Learning classifiers on a partially labeled data manifold
    Liu, Qiuhua
    Liao, Xuejun
    Carin, Lawrence
    [J]. 2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL II, PTS 1-3, 2007, : 621 - +
  • [2] LEARNING DISCRIMINATIVE DICTIONARIES WITH PARTIALLY LABELED DATA
    Shrivastava, Ashish
    Pillai, Jaishanker K.
    Patel, Vishal M.
    Chellappa, Rama
    [J]. 2012 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP 2012), 2012, : 3113 - 3116
  • [3] LEARNING α-INTEGRATION WITH PARTIALLY-LABELED DATA
    Choi, Heeyoul
    Choi, Seungjin
    Katake, Anup
    Choe, Yoonsuck
    [J]. 2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 2058 - 2061
  • [4] Generative Manifold Learning for the Exploration of Partially Labeled Data
    Cruz-Barbosa, Raul
    Vellido, Alfredo
    [J]. COMPUTACION Y SISTEMAS, 2013, 17 (04): : 641 - 653
  • [5] A Unifying Probabilistic Framework for Partially Labeled Data Learning
    Gong, Xiuwen
    Yuan, Dong
    Bao, Wei
    Luo, Fulin
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (07) : 8036 - 8048
  • [6] Deep Learning in Partially-labeled Data Streams
    Read, Jesse
    Perez-Cruz, Fernando
    Bifet, Albert
    [J]. 30TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, VOLS I AND II, 2015, : 954 - 959
  • [7] Attribute reduction for partially labeled data based on hypergraph models
    Xie, Xiaojun
    Qin, Xiaolin
    Huang, Guangmei
    Zhao, Wei
    [J]. 2019 IEEE 31ST INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2019), 2019, : 1434 - 1439
  • [8] Non-linear dictionary learning with partially labeled data
    Shrivastava, Ashish
    Patel, Vishal M.
    Chellappa, Rama
    [J]. PATTERN RECOGNITION, 2015, 48 (11) : 3283 - 3292
  • [9] DISCRIMINATIVE DICTIONARY PAIR LEARNING FROM PARTIALLY LABELED DATA
    Rong, Yi
    Xiong, Shengwu
    Gao, Yongsheng
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2016, : 719 - 723
  • [10] MULTI-TASK CURRICULUM LEARNING FOR PARTIALLY LABELED DATA
    Jang, Won-Dong
    Lukyanenko, Stanislav
    Wei, Donglai
    Yang, Jiancheng
    Leahy, Brian
    Yang, Helen
    Ben-Yosef, Dalit
    Needleman, Daniel
    Pfister, Hanspeter
    [J]. 2023 IEEE 20TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING, ISBI, 2023,