A Unifying Probabilistic Framework for Partially Labeled Data Learning

被引:3
|
作者
Gong, Xiuwen [1 ]
Yuan, Dong [1 ]
Bao, Wei [1 ]
Luo, Fulin [2 ]
机构
[1] Univ Sydney, Fac Engn, Camperdown, NSW 2006, Australia
[2] Chongqing Univ, Coll Comp Sci, Chongqing 400044, Peoples R China
基金
中国国家自然科学基金;
关键词
Phase locked loops; Correlation; Training; Probabilistic logic; Testing; Task analysis; Noise measurement; Partially labeled data learning (PLDL); partial label learning (PLL); partial multi-label learning (PML); classification;
D O I
10.1109/TPAMI.2022.3228755
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Partially labeled data learning (PLDL), including partial label learning (PLL) and partial multi-label learning (PML), has been widely used in nowadays data science. Researchers attempt to construct different specific models to deal with the different classification tasks for PLL and PML scenarios respectively. The main challenge in training classifiers for PLL and PML is how to deal with ambiguities caused by the noisy false-positive labels in the candidate label set. The state-of-the-art strategy for both scenarios is to perform disambiguation by identifying the ground-truth label(s) directly from the candidate label set, which can be summarized into two categories: 'the identifying method' and 'the embedding method'. However, both kinds of methods are constructed by hand-designed heuristic modeling under considerations like feature/label correlations with no theoretical interpretation. Instead of adopting heuristic or specific modeling, we propose a novel unifying framework called A Unifying Probabilistic Framework for Partially Labeled Data Learning (UPF-PLDL), which is derived from a clear probabilistic formulation, and brings existing research on PLL and PML under one theoretical interpretation with respect to information theory. Furthermore, the proposed UPF-PLDL also unifies 'the identifying method' and 'the embedding method' into one integrated framework, which naturally incorporates the feature and label correlation considerations. Comprehensive experiments on synthetic and real-world datasets for both PLL and PML scenarios clearly demonstrate the superiorities of the derived framework.
引用
收藏
页码:8036 / 8048
页数:13
相关论文
共 50 条
  • [21] A Boosting Approach for Learning to Rank Using SVD with Partially Labeled Data
    Lin, Yuan
    Lin, Hongfei
    Yang, Zhihao
    Su, Sui
    [J]. INFORMATION RETRIEVAL TECHNOLOGY, PROCEEDINGS, 2009, 5839 : 330 - +
  • [22] Thread Structure Learning on Online Health Forums With Partially Labeled Data
    Liu, Yunzhong
    Shi, Jinhe
    Chen, Yi
    [J]. IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2019, 6 (06): : 1273 - 1282
  • [23] A unifying probabilistic framework for analyzing residual dipolar couplings
    Habeck, Michael
    Nilges, Michael
    Rieping, Wolfgang
    [J]. JOURNAL OF BIOMOLECULAR NMR, 2008, 40 (02) : 135 - 144
  • [24] A Unifying Framework for Reinforcement Learning and Planning
    Moerland, Thomas M.
    Broekens, Joost
    Plaat, Aske
    Jonker, Catholijn M.
    [J]. FRONTIERS IN ARTIFICIAL INTELLIGENCE, 2022, 5
  • [25] Learning to detect partially labeled people
    Rachlin, Y
    Dolan, J
    Khosla, P
    [J]. IROS 2003: PROCEEDINGS OF THE 2003 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, VOLS 1-4, 2003, : 1536 - 1541
  • [26] A Self-Training Listwise Method for Learning to Rank with Partially Labeled Data
    He, Hai-jiang
    [J]. INTERNATIONAL CONFERENCE ON GRAPHIC AND IMAGE PROCESSING (ICGIP 2011), 2011, 8285
  • [27] Learning From Partially Labeled Data for Multi-Organ and Tumor Segmentation
    Xie, Yutong
    Zhang, Jianpeng
    Xia, Yong
    Shen, Chunhua
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (12) : 14905 - 14919
  • [28] An NMF-framework for Unifying Posterior Probabilistic Clustering and Probabilistic Latent Semantic Indexing
    Zhang, Zhong-Yuan
    Li, Tao
    Ding, Chris
    Tang, Jie
    [J]. COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2014, 43 (19) : 4011 - 4024
  • [29] Probabilistic modeling for face orientation discrimination: Learning from labeled and unlabeled data
    Baluja, S
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 11, 1999, 11 : 854 - 860
  • [30] Information integration of partially labeled data
    Rendle, Steffen
    Schmidt-Thieme, Lars
    [J]. DATA ANALYSIS, MACHINE LEARNING AND APPLICATIONS, 2008, : 171 - +