A Unifying Probabilistic Framework for Partially Labeled Data Learning

被引:3
|
作者
Gong, Xiuwen [1 ]
Yuan, Dong [1 ]
Bao, Wei [1 ]
Luo, Fulin [2 ]
机构
[1] Univ Sydney, Fac Engn, Camperdown, NSW 2006, Australia
[2] Chongqing Univ, Coll Comp Sci, Chongqing 400044, Peoples R China
基金
中国国家自然科学基金;
关键词
Phase locked loops; Correlation; Training; Probabilistic logic; Testing; Task analysis; Noise measurement; Partially labeled data learning (PLDL); partial label learning (PLL); partial multi-label learning (PML); classification;
D O I
10.1109/TPAMI.2022.3228755
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Partially labeled data learning (PLDL), including partial label learning (PLL) and partial multi-label learning (PML), has been widely used in nowadays data science. Researchers attempt to construct different specific models to deal with the different classification tasks for PLL and PML scenarios respectively. The main challenge in training classifiers for PLL and PML is how to deal with ambiguities caused by the noisy false-positive labels in the candidate label set. The state-of-the-art strategy for both scenarios is to perform disambiguation by identifying the ground-truth label(s) directly from the candidate label set, which can be summarized into two categories: 'the identifying method' and 'the embedding method'. However, both kinds of methods are constructed by hand-designed heuristic modeling under considerations like feature/label correlations with no theoretical interpretation. Instead of adopting heuristic or specific modeling, we propose a novel unifying framework called A Unifying Probabilistic Framework for Partially Labeled Data Learning (UPF-PLDL), which is derived from a clear probabilistic formulation, and brings existing research on PLL and PML under one theoretical interpretation with respect to information theory. Furthermore, the proposed UPF-PLDL also unifies 'the identifying method' and 'the embedding method' into one integrated framework, which naturally incorporates the feature and label correlation considerations. Comprehensive experiments on synthetic and real-world datasets for both PLL and PML scenarios clearly demonstrate the superiorities of the derived framework.
引用
收藏
页码:8036 / 8048
页数:13
相关论文
共 50 条
  • [31] Conformal Prediction with Partially Labeled Data
    Javanmardi, Alireza
    Sale, Yusuf
    Hofman, Paul
    Huellermeier, Eyke
    [J]. CONFORMAL AND PROBABILISTIC PREDICTION WITH APPLICATIONS, VOL 204, 2023, 204 : 251 - 266
  • [32] COMPOSE: A Semisupervised Learning Framework for Initially Labeled Nonstationary Streaming Data
    Dyer, Karl B.
    Capo, Robert
    Polikar, Robi
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2014, 25 (01) : 12 - 26
  • [33] Mirror Learning: A Unifying Framework of Policy Optimisation
    Kuba, Jakub Grudzien
    de Witt, Christian Schroeder
    Foerster, Jakob
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [34] Classifying evolving data streams with partially labeled data
    Borchani, Hanen
    Larranaga, Pedro
    Bielza, Concha
    [J]. INTELLIGENT DATA ANALYSIS, 2011, 15 (05) : 655 - 670
  • [35] A Unifying Computational Framework for Teaching and Active Learning
    Yang, Scott Cheng-Hsin
    Vong, Wai Keen
    Yu, Yue
    Shafto, Patrick
    [J]. TOPICS IN COGNITIVE SCIENCE, 2019, 11 (02) : 316 - 337
  • [36] A Probabilistic Framework for Deep Learning
    Patel, Ankit B.
    Tan Nguyen
    Baraniuk, Richard G.
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016), 2016, 29
  • [37] Semi Advised learning and classification algorithm for Partially Labeled Skin Cancer Data Analysis
    Masood, Ammara
    Al-Jumaily, Adel
    [J]. 2017 12TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS AND KNOWLEDGE ENGINEERING (IEEE ISKE), 2017,
  • [38] A general probabilistic framework for mining labeled ordered trees
    Ueda, N
    Aoki, KF
    Mamitsuka, H
    [J]. PROCEEDINGS OF THE FOURTH SIAM INTERNATIONAL CONFERENCE ON DATA MINING, 2004, : 357 - 368
  • [39] A unifying framework for conceptual data modelling concepts
    Frederiks, PJM
    terHofstede, AHM
    Lippe, E
    [J]. INFORMATION AND SOFTWARE TECHNOLOGY, 1997, 39 (01) : 15 - 25
  • [40] A unifying framework for multilevel description of spatial data
    Bertolotto, M
    DeFloriani, L
    Marzano, P
    [J]. SPATIAL INFORMATION THEORY: A THEORETICAL BASIS FOR GIS, 1995, 988 : 259 - 278