Statistical-mechanics analysis of Gaussian labeled-unlabeled classification problems

被引:3
|
作者
Tanaka, Toshiyuki [1 ]
机构
[1] Kyoto Univ, Grad Sch Informat, Sakyo Ku, Kyoto 6068501, Japan
关键词
D O I
10.1088/1742-6596/473/1/012001
中图分类号
O59 [应用物理学];
学科分类号
摘要
The labeled-unlabeled classification problem in semi-supervised learning is studied via statistical-mechanics approach. We analytically investigate performance of a learner with an equal-weight mixture of two symmetrically-located Gaussians, performing posterior mean estimation of the parameter vector on the basis of a dataset consisting of labeled and unlabeled data generated from the same probability model as that assumed by the learner. Under the assumption of replica symmetry, we have analytically obtained a set of saddle-point equations, which allows us to numerically evaluate performance of the learner. On the basis of the analytical result we have observed interesting phenomena, in particular the coexistence of good and bad solutions, which may happen when the number of unlabeled data is relatively large compared with that of labeled data.
引用
收藏
页数:8
相关论文
共 49 条
  • [41] STATISTICAL-MECHANICS OF THE 1D SINE-GORDON SYSTEM .2. TRANSFER INTEGRAL ANALYSIS IN THE INTERMEDIATE TEMPERATURE REGION
    TAKAYAMA, H
    SATO, G
    JOURNAL OF THE PHYSICAL SOCIETY OF JAPAN, 1982, 51 (10) : 3120 - 3125
  • [42] Using data-compressors for statistical analysis of problems on homogeneity testing and classification
    Kyabko, Boris
    Guskov, Andrey
    Selivanova, Irina
    2017 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY (ISIT), 2017, : 121 - 125
  • [43] Application of Statistical Analysis Methods in Solving Problems of Classification of Absorbing Boreholes.
    Agaev, F.E.
    Izvestia vyssih ucebnyh zavedenij. Neft i gaz, 1980, (04): : 23 - 28
  • [44] Significance Statistical Test Analysis on Classification Models of Adolescent's Emotional Problems
    Tilyeubai, Akhyt
    Tsend, Javzmaa
    Vaanchindorj, Bayarmaa
    Chuluunbaatar, Galbadrakh
    Chilkhaasuren, Baasandorj
    Luvsan-Ish, Ajnai
    Puntsagdash, Jargalbat
    Luvsantseren, Purevdolgor
    Oyunbileg, Bat-Enkh
    ADVANCES IN ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING, 2023, 3 (04): : 1743 - 1757
  • [45] Semi-supervised time series classification on positive and unlabeled problems using cross-recurrence quantification analysis
    Pagliosa, Lucas de Carvalho
    de Mello, Rodrigo Fernandes
    PATTERN RECOGNITION, 2018, 80 : 53 - 63
  • [46] ANALYSIS OF RANDOM ANISOTROPIC DAMAGE MECHANICS PROBLEMS OF ROCK MASS .2. STATISTICAL ESTIMATION
    ZHANG, W
    VALLIAPPAN, S
    ROCK MECHANICS AND ROCK ENGINEERING, 1990, 23 (04) : 241 - 259
  • [48] Statistical analysis of big data: An approach based on support vector machines for classification and regression problems
    Kadyrova N.O.
    Pavlova L.V.
    Biophysics, 2014, 59 (3) : 364 - 373
  • [49] The Set of Basis Functions Generated by Pearson Type IV Distributions and Its Application to Problems of Statistical Data Analysis and Quantum Mechanics
    Bogdanov, Yu. I.
    Bogdanova, N. A.
    Lukichev, V. F.
    PROCEEDINGS OF THE STEKLOV INSTITUTE OF MATHEMATICS, 2024, 324 (01) : 53 - 65