Provable Algorithms for Inference in Topic Models

被引:0
|
作者
Arora, Sanjeev [1 ]
Ge, Rong [2 ]
Koehler, Frederic [3 ]
Ma, Tengyu [1 ]
Moitra, Ankur [4 ,5 ]
机构
[1] Princeton Univ, Dept Comp Sci, Princeton, NJ 08544 USA
[2] Duke Univ, Comp Sci Dept, Durham, NC 27706 USA
[3] Princeton Univ, Dept Math, Princeton, NJ 08544 USA
[4] MIT, Dept Math, Cambridge, MA 02139 USA
[5] MIT, CSAIL, Cambridge, MA 02139 USA
关键词
LASSO;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recently, there has been considerable progress on designing algorithms with provable guarantees - typically using linear algebraic methods - for parameter learning in latent variable models. But designing provable algorithms for inference has proven to be more challenging. Here we take a first step towards provable inference in topic models. We leverage a property of topic models that enables us to construct simple linear estimators for the unknown topic proportions that have small variance, and consequently can work with short documents. Our estimators also correspond to finding an estimate around which the posterior is well-concentrated. We show lower bounds that for shorter documents it can be information theoretically impossible to find the hidden topics. Finally, we give empirical results that demonstrate that our algorithm works on realistic topic models. It yields good solutions on synthetic data and runs in time comparable to a single iteration of Gibbs sampling.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] ON PROVABLE EXACT LOW-RANK RECOVERY IN TOPIC MODELS
    Behmardi, Behrouz
    Raich, Raviv
    2011 IEEE STATISTICAL SIGNAL PROCESSING WORKSHOP (SSP), 2011, : 265 - 268
  • [2] Distributed Algorithms for Topic Models
    Newman, David
    Asuncion, Arthur
    Smyth, Padhraic
    Welling, Max
    JOURNAL OF MACHINE LEARNING RESEARCH, 2009, 10 : 1801 - 1828
  • [3] An Instability in Variational Inference for Topic Models
    Ghorbani, Behrooz
    Javadi, Hamid
    Montanari, Andrea
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [4] Stochastic Bounds for Inference in Topic Models
    Xuan Bui
    Tu Vu
    Khoat Than
    ADVANCES IN INFORMATION AND COMMUNICATION TECHNOLOGY, 2017, 538 : 582 - 592
  • [5] Provable Variational Inference for Constrained Log-Submodular Models
    Djolonga, Josip
    Jegelka, Stefanie
    Krause, Andreas
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [6] Concurrent Inference of Topic Models and Distributed Vector Representations
    Shamanta, Debakar
    Naim, Sheikh Motahar
    Saraf, Parang
    Ramakrishnan, Naren
    Hossain, M. Shahriar
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2015, PT II, 2015, 9285 : 441 - 457
  • [7] Empirical study on variational inference methods for topic models
    Chi, Jinjin
    Ouyang, Jihong
    Li, Ximing
    Li, Changchun
    JOURNAL OF EXPERIMENTAL & THEORETICAL ARTIFICIAL INTELLIGENCE, 2018, 30 (01) : 129 - 142
  • [8] Stochastic Variational Inference for Dynamic Correlated Topic Models
    Tomasi, Federico
    Ravichandran, Praveen
    Levy-Fix, Gal
    Lalmas, Mounia
    Dai, Zhenwen
    CONFERENCE ON UNCERTAINTY IN ARTIFICIAL INTELLIGENCE (UAI 2020), 2020, 124 : 859 - 868
  • [9] Scalable Inference in Max-margin Topic Models
    Zhu, Jun
    Zheng, Xun
    Zhou, Li
    Zhang, Bo
    19TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING (KDD'13), 2013, : 964 - 972
  • [10] Review of Trends in Topic Modeling Techniques, Tools, Inference Algorithms and Applications
    Mulunda, Christine K.
    Wagacha, Peter W.
    Muchemi, Lawrence
    2018 5TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING & MACHINE INTELLIGENCE (ISCMI), 2018, : 28 - 37