An Empirical Study on Crosslingual Transfer in Probabilistic Topic Models

被引:0
|
作者
Hao, Shudong [1 ]
Paul, Michael J. [2 ]
机构
[1] Bard Coll Simons Rock, Div Sci Math & Comp, Great Barrington, MA 01230 USA
[2] Univ Colorado, Dept Informat Sci, Boulder, CO 80309 USA
关键词
Computational linguistics;
D O I
10.1162/coli_a_00369
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Probabilistic topic modeling is a common first step in crosslingual tasks to enable knowledge transfer and extract multilingual features. Although many multilingual topic models have been developed, their assumptions about the training corpus are quite varied, and it is not clear how well the different models can be utilized under various training conditions. In this article, the knowledge transfer mechanisms behind different multilingual topic models are systematically studied, and through a broad set of experiments with four models on ten languages, we provide empirical insights that can inform the selection and future development of multilingual topic models.
引用
收藏
页码:95 / 134
页数:40
相关论文
共 50 条
  • [1] Analyzing Bayesian Crosslingual Transfer in Topic Models
    Hao, Shudong
    Paul, Michael J.
    [J]. 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, 2019, : 1551 - 1565
  • [2] Investigating task performance of probabilistic topic models: an empirical study of PLSA and LDA
    Yue Lu
    Qiaozhu Mei
    ChengXiang Zhai
    [J]. Information Retrieval, 2011, 14 : 178 - 203
  • [3] Investigating task performance of probabilistic topic models: an empirical study of PLSA and LDA
    Lu, Yue
    Mei, Qiaozhu
    Zhai, ChengXiang
    [J]. INFORMATION RETRIEVAL, 2011, 14 (02): : 178 - 203
  • [4] Crosslingual Topic Modeling with WikiPDA
    Piccardi, Tiziano
    West, Robert
    [J]. PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE 2021 (WWW 2021), 2021, : 3032 - 3041
  • [5] Probabilistic Topic Models
    Blei, David
    Carin, Lawrence
    Dunson, David
    [J]. IEEE SIGNAL PROCESSING MAGAZINE, 2010, 27 (06) : 55 - 65
  • [6] Probabilistic Topic Models
    Blei, David M.
    [J]. COMMUNICATIONS OF THE ACM, 2012, 55 (04) : 77 - 84
  • [7] Benchmarking Neural Topic Models: An Empirical Study
    Thanh-Nam Doan
    Tuan-Anh Hoang
    [J]. FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL-IJCNLP 2021, 2021, : 4363 - 4368
  • [8] Empirical study on variational inference methods for topic models
    Chi, Jinjin
    Ouyang, Jihong
    Li, Ximing
    Li, Changchun
    [J]. JOURNAL OF EXPERIMENTAL & THEORETICAL ARTIFICIAL INTELLIGENCE, 2018, 30 (01) : 129 - 142
  • [9] Probabilistic models for topic detection and tracking
    Walls, F
    Jin, H
    Sista, S
    Schwartz, R
    [J]. ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI, 1999, : 521 - 524
  • [10] Probabilistic models for topic detection and tracking
    GTE/BBN Technologies, Cambridge, MA, United States
    [J]. ICASSP IEEE Int Conf Acoust Speech Signal Process Proc, (521-524):