Probabilistic topic models for sequence data

被引:21
|
作者
Barbieri, Nicola [1 ]
Manco, Giuseppe [2 ]
Ritacco, Ettore [2 ]
Carnuccio, Marco [3 ]
Bevacqua, Antonio [3 ]
机构
[1] Yahoo Res, Barcelona, Spain
[2] Italian Natl Res Council, Inst High Performance Comp & Networks ICAR, I-87036 Arcavacata Di Rende, CS, Italy
[3] Univ Calabria, Dept Elect Informat & Syst, I-87036 Arcavacata Di Rende, CS, Italy
关键词
Recommender systems; Collaborative filtering; Probabilistic topic models; Performance;
D O I
10.1007/s10994-013-5391-2
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Probabilistic topic models are widely used in different contexts to uncover the hidden structure in large text corpora. One of the main (and perhaps strong) assumption of these models is that generative process follows a bag-of-words assumption, i.e. each token is independent from the previous one. We extend the popular Latent Dirichlet Allocation model by exploiting three different conditional Markovian assumptions: (i) the token generation depends on the current topic and on the previous token; (ii) the topic associated with each observation depends on topic associated with the previous one; (iii) the token generation depends on the current and previous topic. For each of these modeling assumptions we present a Gibbs Sampling procedure for parameter estimation. Experimental evaluation over real-word data shows the performance advantages, in terms of recall and precision, of the sequence-modeling approaches.
引用
下载
收藏
页码:5 / 29
页数:25
相关论文
共 50 条
  • [21] Probabilistic Learning Models for Topic Extraction in Thai Language
    Asawaroengchai, Chulayuth
    Chaisangmongkon, Warasinee
    Laowattana, Djitt
    PROCEEDINGS OF 2018 5TH INTERNATIONAL CONFERENCE ON BUSINESS AND INDUSTRIAL RESEARCH (ICBIR): SMART TECHNOLOGY FOR NEXT GENERATION OF INFORMATION, ENGINEERING, BUSINESS AND SOCIAL SCIENCE, 2018, : 35 - 40
  • [22] Incorporating Local Word Relationships into Probabilistic Topic Models
    Rahimi, Marziea
    Zahedi, Morteza
    Mashayekhi, Hoda
    2015 7th Conference on Information and Knowledge Technology (IKT), 2015,
  • [23] Using Probabilistic Topic Models in Enterprise Social Software
    Christidis, Konstantinos
    Mentzas, Gregoris
    BUSINESS INFORMATION SYSTEMS, PROCEEDINGS, 2010, 47 : 23 - 34
  • [24] Inferring functional modules of protein families with probabilistic topic models
    Sebastian GA Konietzny
    Laura Dietz
    Alice C McHardy
    BMC Bioinformatics, 12
  • [25] Knowledge discovery through directed probabilistic topic models: a survey
    Daud, Ali
    Li, Juanzi
    Zhou, Lizhu
    Muhammad, Faqir
    FRONTIERS OF COMPUTER SCIENCE IN CHINA, 2010, 4 (02): : 280 - 301
  • [26] Inferring functional modules of protein families with probabilistic topic models
    Konietzny, Sebastian G. A.
    Dietz, Laura
    McHardy, Alice C.
    BMC BIOINFORMATICS, 2011, 12
  • [27] Keep It Simple with Time: A Reexamination of Probabilistic Topic Detection Models
    He, Qi
    Chang, Kuiyu
    Lim, Ee-Peng
    Banerjee, Arindam
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2010, 32 (10) : 1795 - 1808
  • [28] Knowledge discovery through directed probabilistic topic models: a survey
    Ali Daud
    Juanzi Li
    Lizhu Zhou
    Faqir Muhammad
    Frontiers of Computer Science in China, 2010, 4 : 280 - 301
  • [29] Dynamic conditional random fields: Factorized probabilistic models for labeling and segmenting sequence data
    Sutton, Charles
    McCallum, Andrew
    Rohanimanesh, Khashayar
    JOURNAL OF MACHINE LEARNING RESEARCH, 2007, 8 : 693 - 723
  • [30] The generative capacity of probabilistic protein sequence models
    McGee, Francisco
    Hauri, Sandro
    Novinger, Quentin
    Vucetic, Slobodan
    Levy, Ronald M.
    Carnevale, Vincenzo
    Haldane, Allan
    NATURE COMMUNICATIONS, 2021, 12 (01)