Probabilistic topic models for sequence data

Cited by: 21
Authors
Barbieri, Nicola [1]
Manco, Giuseppe [2]
Ritacco, Ettore [2]
Carnuccio, Marco [3]
Bevacqua, Antonio [3]
Affiliations
[1] Yahoo Res, Barcelona, Spain
[2] Italian Natl Res Council, Inst High Performance Comp & Networks ICAR, I-87036 Arcavacata Di Rende, CS, Italy
[3] Univ Calabria, Dept Elect Informat & Syst, I-87036 Arcavacata Di Rende, CS, Italy
Keywords
Recommender systems; Collaborative filtering; Probabilistic topic models; Performance
DOI
10.1007/s10994-013-5391-2
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Probabilistic topic models are widely used in different contexts to uncover the hidden structure in large text corpora. One of the main (and perhaps strongest) assumptions of these models is that the generative process is bag-of-words, i.e. each token is independent of the previous one. We extend the popular Latent Dirichlet Allocation model by exploiting three different conditional Markovian assumptions: (i) the token generation depends on the current topic and on the previous token; (ii) the topic associated with each observation depends on the topic associated with the previous one; (iii) the token generation depends on the current and previous topics. For each of these modeling assumptions we present a Gibbs Sampling procedure for parameter estimation. Experimental evaluation on real-world data shows the performance advantages, in terms of recall and precision, of the sequence-modeling approaches.
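The three Markovian assumptions in the abstract can be illustrated with a toy generative sampler. This is a minimal sketch, not the authors' model or their Gibbs Sampling procedure: the function name `sample_sequence`, the variant labels, the symmetric Dirichlet(1) priors, and the handling of the first token (which has no predecessor and so falls back to a topic-only distribution) are all assumptions made for illustration.

```python
import numpy as np

VARIANTS = ("token-bigram", "topic-chain", "topic-bigram")

def sample_sequence(length, n_topics=3, vocab_size=5,
                    variant="token-bigram", seed=0):
    """Draw (topics, tokens) from a toy sequential topic model.

    variant (hypothetical labels for the abstract's three assumptions):
      "token-bigram": token depends on current topic and previous token (i)
      "topic-chain":  topic depends on the previous topic (ii)
      "topic-bigram": token depends on current and previous topic (iii)
    """
    if variant not in VARIANTS:
        raise ValueError(f"unknown variant: {variant}")
    rng = np.random.default_rng(seed)

    # All parameters are drawn from symmetric Dirichlet(1) priors (assumed).
    theta = rng.dirichlet(np.ones(n_topics))                  # topic mixture
    trans = rng.dirichlet(np.ones(n_topics), size=n_topics)   # p(z_t | z_{t-1})
    phi_uni = rng.dirichlet(np.ones(vocab_size), size=n_topics)  # p(w | z)
    phi_tok = rng.dirichlet(np.ones(vocab_size),
                            size=(n_topics, vocab_size))      # p(w | z, w_prev)
    phi_top = rng.dirichlet(np.ones(vocab_size),
                            size=(n_topics, n_topics))        # p(w | z, z_prev)

    topics, tokens = [], []
    for t in range(length):
        # Topic draw: chained on the previous topic only in variant (ii).
        if variant == "topic-chain" and t > 0:
            z = rng.choice(n_topics, p=trans[topics[-1]])
        else:
            z = rng.choice(n_topics, p=theta)

        # Token draw: the first position has no history (assumed fallback).
        if t == 0 or variant == "topic-chain":
            w = rng.choice(vocab_size, p=phi_uni[z])
        elif variant == "token-bigram":
            w = rng.choice(vocab_size, p=phi_tok[z, tokens[-1]])
        else:  # "topic-bigram"
            w = rng.choice(vocab_size, p=phi_top[z, topics[-1]])

        topics.append(int(z))
        tokens.append(int(w))
    return topics, tokens
```

Calling `sample_sequence(10, variant="topic-chain")` returns two lists of length 10. Setting the history-dependent tables aside and always sampling from `theta` and `phi_uni` would recover the plain bag-of-words process that the abstract describes as the baseline assumption.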
Pages: 5-29
Number of pages: 25