Pattern-based Topic Models for Information Filtering

被引:8
|
作者
Gao, Yang [1 ]
Xu, Yue [1 ]
Li, Yuefeng [1 ]
机构
[1] QUT, Fac Sci & Engn, Brisbane, Qld, Australia
关键词
Topic models; user modelling; pattern mining; closed pattern; information filtering;
D O I
10.1109/ICDMW.2013.30
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Topic modelling, such as Latent Dirichlet Allocation (LDA), was proposed to generate statistical models to represent multiple topics in a collection of documents, which has been widely utilized in the fields of machine learning and information retrieval, etc. But its effectiveness in information filtering is rarely known. Patterns are always thought to be more representative than single terms for representing documents. In this paper, a novel information filtering model, Pattern-based Topic Model (PBTM), is proposed to represent the text documents not only using the topic distributions at general level but also using semantic pattern representations at detailed specific level, both of which contribute to the accurate document representation and document relevance ranking. Extensive experiments are conducted to evaluate the effectiveness of PBTM by using the TREC data collection Reuters Corpus Volume 1. The results show that the proposed model achieves outstanding performance.
引用
收藏
页码:921 / 928
页数:8
相关论文
共 50 条
  • [1] Pattern-based Topics for Document Modelling in Information Filtering
    Gao, Yang
    Xu, Yue
    Li, Yuefeng
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2015, 27 (06) : 1629 - 1642
  • [2] Local pattern-based interval models
    Cholewa, W
    [J]. ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING - ICAISC 2004, 2004, 3070 : 948 - 953
  • [3] Pattern-Based Debugging of Declarative Models
    Montaghami, Vajih
    Rayside, Derek
    [J]. 2015 ACM/IEEE 18TH INTERNATIONAL CONFERENCE ON MODEL DRIVEN ENGINEERING LANGUAGES AND SYSTEMS (MODELS), 2015, : 322 - 327
  • [4] A Framework for Pattern-Based Global Models
    Giacometti, Arnaud
    Miyaneh, Eynollah Khanjari
    Marcel, Patrick
    Soulet, Arnaud
    [J]. INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING, PROCEEDINGS, 2009, 5788 : 433 - 440
  • [5] A pattern-based topic detection and analysis system on Chinese tweets
    Zhang, Lu
    Wu, Zhiang
    Bu, Zhan
    Jiang, Ye
    Cao, Jie
    [J]. JOURNAL OF COMPUTATIONAL SCIENCE, 2018, 28 : 369 - 381
  • [6] A fuzzy pattern-based filtering algorithm for botnet detection
    Wang, Kuochen
    Huang, Chun-Ying
    Lin, Shang-Jyh
    Lin, Ying-Dar
    [J]. COMPUTER NETWORKS, 2011, 55 (15) : 3275 - 3286
  • [7] Pattern-based information integration in dynamic environments
    Göres, J
    [J]. 9th International Database Engineering & Application Symposium, Proceedings, 2005, : 125 - 134
  • [8] Time and category information in pattern-based codes
    Eyherabide, Hugo Gabriel
    Samengo, Inés
    [J]. Frontiers in Computational Neuroscience, 2010, 4 (NOV):
  • [9] Time and category information in pattern-based codes
    Gabriel Eyherabide, Hugo
    Samengo, Ines
    [J]. FRONTIERS IN COMPUTATIONAL NEUROSCIENCE, 2010, 4
  • [10] A rigorous foundation for pattern-based design models
    Kim, SK
    Carrington, D
    [J]. ZB 2005: FORMAL SPECIFICATION AND DEVELOPMENT IN Z AND B, PROCEEDINGS, 2005, 3455 : 242 - 261