Learning topic-based mixture models for factored classification

Authors
Chen, Qiong [1]
Mitchell, Tom M. [2]
Affiliations
[1] South China Univ Technol, Sch Engn & Comp Sci, Guangzhou 510641, Peoples R China
[2] Carnegie Mellon Univ, Sch Comp Sci, Pittsburgh, PA 15213 USA
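Funding
Andrew Mellon Foundation (USA)
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract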
We present a learning algorithm for factored classification that employs a topic-based mixture model. In factored classification, the class label is factored into a vector of class features. For example, the class label for a personal web page at a university might be described by two features: the academic discipline of the person and their position (e.g., 'chemistry professor' or 'physics student'). We present an approach to factored classification of text documents in which each document is assumed to be generated by a mixture of class features. This formulation allows building on recent work on topic-based mixture models for unsupervised text analysis. We present an algorithm for supervised learning of mixture models for factored classification. Experiments on two factored text classification problems (classifying web pages and classifying the intent of email senders) demonstrate our approach and show that it can outperform earlier approaches for categories with especially sparse training data.
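The idea of a document generated by a mixture of class features can be illustrated with a small sketch. This is not the authors' algorithm, which the abstract does not specify in detail. It is a simplified stand-in under stated assumptions: one Laplace-smoothed word distribution per class-feature value, equal mixing weights across features, and a uniform prior over label combinations. The function names (`train`, `log_likelihood`, `classify`) and the toy data are hypothetical.

```python
import math
from collections import Counter
from itertools import product


def train(docs, alpha=1.0):
    """Estimate one smoothed word distribution per class-feature value.

    docs: list of (tokens, label) pairs, where label is a tuple holding one
    value per class feature, e.g. ("chemistry", "professor").
    Assumption: each document's words count toward every feature value in
    its label (a simplification, not the paper's training procedure).
    """
    vocab = sorted({w for tokens, _ in docs for w in tokens})
    counts = {}
    for tokens, label in docs:
        for i, value in enumerate(label):
            counts.setdefault((i, value), Counter()).update(tokens)
    model = {}
    for key, c in counts.items():
        total = sum(c.values()) + alpha * len(vocab)
        model[key] = {w: (c[w] + alpha) / total for w in vocab}
    return model


def log_likelihood(tokens, label, model):
    """Log-probability of tokens under an equal-weight mixture of the
    word distributions of the feature values in label."""
    ll = 0.0
    for w in tokens:
        probs = [model[(i, v)].get(w) for i, v in enumerate(label)]
        if None in probs:  # skip out-of-vocabulary words
            continue
        ll += math.log(sum(probs) / len(probs))
    return ll


def classify(tokens, model):
    """Return the feature-value combination with the highest likelihood."""
    values = {}
    for i, v in model:
        values.setdefault(i, set()).add(v)
    candidates = product(*(sorted(values[i]) for i in sorted(values)))
    return max(candidates, key=lambda label: log_likelihood(tokens, label, model))


# Toy data: no training document is labeled ("chemistry", "student").
training_docs = [
    (["molecule", "reaction", "lecture", "tenure"], ("chemistry", "professor")),
    (["quantum", "particle", "lecture", "tenure"], ("physics", "professor")),
    (["quantum", "particle", "homework", "exam"], ("physics", "student")),
]
model = train(training_docs)
print(classify(["molecule", "reaction", "homework", "exam"], model))
# ('chemistry', 'student')
```

Because each feature value has its own word distribution, the sketch can assign a label combination that never occurs in the training data, which suggests why a factored formulation may help for categories with sparse training data.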
Pages: 25 / +
Page count: 2