Mixed-membership models of scientific publications

被引:195
|
作者
Erosheva, E [1 ]
Fienberg, S
Lafferty, J
机构
[1] Univ Washington, Dept Stat, Sch Social Work, Seattle, WA 98195 USA
[2] Univ Washington, Ctr Stat & Social Sci, Seattle, WA 98195 USA
[3] Carnegie Mellon Univ, Dept Stat, Pittsburgh, PA 15213 USA
[4] Carnegie Mellon Univ, Dept Comp Sci, Pittsburgh, PA 15213 USA
[5] Carnegie Mellon Univ, Ctr Automated Learning & Discovery, Pittsburgh, PA 15213 USA
关键词
D O I
10.1073/pnas.0307760101
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
PNAS is one of world's most cited multidisciplinary scientific journals. The PNAS official classification structure of subjects is reflected in topic labels submitted by the authors of articles, largely related to traditionally established disciplines. These include broad field classifications into physical sciences, biological sciences, social sciences, and further subtopic classifications within the fields. Focusing on biological sciences, we explore an internal soft-classification structure of articles based only on semantic decompositions of abstracts and bibliographies and compare it with the formal discipline classifications. Our model assumes that there is a fixed number of internal categories, each characterized by multinomial distributions over words (in abstracts) and references (in bibliographies). Soft classification for each article is based on proportions of the article's content coming from each category. We discuss the appropriateness of the model for the PNAS database as well as other features of the data relevant to soft classification.
引用
下载
收藏
页码:5220 / 5227
页数:8
相关论文
共 50 条
  • [1] Discriminative Mixed-membership Models
    Shan, Hanhuai
    Banerjee, Arindam
    Oza, Nikunj C.
    2009 9TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, 2009, : 466 - +
  • [2] Mixed-membership naive Bayes models
    Shan, Hanhuai
    Banerjee, Arindam
    DATA MINING AND KNOWLEDGE DISCOVERY, 2011, 23 (01) : 1 - 62
  • [3] Mixed-membership naive Bayes models
    Hanhuai Shan
    Arindam Banerjee
    Data Mining and Knowledge Discovery, 2011, 23 : 1 - 62
  • [4] Mixed-Membership Stochastic Block Models for Weighted Networks
    Dulac, A.
    Gaussier, W.
    Largeron, C.
    CONFERENCE ON UNCERTAINTY IN ARTIFICIAL INTELLIGENCE (UAI 2020), 2020, 124 : 679 - 688
  • [5] Typology of Mixed-Membership Models: Towards a Design Method
    Heinrich, Gregor
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, PT II, 2011, 6912 : 32 - 47
  • [6] The Dataminer's Guide to Scalable Mixed-Membership and Nonparametric Bayesian Models
    Ahmed, Amr
    Smola, Alex
    19TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING (KDD'13), 2013, : 1528 - 1528
  • [7] Mixed-membership of experts stochastic blockmodel
    White, Arthur
    Murphy, Thomas Brendan
    NETWORK SCIENCE, 2016, 4 (01) : 48 - 80
  • [8] The Dataminer's Guide to Scalable Mixed-Membership and Nonparametric Bayesian Models
    Ahmed, Amr
    Smola, Alex
    19TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING (KDD'13), 2013, : 1527 - 1527
  • [9] Accurate and scalable social recommendation using mixed-membership stochastic block models
    Godoy-Lorite, Antonia
    Guimera, Roger
    Moore, Cristopher
    Sales-Pardo, Marta
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2016, 113 (50) : 14207 - 14212
  • [10] Dynamic Infinite Mixed-Membership Stochastic Blockmodel
    Fan, Xuhui
    Cao, Longbing
    Xu, Richard Yi Da
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2015, 26 (09) : 2072 - 2085