Mixed-membership models of scientific publications

Cited by: 195
Authors
Erosheva, E [1]
Fienberg, S
Lafferty, J
Affiliations
[1] Univ Washington, Dept Stat, Sch Social Work, Seattle, WA 98195 USA
[2] Univ Washington, Ctr Stat & Social Sci, Seattle, WA 98195 USA
[3] Carnegie Mellon Univ, Dept Stat, Pittsburgh, PA 15213 USA
[4] Carnegie Mellon Univ, Dept Comp Sci, Pittsburgh, PA 15213 USA
[5] Carnegie Mellon Univ, Ctr Automated Learning & Discovery, Pittsburgh, PA 15213 USA
DOI
10.1073/pnas.0307760101
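Chinese Library Classification (CLC) number
O [Mathematical Sciences and Chemistry]; P [Astronomy and Earth Sciences]; Q [Biological Sciences]; N [General Natural Sciences];
Subject classification code
07 ; 0710 ; 09 ;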
Abstract
PNAS is one of the world's most cited multidisciplinary scientific journals. The PNAS official classification structure of subjects is reflected in topic labels submitted by the authors of articles, largely related to traditionally established disciplines. These include broad field classifications into physical sciences, biological sciences, and social sciences, as well as further subtopic classifications within the fields. Focusing on biological sciences, we explore an internal soft-classification structure of articles based only on semantic decompositions of abstracts and bibliographies and compare it with the formal discipline classifications. Our model assumes that there is a fixed number of internal categories, each characterized by multinomial distributions over words (in abstracts) and references (in bibliographies). Soft classification for each article is based on proportions of the article's content coming from each category. We discuss the appropriateness of the model for the PNAS database as well as other features of the data relevant to soft classification.
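The generative assumption described in the abstract can be sketched in a few lines of Python. This is a minimal illustration, not the authors' implementation: the Dirichlet draw for the membership vector, the two toy categories, and every name below (`sample_dirichlet`, `sample_article`, `WORD_DISTS`, `REF_DISTS`) are hypothetical choices made for this example. Each article has a vector of membership proportions, and each word or reference is drawn by first picking a category according to those proportions and then sampling from that category's multinomial distribution.

```python
import random

# Hypothetical per-category multinomials (toy values, not taken from the paper).
WORD_DISTS = [
    {"protein": 0.7, "cell": 0.3},   # category 0: distribution over abstract words
    {"neuron": 0.6, "cell": 0.4},    # category 1
]
REF_DISTS = [
    {"ref_A": 0.8, "ref_B": 0.2},    # category 0: distribution over cited references
    {"ref_B": 0.5, "ref_C": 0.5},    # category 1
]


def sample_dirichlet(alpha, rng):
    """Draw membership proportions from a Dirichlet via normalized gamma draws.

    The Dirichlet prior is an assumption of this sketch.
    """
    draws = [rng.gammavariate(a, 1.0) for a in alpha]
    total = sum(draws)
    return [d / total for d in draws]


def sample_article(theta, word_dists, ref_dists, n_words, n_refs, rng):
    """Generate the words and references of one article.

    theta[k] is the proportion of the article's content coming from category k.
    """
    categories = list(range(len(theta)))

    def draw(dists, n):
        items = []
        for _ in range(n):
            # Pick the category responsible for this item.
            k = rng.choices(categories, weights=theta)[0]
            vocab = list(dists[k])
            probs = [dists[k][v] for v in vocab]
            # Sample the item from that category's multinomial.
            items.append(rng.choices(vocab, weights=probs)[0])
        return items

    return draw(word_dists, n_words), draw(ref_dists, n_refs)


if __name__ == "__main__":
    rng = random.Random(0)
    theta = sample_dirichlet([1.0, 1.0], rng)
    words, refs = sample_article(theta, WORD_DISTS, REF_DISTS, 10, 3, rng)
    print(theta, words, refs)
```

In this sketch the soft classification of an article is the vector `theta` itself. Fitting the model means inferring `theta` and the category distributions from observed abstracts and bibliographies, which is not shown here.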
Pages: 5220-5227
Page count: 8
Related papers
50 in total
  • [21] Mining Overlapping Communities and Inner Role Assignments through Bayesian Mixed-Membership Models of Networks with Context-Dependent Interactions
    Costa, Gianni
    Ortale, Riccardo
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2018, 12 (02)
  • [22] Beta-Negative Binomial Process and Exchangeable Random Partitions for Mixed-Membership Modeling
    Zhou, Mingyuan
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 27 (NIPS 2014), 2014, 27
  • [23] Monitoring networks with overlapping communities based on latent mixed-membership stochastic block model
    He, Qing
    Wang, Junjie
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 229
  • [24] Markov Mixed Membership Models
    Zhang, Aonan
    Paisley, John
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 37, 2015, 37 : 475 - 483
  • [25] Ordinal Mixed Membership Models
    Virtanen, Seppo
    Girolami, Mark
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 37, 2015, 37 : 588 - 596
  • [26] Functional Mixed Membership Models
    Marco, Nicholas
    Senturk, Damla
    Jeste, Shafali
    DiStefano, Charlotte
    Dickinson, Abigail
    Telesca, Donatello
    JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2024,
  • [27] Variational Bayesian inference for bipartite mixed-membership stochastic block model with applications to collaborative filtering
    Liu, Jie
    Ye, Zifeng
    Chen, Kun
    Zhang, Panpan
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2024, 189
  • [28] SEMANTIC ANNOTATION OF SATELLITE IMAGES USING DISCRETE INFINITE LOGISTIC NORMAL DISTRIBUTION BASED MIXED-MEMBERSHIP MODEL
    Luo, Wang
    Zhang, Tian-Bing
    Hong, Gong-Vi
    Sun, Jing
    2012 INTERNATIONAL CONFERENCE ON WAVELET ACTIVE MEDIA TECHNOLOGY AND INFORMATION PROCESSING (ICWAMTIP), 2012, : 149 - 152
  • [29] Partitioned Tensor Factorizations for Learning Mixed Membership Models
    Tan, Zilong
    Mukherjee, Sayan
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70, 2017, 70
  • [30] Bayesian mixed membership models for soft clustering and classification
    Erosheva, EA
    Fienberg, SE
    CLASSIFICATION - THE UBIQUITOUS CHALLENGE, 2005, : 11 - 26