Modeling Flickr Communities Through Probabilistic Topic-Based Analysis

被引:21
|
作者
Negoescu, Radu-Andrei [1 ,2 ]
Gatica-Perez, Daniel
机构
[1] Idiap Res Inst, Lausanne, Switzerland
[2] Ecole Polytech Fed Lausanne, Swiss Fed Inst Technol, Lausanne, Switzerland
基金
瑞士国家科学基金会;
关键词
Flickr; probabilistic topic models; social media; LATENT SEMANTIC ANALYSIS;
D O I
10.1109/TMM.2010.2050649
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the increased presence of digital imaging devices, there also came an explosion in the amount of multimedia content available online. Users have transformed from passive consumers of media into content creators and have started organizing themselves in and around online communities. Flickr has more than 30 million users and over 3 billion photos, and many of them are tagged and public. One very important aspect in Flickr is the ability of users to organize in self-managed communities called groups. This paper examines an unexplored problem, which is jointly analyzing Flickr groups and users. We show that although users and groups are conceptually different, in practice they can be represented in a similar way via a bag-of-tags derived from their photos, which is amenable for probabilistic topic modeling. We then propose a probabilistic topic model representation learned in an unsupervised manner that allows the discovery of similar users and groups beyond direct tag-based strategies, and we demonstrate that higher-level information such as topics of interest are a viable alternative. On a dataset containing users of 10 000 Flickr groups and over 1 milion photos, we show how this common topic-based representation allows for a novel analysis of the groups-users Flickr ecosystem, which results into new insights about the structure of the entities in this social media source. We demonstrate novel practical applications of our topic-based representation, such as similarity-based exploration of entities, or single and multi-topic tag-based search, which address current limitations in the ways Flickr is used today.
引用
收藏
页码:399 / 416
页数:18
相关论文
共 50 条
  • [41] Detecting topic-based communities in social networks: A study in a real software development network
    Horta, Vitor A. C.
    Stroele, Victor
    Oliveira, Jonice
    Braga, Regina
    David, Jose Maria N.
    Campos, Fernanda
    [J]. JOURNAL OF WEB SEMANTICS, 2022, 74
  • [42] Online Topic-based Social Influence Analysis for the Wimbledon Championships
    Embar, Varun R.
    Bhattacharya, Indrajit
    Pandit, Vinayaka
    Vaculin, Roman
    [J]. KDD'15: PROCEEDINGS OF THE 21ST ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2015, : 1759 - 1768
  • [43] Topic-based Coordination for Visual Analysis of Evolving Document Collections
    Eler, Danilo Medeiros
    Paulovich, Fernando Vieira
    Ferreira de Oliveira, Maria Cristina
    Minghim, Rosane
    [J]. INFORMATION VISUALIZATION, IV 2009, PROCEEDINGS, 2009, : 149 - 155
  • [44] Topic-based Targeted Influence Maximization
    Srinivasan, Balaji V.
    Anandhavelu, N.
    Dalal, Aseem
    Yenugula, Madhavi
    Srikanthan, Prashanth
    Layek, Arijit
    [J]. 2014 SIXTH INTERNATIONAL CONFERENCE ON COMMUNICATION SYSTEMS AND NETWORKS (COMSNETS), 2014,
  • [45] Topic-based software defect explanation
    Chen, Tse-Hsun
    Shang, Weiyi
    Nagappan, Meiyappan
    Hassan, Ahmed E.
    Thomas, Stephen W.
    [J]. JOURNAL OF SYSTEMS AND SOFTWARE, 2017, 129 : 79 - 106
  • [46] Personalized topic-based tag recommendation
    Krestel, Ralf
    Fankhauser, Peter
    [J]. NEUROCOMPUTING, 2012, 76 (01) : 61 - 70
  • [47] A topic-based document correlation model
    Jia, Xi-Ping
    Peng, Hong
    Zheng, Qj-Lun
    Jiang, Zhuo-Lin
    Li, Zhao
    [J]. PROCEEDINGS OF 2008 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2008, : 2487 - 2491
  • [48] Topic-Based Image Caption Generation
    Sandeep Kumar Dash
    Shantanu Acharya
    Partha Pakray
    Ranjita Das
    Alexander Gelbukh
    [J]. Arabian Journal for Science and Engineering, 2020, 45 : 3025 - 3034
  • [49] Characterization of topic-based online communities by combining network data and user generated content
    Igarashi, Mirai
    Terui, Nobuhiko
    [J]. STATISTICS AND COMPUTING, 2020, 30 (05) : 1309 - 1324
  • [50] Characterization of topic-based online communities by combining network data and user generated content
    Mirai Igarashi
    Nobuhiko Terui
    [J]. Statistics and Computing, 2020, 30 : 1309 - 1324