Modeling Flickr Communities Through Probabilistic Topic-Based Analysis

被引:21
|
作者
Negoescu, Radu-Andrei [1 ,2 ]
Gatica-Perez, Daniel
机构
[1] Idiap Res Inst, Lausanne, Switzerland
[2] Ecole Polytech Fed Lausanne, Swiss Fed Inst Technol, Lausanne, Switzerland
基金
瑞士国家科学基金会;
关键词
Flickr; probabilistic topic models; social media; LATENT SEMANTIC ANALYSIS;
D O I
10.1109/TMM.2010.2050649
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the increased presence of digital imaging devices, there also came an explosion in the amount of multimedia content available online. Users have transformed from passive consumers of media into content creators and have started organizing themselves in and around online communities. Flickr has more than 30 million users and over 3 billion photos, and many of them are tagged and public. One very important aspect in Flickr is the ability of users to organize in self-managed communities called groups. This paper examines an unexplored problem, which is jointly analyzing Flickr groups and users. We show that although users and groups are conceptually different, in practice they can be represented in a similar way via a bag-of-tags derived from their photos, which is amenable for probabilistic topic modeling. We then propose a probabilistic topic model representation learned in an unsupervised manner that allows the discovery of similar users and groups beyond direct tag-based strategies, and we demonstrate that higher-level information such as topics of interest are a viable alternative. On a dataset containing users of 10 000 Flickr groups and over 1 milion photos, we show how this common topic-based representation allows for a novel analysis of the groups-users Flickr ecosystem, which results into new insights about the structure of the entities in this social media source. We demonstrate novel practical applications of our topic-based representation, such as similarity-based exploration of entities, or single and multi-topic tag-based search, which address current limitations in the ways Flickr is used today.
引用
收藏
页码:399 / 416
页数:18
相关论文
共 50 条
  • [1] Content Patterns in Topic-Based Overlapping Communities
    Rios, Sebastian A.
    Munoz, Ricardo
    [J]. SCIENTIFIC WORLD JOURNAL, 2014,
  • [2] Topic-based ranking in Folksonomy via probabilistic model
    Yan’an Jin
    Ruixuan Li
    Kunmei Wen
    Xiwu Gu
    Fei Xiao
    [J]. Artificial Intelligence Review, 2011, 36 : 139 - 151
  • [3] Topic-based ranking in Folksonomy via probabilistic model
    Jin, Yan'an
    Li, Ruixuan
    Wen, Kunmei
    Gu, Xiwu
    Xiao, Fei
    [J]. ARTIFICIAL INTELLIGENCE REVIEW, 2011, 36 (02) : 139 - 151
  • [4] Topic-Based Sentiment Analysis
    Buddhitha, Prasadith
    Inkpen, Diana
    [J]. INFORMATION MANAGEMENT AND BIG DATA, 2017, 656 : 95 - 107
  • [5] Sentiment Analysis on Twitter through Topic-Based Lexicon Expansion
    Zhou, Zhixin
    Zhang, Xiuzhen
    Sanderson, Mark
    [J]. DATABASES THEORY AND APPLICATIONS, ADC 2014, 2014, 8506 : 98 - 109
  • [6] Time-Sensitive Topic-Based Communities on Twitter
    Fani, Hossein
    Zarrinkalam, Fattane
    Bagheri, Ebrahim
    Du, Weichang
    [J]. ADVANCES IN ARTIFICIAL INTELLIGENCE, AI 2016, 2016, 9673 : 192 - 204
  • [7] Topic-based Classification through Unigram Unmasking
    HaCohen-Kerner, Yaakov
    Rosenfeld, Avi
    Sabag, Asaf
    Tzidkani, Maor
    [J]. KNOWLEDGE-BASED AND INTELLIGENT INFORMATION & ENGINEERING SYSTEMS (KES-2018), 2018, 126 : 69 - 76
  • [8] Topic-based Video Analysis: A Survey
    Pal, Ratnabali
    Sekh, Arif Ahmed
    Dogra, Debi Prosad
    Kar, Samarjit
    Roy, Partha Pratim
    Prasad, Dilip K.
    [J]. ACM COMPUTING SURVEYS, 2021, 54 (06)
  • [9] Hierarchical Topic-Based Communities Construction for Authors in a Literature Database
    Wu, Chien-Liang
    Koh, Jia-Ling
    [J]. TRENDS IN APPLIED INTELLIGENT SYSTEMS, PT II, PROCEEDINGS, 2010, 6097 : 514 - 524
  • [10] Evaluating Public Anxiety for Topic-Based Communities in Social Networks
    Ta, Na
    Li, Kaiyu
    Yang, Yi
    Jiao, Fang
    Tang, Zheng
    Li, Guoliang
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2022, 34 (03) : 1191 - 1205