Topic Modeling of Multimodal Data: an Autoregressive Approach

被引:48
|
作者
Zheng, Yin [1 ]
Zhang, Yu-Jin [1 ]
Larochelle, Hugo [2 ]
机构
[1] Tsinghua Univ, Dept Elect Engn, Tsinghua Natl Lab Informat Sci & Technol, Beijing 100084, Peoples R China
[2] Univ Sherbrooke, Dept Informat, Sherbrooke, PQ J1K 2R1, Canada
关键词
D O I
10.1109/CVPR.2014.178
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Topic modeling based on latent Dirichlet allocation (LDA) has been a framework of choice to deal with multimodal data, such as in image annotation tasks. Recently, a new type of topic model called the Document Neural Autoregressive Distribution Estimator (DocNADE) was proposed and demonstrated state-of-the-art performance for text document modeling. In this work, we show how to successfully apply and extend this model to multimodal data, such as simultaneous image classification and annotation. Specifically, we propose SupDocNADE, a supervised extension of DocNADE, that increases the discriminative power of the hidden topic features by incorporating label information into the training objective of the model and show how to employ SupDocNADE to learn a joint representation from image visual words, annotation words and class label information. We also describe how to leverage information about the spatial position of the visual words for SupDocNADE to achieve better performance in a simple, yet effective manner. We test our model on the LabelMe and UIUC-Sports datasets and show that it compares favorably to other topic models such as the supervised variant of LDA and a Spatial Matching Pyramid (SPM) approach.
引用
收藏
页码:1370 / 1377
页数:8
相关论文
共 50 条
  • [1] A Deep and Autoregressive Approach for Topic Modeling of Multimodal Data
    Zheng, Yin
    Zhang, Yu-Jin
    Larochelle, Hugo
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2016, 38 (06) : 1056 - 1069
  • [2] Artificial intelligence and multimodal data fusion for smart healthcare: topic modeling and bibliometrics
    Xieling Chen
    Haoran Xie
    Xiaohui Tao
    Fu Lee Wang
    Mingming Leng
    Baiying Lei
    [J]. Artificial Intelligence Review, 57
  • [3] Artificial intelligence and multimodal data fusion for smart healthcare: topic modeling and bibliometrics
    Chen, Xieling
    Xie, Haoran
    Tao, Xiaohui
    Wang, Fu Lee
    Leng, Mingming
    Lei, Baiying
    [J]. ARTIFICIAL INTELLIGENCE REVIEW, 2024, 57 (04)
  • [4] Mining Massive Amounts of Genomic Data: A Semiparametric Topic Modeling Approach
    Fang, Ethan X.
    Li, Min-Dian
    Jordan, Michael I.
    Liu, Han
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2017, 112 (519) : 921 - 932
  • [5] A Topic Modeling Approach to Ranking
    Ding, Weicong
    Ishwar, Prakash
    Saligrama, Venkatesh
    [J]. ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 38, 2015, 38 : 214 - 222
  • [6] A Novel Approach for Classifying Gene Expression Data using Topic Modeling
    Kho, Soon Jye
    Yalamanchili, Hima Bindu
    Raymer, Michael L.
    Sheth, Amit P.
    [J]. ACM-BCB' 2017: PROCEEDINGS OF THE 8TH ACM INTERNATIONAL CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY,AND HEALTH INFORMATICS, 2017, : 388 - 393
  • [7] Autoregressive modeling approach of vibration data for bearing fault diagnosis in electric motors
    Ayaz, Emine
    [J]. JOURNAL OF VIBROENGINEERING, 2014, 16 (05) : 2130 - 2138
  • [8] AN AUTOREGRESSIVE APPROACH TO HOUSE PRICE MODELING
    Nagaraja, Chaitra H.
    Brown, Lawrence D.
    Zhao, Linda H.
    [J]. ANNALS OF APPLIED STATISTICS, 2011, 5 (01): : 124 - 149
  • [9] A systems approach for analysis of high content screening assay data with topic modeling
    Bisgin, Halil
    Chen, Minjun
    Wang, Yuping
    Kelly, Reagan
    Fang, Hong
    Xu, Xiaowei
    Tong, Weida
    [J]. BMC BIOINFORMATICS, 2013, 14
  • [10] Clustering of Business Organisations based on Textual Data - An LDA Topic Modeling Approach
    Tolner, Ferenc
    Takacs, Marta
    Eigner, Gyorgy
    Barta, Balazs
    [J]. 21ST IEEE INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND INFORMATICS (CINTI), 2021, : 79 - 84