Automatic Concept Discovery from Parallel Text and Visual Corpora

Cited by: 73
Authors
Sun, Chen [1]
Gan, Chuang [2]
Nevatia, Ram [1]
Affiliations
[1] Univ Southern Calif, Los Angeles, CA 90089 USA
[2] Tsinghua Univ, Beijing, Peoples R China
DOI
10.1109/ICCV.2015.298
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Humans connect language and vision to perceive the world. How can a similar connection be built for computers? One possible way is via visual concepts, which are text terms that relate to visually discriminative entities. We propose an automatic visual concept discovery algorithm using parallel text and visual corpora; it filters text terms based on the visual discriminative power of the associated images, and groups them into concepts using visual and semantic similarities. We illustrate the applications of the discovered concepts using a bidirectional image and sentence retrieval task and an image tagging task, and show that the discovered concepts not only significantly outperform several large sets of manually selected concepts, but also achieve state-of-the-art performance in the retrieval task.
Pages: 2596-2604
Number of pages: 9