Automatic Concept Discovery from Parallel Text and Visual Corpora

被引:73
|
作者
Sun, Chen [1 ]
Gan, Chuang [2 ]
Nevatia, Ram [1 ]
机构
[1] Univ Southern Calif, Los Angeles, CA 90089 USA
[2] Tsinghua Univ, Beijing, Peoples R China
关键词
D O I
10.1109/ICCV.2015.298
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Humans connect language and vision to perceive the world. How to build a similar connection for computers? One possible way is via visual concepts, which are text terms that relate to visually discriminative entities. We propose an automatic visual concept discovery algorithm using parallel text and visual corpora; it filters text terms based on the visual discriminative power of the associated images, and groups them into concepts using visual and semantic similarities. We illustrate the applications of the discovered concepts using bidirectional image and sentence retrieval task and image tagging task, and show that the discovered concepts not only outperform several large sets of manually selected concepts significantly, but also achieves the state-of-the-art performance in the retrieval task.
引用
收藏
页码:2596 / 2604
页数:9
相关论文
共 50 条
  • [1] Automatic Visual Theme Discovery from Joint Image and Text Corpora
    Sun, Ke
    Hou, Xianxu
    Zhang, Qian
    Qiu, Guoping
    2017 2ND INTERNATIONAL CONFERENCE ON MULTIMEDIA AND IMAGE PROCESSING (ICMIP), 2017, : 220 - 224
  • [2] Discovery of Treatments from Text Corpora
    Fong, Christian
    Grimmer, Justin
    PROCEEDINGS OF THE 54TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1, 2016, : 1600 - 1609
  • [3] A parallel system architecture for biology and medical concept discovery from biological corpora
    Kapsokalibas, Leonidas
    Makris, Christos
    Perdikuri, Katerina
    Theodoridis, Evangelos
    Tsakalidis, Athanasios
    RECENT PROGRESS IN COMPUTATIONAL SCIENCES AND ENGINEERING, VOLS 7A AND 7B, 2006, 7A-B : 1064 - 1068
  • [4] Automatic creation of WordNets from parallel corpora
    Oliver, Antoni
    Climent, Salvador
    LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2014, : 1112 - 1116
  • [5] Learning visual entities and their visual attributes from text corpora
    Boiy, Erik
    Deschacht, Koen
    Moens, Marie-Francine
    DEXA 2008: 19TH INTERNATIONAL CONFERENCE ON DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2008, : 48 - 53
  • [6] Discovery of event entailment knowledge from text corpora
    Pekar, Viktor
    COMPUTER SPEECH AND LANGUAGE, 2008, 22 (01): : 1 - 16
  • [7] Automatic discovery of translation collocations from bilingual corpora
    Barrachina, S
    Vilar, JM
    ECAI 2004: 16TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2004, 110 : 571 - 575
  • [8] Automatic discovery of concepts from text
    Chin, Ong Siou
    Kulathuramaiyer, Narayanan
    Yeo, Alvin W.
    2006 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE, (WI 2006 MAIN CONFERENCE PROCEEDINGS), 2006, : 1046 - +
  • [9] Automatic Parallel Corpora and Bilingual Terminology extraction from Parallel WebSites
    Almeida, Jose Joao
    Simoes, Alberto
    LREC 2010 - SEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2010, : 50 - 55
  • [10] MetaPAD: Meta Pattern Discovery from Massive Text Corpora
    Jiang, Meng
    Shang, Jingbo
    Cassidy, Taylor
    Ren, Xiang
    Kaplan, Lance M.
    Hanratty, Timothy P.
    Han, Jiawei
    KDD'17: PROCEEDINGS OF THE 23RD ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2017, : 877 - 886