Automatic Concept Discovery from Parallel Text and Visual Corpora

Cited by: 73
Authors
Sun, Chen [1]
Gan, Chuang [2]
Nevatia, Ram [1]
Affiliations
[1] Univ Southern Calif, Los Angeles, CA 90089 USA
[2] Tsinghua Univ, Beijing, Peoples R China
DOI
10.1109/ICCV.2015.298
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Humans connect language and vision to perceive the world. How can a similar connection be built for computers? One possible way is via visual concepts, which are text terms that relate to visually discriminative entities. We propose an automatic visual concept discovery algorithm using parallel text and visual corpora; it filters text terms based on the visual discriminative power of the associated images, and groups them into concepts using visual and semantic similarities. We illustrate the applications of the discovered concepts on a bidirectional image and sentence retrieval task and an image tagging task, and show that the discovered concepts not only outperform several large sets of manually selected concepts significantly, but also achieve state-of-the-art performance in the retrieval task.
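The two steps named in the abstract (filtering terms by visual discriminative power, then grouping them by visual and semantic similarity) can be illustrated with a minimal sketch. This is not the authors' implementation: the per-term scores, the toy feature vectors, the thresholds `min_score` and `min_sim`, the mixing weight `alpha`, and the greedy grouping rule are all assumptions chosen only to make the idea concrete.

```python
from math import sqrt


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = sqrt(sum(a * a for a in u))
    norm_v = sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def discover_concepts(terms, min_score=0.5, min_sim=0.8, alpha=0.5):
    """Filter terms, then group them into concepts.

    terms maps a term to (score, visual_vec, semantic_vec). The score is a
    hypothetical stand-in for visual discriminative power.
    """
    # Step 1: keep only terms whose (assumed) discriminative score is high enough.
    kept = [t for t, (score, _, _) in terms.items() if score >= min_score]

    # Step 2: greedy grouping (an assumption, not the paper's clustering method).
    # A term joins the first group whose seed term is similar enough under a
    # weighted mix of visual and semantic cosine similarity.
    concepts = []
    for term in kept:
        _, vis, sem = terms[term]
        for group in concepts:
            _, seed_vis, seed_sem = terms[group[0]]
            sim = alpha * cosine(vis, seed_vis) + (1 - alpha) * cosine(sem, seed_sem)
            if sim >= min_sim:
                group.append(term)
                break
        else:
            concepts.append([term])
    return concepts


# Toy data: all scores and vectors are made up for illustration.
example_terms = {
    "dog": (0.9, [1.0, 0.0], [1.0, 0.1]),
    "puppy": (0.8, [0.9, 0.1], [0.9, 0.2]),
    "beautiful": (0.2, [0.5, 0.5], [0.1, 0.9]),  # low score: filtered out
    "bicycle": (0.85, [0.0, 1.0], [0.0, 1.0]),
}
example_concepts = discover_concepts(example_terms)
print(example_concepts)
```

On this toy input, "beautiful" is dropped by the filter, "dog" and "puppy" merge into one concept, and "bicycle" forms its own.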
Pages: 2596-2604
Page count: 9