The Long-Short Story of Movie Description

被引:56
|
作者
Rohrbach, Anna [1 ]
Rohrbach, Marcus [2 ,3 ]
Schiele, Bernt [1 ]
机构
[1] Max Planck Inst Informat, D-66123 Saarbrucken, Germany
[2] UC Berkeley EECS, Berkeley, CA USA
[3] ICSI, Berkeley, CA USA
来源
关键词
D O I
10.1007/978-3-319-24947-6_17
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Generating descriptions for videos has many applications including assisting blind people and human-robot interaction. The recent advances in image captioning as well as the release of large-scale movie description datasets such as MPII-MD [28] and M-VAD [31] allow to study this task in more depth. Many of the proposed methods for image captioning rely on pre-trained object classifier CNNs and Long Short-Term Memory recurrent networks (LSTMs) for generating descriptions. While image description focuses on objects, we argue that it is important to distinguish verbs, objects, and places in the setting of movie description. In this work we show how to learn robust visual classifiers from the weak annotations of the sentence descriptions. Based on these classifiers we generate a description using an LSTM. We explore different design choices to build and train the LSTM and achieve the best performance to date on the challenging MPII-MD and M-VAD datasets. We compare and analyze our approach and prior work along various dimensions to better understand the key challenges of the movie description task.
引用
收藏
页码:209 / 221
页数:13
相关论文
共 50 条
  • [41] THE SHORT-STORY - THE LONG AND THE SHORT OF IT
    PRATT, ML
    POETICS, 1981, 10 (2-3) : 175 - 194
  • [42] THE SHORT-STORY - THE LONG AND THE SHORT OF IT
    PRATT, ML
    PAMIETNIK LITERACKI, 1989, 80 (02): : 349 - 369
  • [43] Long short story (The book business, editors, and the short story)
    Miller, L
    NEW YORK TIMES BOOK REVIEW, 2003, : 35 - 35
  • [44] EDA with Switching Distributions for Long-Short Portfolio Replication Problems
    Shibata, Shunsuke
    Orito, Yukiko
    Hanada, Yoshiko
    Yamamoto, Hisashi
    2013 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC 2013), 2013, : 427 - 432
  • [45] 'To Cut a Long Story Short'
    Tayler, C
    TLS-THE TIMES LITERARY SUPPLEMENT, 2000, (5061): : 23 - 23
  • [46] To make a long story short
    Warren, PH
    METEORITICS & PLANETARY SCIENCE, 2001, 36 (07): : 867 - 868
  • [47] THE LONG-SHORT DAY REQUIREMENT FOR FLOWERING IN STYLOSANTHES-GUIANENSIS
    TRONGKONGSIN, K
    HUMPHREYS, LR
    AUSTRALIAN JOURNAL OF AGRICULTURAL RESEARCH, 1988, 39 (02): : 199 - 207
  • [48] Uniform Attractor for the Fractional Nonautonomous Long-Short Wave Equations
    Ge, Huanmin
    Xin, Jie
    DISCRETE DYNAMICS IN NATURE AND SOCIETY, 2014, 2014
  • [49] Settlement calculation for long-short composite piled raft foundation
    Zhao Ming-hua
    Zhang Ling
    Yang Ming-hui
    JOURNAL OF CENTRAL SOUTH UNIVERSITY OF TECHNOLOGY, 2006, 13 (06): : 749 - 754
  • [50] Do size and sector classification matter for long-short strategies?
    Kawasaki, Y
    Udaka, H
    Hirano, T
    COMPUTATIONAL FINANCE AND ITS APPLICATIONS, 2004, : 3 - 11