The Long-Short Story of Movie Description

被引:56
|
作者
Rohrbach, Anna [1 ]
Rohrbach, Marcus [2 ,3 ]
Schiele, Bernt [1 ]
机构
[1] Max Planck Inst Informat, D-66123 Saarbrucken, Germany
[2] UC Berkeley EECS, Berkeley, CA USA
[3] ICSI, Berkeley, CA USA
来源
关键词
D O I
10.1007/978-3-319-24947-6_17
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Generating descriptions for videos has many applications including assisting blind people and human-robot interaction. The recent advances in image captioning as well as the release of large-scale movie description datasets such as MPII-MD [28] and M-VAD [31] allow to study this task in more depth. Many of the proposed methods for image captioning rely on pre-trained object classifier CNNs and Long Short-Term Memory recurrent networks (LSTMs) for generating descriptions. While image description focuses on objects, we argue that it is important to distinguish verbs, objects, and places in the setting of movie description. In this work we show how to learn robust visual classifiers from the weak annotations of the sentence descriptions. Based on these classifiers we generate a description using an LSTM. We explore different design choices to build and train the LSTM and achieve the best performance to date on the challenging MPII-MD and M-VAD datasets. We compare and analyze our approach and prior work along various dimensions to better understand the key challenges of the movie description task.
引用
收藏
页码:209 / 221
页数:13
相关论文
共 50 条
  • [31] A Generalized Framework for Simultaneous Long-Short Feedback Trading
    O'Brien, Joseph D.
    Burke, Mark E.
    Burke, Kevin
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2021, 66 (06) : 2652 - 2663
  • [32] HERMITE SPECTRAL METHOD FOR LONG-SHORT WAVE EQUATIONS
    Lu, Shujuan
    Liu, Zeting
    Feng, Zhaosheng
    DISCRETE AND CONTINUOUS DYNAMICAL SYSTEMS-SERIES B, 2019, 24 (02): : 941 - 964
  • [33] Online Portfolio Selection with Long-Short Term Forecasting
    Li R.
    Liu J.
    Operations Research Forum, 3 (4)
  • [34] How Skewed are Simultaneously Long-Short Trading Gains?
    Baumann, Michael Heinrich
    2023 EUROPEAN CONTROL CONFERENCE, ECC, 2023,
  • [35] Long-Short Term Policy for Visual Object Navigation
    Bai, Yubing
    Song, Xinhang
    Li, Weijie
    Zhang, Sixian
    Jiang, Shuqiang
    2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2023, : 9035 - 9042
  • [36] An adaptive evolutionary strategy for long-short portfolio construction
    di Tollo, Giacomo
    Fattoruso, Gerarda
    Filograsso, Gianni
    DECISIONS IN ECONOMICS AND FINANCE, 2024,
  • [37] LONG-SHORT TEMPORAL MODELING FOR EFFICIENT ACTION RECOGNITION
    Wu, Liyu
    Zou, Yuexian
    Zhang, Can
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 2435 - 2439
  • [38] A characterization of long-short trading strategies based on cointegration
    Kawasaki, Y
    Tachiki, S
    Udaka, H
    Hirano, T
    2003 IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE FOR FINANCIAL ENGINEERING, PROCEEDINGS, 2003, : 411 - 416
  • [39] Effect of bolt with long-short layout on slope stability
    Lin, Hang
    Chen, Baocheng
    Fan, Xiang
    Li, Jiangteng
    Wu, Qihong
    Zhongnan Daxue Xuebao (Ziran Kexue Ban)/Journal of Central South University (Science and Technology), 2015, 46 (02): : 625 - 630
  • [40] Solitons and other solutions of the long-short wave equation
    Ebadi, Ghodrat
    Mojaver, Aida
    Kumar, Sachin
    Biswas, Anjan
    INTERNATIONAL JOURNAL OF NUMERICAL METHODS FOR HEAT & FLUID FLOW, 2015, 25 (01) : 129 - 145