Deep3D: Fully Automatic 2D-to-3D Video Conversion with Deep Convolutional Neural Networks

被引:266
|
作者
Xie, Junyuan [1 ]
Girshick, Ross [1 ]
Farhadi, Ali [1 ,2 ]
机构
[1] Univ Washington, Seattle, WA 98195 USA
[2] Allen Inst Artificial Intelligence, Seattle, WA USA
来源
关键词
Monocular stereo reconstruction; Deep convolutional neural networks; DEPTH; IMAGE;
D O I
10.1007/978-3-319-46493-0_51
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As 3D movie viewing becomes mainstream and the Virtual Reality (VR) market emerges, the demand for 3D contents is growing rapidly. Producing 3D videos, however, remains challenging. In this paper we propose to use deep neural networks to automatically convert 2D videos and images to a stereoscopic 3D format. In contrast to previous automatic 2D-to-3D conversion algorithms, which have separate stages and need ground truth depth map as supervision, our approach is trained end-to-end directly on stereo pairs extracted from existing 3D movies. This novel training scheme makes it possible to exploit orders of magnitude more data and significantly increases performance. Indeed, Deep3D outperforms baselines in both quantitative and human subject evaluations.
引用
收藏
页码:842 / 857
页数:16
相关论文
共 50 条
  • [1] Automatic 2D-to-3D video conversion using 3D densely connected convolutional networks
    Chen, Bei
    Yuan, Jiabin
    Bao, Xiuping
    [J]. 2019 IEEE 31ST INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2019), 2019, : 361 - 367
  • [2] AUTOMATIC 2D-TO-3D CONVERSION USING MULTI-SCALE DEEP NEURAL NETWORK
    Lee, Jiyoung
    Jung, Hyungjoo
    Kim, Youngjung
    Sohn, Kwanghoon
    [J]. 2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 730 - 734
  • [3] A NOVEL METHOD FOR AUTOMATIC 2D-TO-3D VIDEO CONVERSION
    Yan, Youwei
    Xu, Feng
    Dai, Qionghai
    Liu, Xiaodong
    [J]. 2010 3DTV-CONFERENCE: THE TRUE VISION - CAPTURE, TRANSMISSION AND DISPLAY OF 3D VIDEO (3DTV-CON 2010), 2010,
  • [4] 3D-TV Content Creation: Automatic 2D-to-3D Video Conversion
    Zhang, Liang
    Vazquez, Carlos
    Knorr, Sebastian
    [J]. IEEE TRANSACTIONS ON BROADCASTING, 2011, 57 (02) : 372 - 383
  • [5] Learning-Based, Automatic 2D-to-3D Image and Video Conversion
    Konrad, Janusz
    Wang, Meng
    Ishwar, Prakash
    Wu, Chen
    Mukherjee, Debargha
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2013, 22 (09) : 3485 - 3496
  • [6] A Distributed 2D-to-3D Video Conversion System
    Zhang Zhebin
    Zhang Ji'an
    Zhang Xuexi
    Wang Yizhou
    Gao Wen
    [J]. CHINA COMMUNICATIONS, 2013, 10 (05) : 30 - 38
  • [7] Enhanced Motion Cues for Automatic Depth Extraction for 2D-to-3D Video Conversion
    Alves, Gustavo de O. e
    da Silva, Eduardo A. B.
    [J]. 2014 INTERNATIONAL TELECOMMUNICATIONS SYMPOSIUM (ITS), 2014,
  • [8] Foreground Depth Estimation for Semi-automatic 2D-to-3D Video Conversion
    Lie, Wen-Nung
    Chen, Yi-Kai
    Chiang, Jui-Chiu
    [J]. 2018 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2018, : 1077 - 1081
  • [9] Review on 2D-to-3D Image and Video Conversion Methods
    Patil, Shweta
    Charles, Priya
    [J]. 1ST INTERNATIONAL CONFERENCE ON COMPUTING COMMUNICATION CONTROL AND AUTOMATION ICCUBEA 2015, 2015, : 728 - 732
  • [10] An Automatic Depth Map Generation for 2D-to-3D Conversion
    Dong, Hao
    Yin, Shouyi
    Xu, Weizhi
    Zhang, Zhen
    Shi, Rui
    Liu, Leibo
    Wei, Shaojun
    [J]. 18TH IEEE INTERNATIONAL SYMPOSIUM ON CONSUMER ELECTRONICS (ISCE 2014), 2014,