Automatic Subtitling for Live 3D TV Transmissions by Real-Time Analysis of Spatio-Temporal Depth Map of the Scene

Cited by: 0
Authors
Bojar, Konrad [1]
Affiliations
[1] Ind Res Inst Automat & Measurements PIAP, Al Jerozolimskie 202, PL-02486 Warsaw, Poland
DOI
10.1007/978-3-319-48923-0_21
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
To maximize the viewer's experience and perception of a 3D TV transmission, a rule of thumb for subtitling 3D video content states that the subtitle should appear in front of the in-focus content for the entire time the subtitle is displayed. The main problem with live 3D transmissions containing subtitles, such as TV news or football matches, is that, besides the plain text and the pair of video streams acquired by a stereo rig, additional information must be calculated to determine the correct subtitle depth. Therefore, either every set-top box must determine this depth by itself, or the broadcaster must calculate it and provide it in the Disparity Signalling Segment (DSS). In this paper we present an algorithm for automatic subtitle depth estimation based on unsupervised spatio-temporal analysis of a stereoscopic pair of compressed video streams. The proposed algorithm first analyzes the texture in the left-eye and right-eye streams in the area where the subtitle should appear. The result of this analysis is a set of correspondences, that is, pairs of points corresponding to the same single point in the scene. Every correspondence yields a stereoscopic parallax vector, and the magnitude of this vector is inversely proportional to the depth of the point in the scene. It is shown how to effectively calculate the depth of the subtitle from the depth maps for every stereoscopic pair of frames in which the subtitle should appear. Latency problems and hardware aspects of a low-cost FPGA implementation of the algorithm are also discussed.
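The relationship the abstract describes between correspondences, parallax, and subtitle depth can be illustrated with a minimal Python sketch. It is not the paper's algorithm: it assumes a rectified, parallel stereo rig (so parallax is purely horizontal and depth follows Z = f·B/d), and the function names, the `margin_px` safety offset, and the `keep` percentile used to ignore outlier matches are all hypothetical choices made for illustration.

```python
def disparities(correspondences):
    """Horizontal parallax x_left - x_right for each matched point pair.

    Each correspondence is ((x_left, y_left), (x_right, y_right)).
    Assumption: rectified parallel rig, so larger disparity = closer point.
    """
    return [xl - xr for (xl, _yl), (xr, _yr) in correspondences]


def depth_from_disparity(d, focal_px, baseline_m):
    """Depth is inversely proportional to parallax magnitude: Z = f * B / d."""
    return focal_px * baseline_m / d


def subtitle_disparity(frames, margin_px=2.0, keep=0.95):
    """Pick one disparity that keeps the subtitle in front of the content.

    frames: one list of correspondences (from the subtitle area) per
    stereoscopic frame pair in which the subtitle is shown.
    keep: hypothetical percentile used to discard outlier matches.
    margin_px: hypothetical extra offset toward the viewer.
    """
    nearest_per_frame = []
    for correspondences in frames:
        ds = sorted(disparities(correspondences))
        if not ds:
            continue  # no texture matched in this frame
        idx = min(len(ds) - 1, int(keep * len(ds)))
        nearest_per_frame.append(ds[idx])
    if not nearest_per_frame:
        return None
    # The subtitle must be in front of the nearest content over its
    # whole exposure, so take the maximum across frames.
    return max(nearest_per_frame) + margin_px
```

Taking the maximum over all frames of the exposure reflects the rule of thumb that the subtitle stays in front of the content at all times. A real system would also have to bound the value to keep viewing comfortable, which this sketch omits.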
Pages: 163-171
Number of pages: 9
Related papers
50 in total
  • [1] Real-Time Spatio-Temporal Analysis of Dynamic Scenes in 3D Soccer Simulation
    Warden, Tobias
    Lattner, Andreas D.
    Visser, Ubbo
    [J]. ROBOCUP 2008: ROBOT SOCCER WORLD CUP XII, 2009, 5399 : 366 - +
  • [2] Real-time spatio-temporal analysis of dynamic scenes
    Tobias Warden
    Ubbo Visser
    [J]. Knowledge and Information Systems, 2012, 32 : 243 - 279
  • [3] Real-time 3D semantic map building in indoor scene
    Shan, Jichao
    Li, Xiuzhi
    Zhang, Xiangyin
    Jia, Songmin
    [J]. Yi Qi Yi Biao Xue Bao/Chinese Journal of Scientific Instrument, 2019, 40 (05): : 240 - 248
  • [4] 3D scene analysis by real-time stereovision
    Garibotto, G
    Cibei, C
    [J]. 2005 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), VOLS 1-5, 2005, : 1737 - 1740
  • [5] Real-time spatio-temporal analysis of dynamic scenes
    Warden, Tobias
    Visser, Ubbo
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2012, 32 (02) : 243 - 279
  • [6] Real-time scene background initialization based on spatio-temporal neighborhood exploration
    Wided Souidene Mseddi
    Marwa Jmal
    Rabah Attia
    [J]. Multimedia Tools and Applications, 2019, 78 : 7289 - 7319
  • [7] Real-time scene background initialization based on spatio-temporal neighborhood exploration
    Mseddi, Wided Souidene
    Jmal, Marwa
    Attia, Rabah
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (06) : 7289 - 7319
  • [8] Automatic, Real Time, Unsupervised Spatio-temporal 3D Object Detection Using RGB-D Cameras
    Alassaf, Manal H.
    Kowsari, Kamran
    Hahn, James K.
    [J]. 2015 19TH INTERNATIONAL CONFERENCE ON INFORMATION VISUALISATION IV 2015, 2015, : 444 - 449
  • [9] REAL-TIME DEPTH MAP GENERATION ARCHITECTURE FOR 3D VIDEOCONFERENCING
    Congote, John
    Barandiaran, Inigo
    Barandiaran, Javier
    Montserrat, Tomas
    Quelen, Julien
    Ferran, Christian
    Mindan, Pere J.
    Mur, Olga
    Tarres, Francesc
    Ruiz, Oscar
    [J]. 2010 3DTV-CONFERENCE: THE TRUE VISION - CAPTURE, TRANSMISSION AND DISPLAY OF 3D VIDEO (3DTV-CON 2010), 2010,
  • [10] A depth map representation for real-time transmission and view-based rendering of a dynamic 3D scene
    Chai, BB
    Sethuraman, S
    Sawhney, HS
    [J]. FIRST INTERNATIONAL SYMPOSIUM ON 3D DATA PROCESSING VISUALIZATION AND TRANSMISSION, 2002, : 107 - 114