Video Caption Extraction Using Spatio-Temporal Slices

Cited by: 5
Authors
Chen, Liang-Hua [1]
Su, Chih-Wen [2]
Affiliations
[1] Fu Jen Catholic Univ, Dept Comp Sci & Informat Engn, New Taipei, Taiwan
[2] Chung Yuan Christian Univ, Dept Informat & Comp Engn, Chungli, Taiwan
Keywords
Video content analysis; caption detection; spatio-temporal slice
DOI
10.1142/S0219467818500092
Chinese Library Classification number
TP31 [Computer Software]
Discipline classification code
081202; 0835
Abstract
Captions in videos play an important role in video indexing and retrieval. In this paper, we propose a novel algorithm to extract multilingual captions from video. Our approach is based on the analysis of spatio-temporal slices of video. If a horizontal (or vertical) scan line contains pixels of a caption region, then the corresponding spatio-temporal slice exhibits bar-code-like patterns. By integrating the structural information of the bar-code-like patterns in horizontal and vertical slices, the spatial and temporal positions of video captions can be located accurately. Experimental results show that the proposed algorithm is effective and outperforms some existing techniques.
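To illustrate the bar-code observation in the abstract, the following is a minimal sketch and not the authors' algorithm. It assumes a grayscale video stored as a NumPy array of shape (frames, height, width) and a caption that stays static over the frames. The helper names `horizontal_slice` and `barcode_score`, the synthetic test video, and the thresholds `stable_tol` and `min_contrast` are all hypothetical choices made for this example.

```python
import numpy as np

def horizontal_slice(video, y):
    """Spatio-temporal slice for scan line y: one row per frame, shape (frames, width)."""
    return video[:, y, :]

def barcode_score(st_slice, stable_tol=1.0, min_contrast=50.0):
    """Fraction of adjacent column pairs that look like bar-code bars.

    A pair counts when both columns barely change over time (static text)
    and their mean intensities differ strongly (text versus gap).
    Both thresholds are illustrative assumptions.
    """
    s = st_slice.astype(np.float64)
    stable = s.std(axis=0) < stable_tol            # constant along the time axis
    mean_row = s.mean(axis=0)
    strong_edge = np.abs(np.diff(mean_row)) > min_contrast
    bars = stable[:-1] & stable[1:] & strong_edge
    return bars.mean()

# Synthetic video: noisy background that changes in every frame.
rng = np.random.default_rng(0)
frames, height, width = 30, 60, 100
video = rng.integers(0, 256, size=(frames, height, width)).astype(np.uint8)

# Static synthetic "caption": alternating bright/dark columns in rows 40..49.
stripes = np.where(np.arange(20, 80) % 2 == 0, 255, 0).astype(np.uint8)
video[:, 40:50, 20:80] = stripes

caption_score = barcode_score(horizontal_slice(video, 45))
background_score = barcode_score(horizontal_slice(video, 10))
print("caption row score:", caption_score)
print("background row score:", background_score)
```

In this toy setting, a scan line that crosses the caption scores far higher than one through the moving background. A vertical slice, `video[:, :, x]`, could be scored the same way, and combining both directions is what the abstract describes as the way to locate the caption in space and time.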
Pages: 11