Exploring inter-frame correlation analysis and wavelet-domain modeling for real-time caption detection in streaming video

Authors
Li, Jia [1, 2]
Tian, Yonghong [3]
Gao, Wen [1, 3]
Affiliations
[1] Chinese Acad Sci, Inst Comp Technol, Key Lab Intelligent Informat Proc, Beijing 100864, Peoples R China
[2] Chinese Acad Sci, Grad Univ, Beijing, Peoples R China
[3] Peking Univ, Inst Digital Media, Beijing, Peoples R China
Keywords
caption detection; inter-frame correlation; generalized Gaussian model; streaming video analysis
DOI
10.1117/12.759571
Chinese Library Classification number
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In recent years, the amount of streaming video on the Web has grown rapidly. Retrieving these videos poses the challenge of indexing and analyzing the media in real time, because the streams must be treated as effectively infinite in length, which precludes offline processing. Captions are important semantic clues for video indexing and retrieval. However, existing caption detection methods often have difficulty performing real-time detection on streaming video, and few of them address the differentiation of captions from scene texts and scrolling texts, even though these text types play different roles in streaming video retrieval. To overcome these difficulties, this paper proposes a novel approach that explores inter-frame correlation analysis and wavelet-domain modeling for real-time caption detection in streaming video. In our approach, inter-frame correlation information is used to distinguish caption texts from scene texts and scrolling texts. Moreover, wavelet-domain Generalized Gaussian Models (GGMs) are used to automatically remove non-text regions from each frame and keep only caption regions for further processing. Experimental results show that our approach offers real-time caption detection with high recall and a low false alarm rate, and can also effectively discern caption texts from other texts even at low resolutions.
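The two ingredients named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation, since the abstract gives no algorithmic details. It assumes that inter-frame correlation is measured as the Pearson correlation between co-located pixel blocks in consecutive frames, and that the generalized Gaussian shape parameter is fitted by moment matching on a list of wavelet coefficients. The function names, the bisection bounds, and the synthetic data are all hypothetical.

```python
import math
import random


def block_correlation(block_a, block_b):
    """Pearson correlation between two equally sized pixel blocks (flat lists)."""
    n = len(block_a)
    mean_a = sum(block_a) / n
    mean_b = sum(block_b) / n
    cov = sum((a - mean_a) * (b - mean_b) for a, b in zip(block_a, block_b))
    var_a = sum((a - mean_a) ** 2 for a in block_a)
    var_b = sum((b - mean_b) ** 2 for b in block_b)
    if var_a == 0 or var_b == 0:
        return 0.0  # flat block: correlation is undefined, treat as uncorrelated
    return cov / math.sqrt(var_a * var_b)


def ggd_moment_ratio(beta):
    """(E|x|)^2 / E[x^2] for a zero-mean generalized Gaussian with shape beta."""
    return math.gamma(2 / beta) ** 2 / (math.gamma(1 / beta) * math.gamma(3 / beta))


def estimate_ggd_shape(coeffs, lo=0.1, hi=10.0, iters=60):
    """Fit the shape parameter by matching the moment ratio (assumed estimator).

    The ratio increases with beta, so bisection on [lo, hi] is sufficient.
    The bounds are illustrative assumptions, not values from the paper.
    """
    m1 = sum(abs(c) for c in coeffs) / len(coeffs)
    m2 = sum(c * c for c in coeffs) / len(coeffs)
    if m2 == 0:
        raise ValueError("all coefficients are zero; shape is undefined")
    target = m1 * m1 / m2
    for _ in range(iters):
        mid = (lo + hi) / 2
        if ggd_moment_ratio(mid) < target:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2


# Synthetic demonstration (made-up data, not from the paper).
rng = random.Random(0)
caption_block = [rng.randint(0, 255) for _ in range(64)]   # 8x8 block, frame t
static_next = list(caption_block)                          # same block, frame t+1
scrolled_next = caption_block[8:] + caption_block[:8]      # content shifted by one row

print("static caption correlation:", block_correlation(caption_block, static_next))
print("scrolling text correlation:", block_correlation(caption_block, scrolled_next))

gaussian_coeffs = [rng.gauss(0.0, 1.0) for _ in range(20000)]
print("estimated shape for Gaussian data:", estimate_ggd_shape(gaussian_coeffs))
```

In this sketch, a block that stays fixed across frames yields a correlation close to 1, whereas shifted content generally does not. The fitted shape parameter is near 2 for Gaussian-like coefficients and near 1 for Laplacian-like ones. How such statistics are thresholded to separate caption regions from non-text regions is not specified in the abstract and is left open here.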
Pages: 12