Caption Detection and Positioning in Digital Video

Cited by: 0
Authors
Wang, Zhujun [1]
Lan, Shanzhen [1]
Yang, Lei [1]
Zhang, Yue [1]
Affiliations
[1] Communication University of China, Digital Media Technology Department, Beijing 100024, People's Republic of China
Keywords
Caption detection; Shot segmentation; Caption positioning; Edge detection
DOI
10.1007/978-3-642-54924-3_99
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Captions in digital video are of great importance for the semantic understanding of video. To detect and position captions, this paper first selects the frames that contain captions, then uses edge and projection characteristics to locate the caption area within each caption frame. Because the algorithm avoids processing every frame image, it greatly improves processing speed. Experiments on different kinds of videos demonstrate that the algorithm has high accuracy.
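The positioning step described in the abstract (edges followed by projection) can be illustrated with a minimal sketch. This is not the authors' implementation: the record gives no algorithmic details, so the finite-difference edge detector, the function names `edge_map`, `longest_run`, and `locate_caption`, and the parameters `edge_thresh` and `ratio` are all assumptions chosen for illustration. The frame-selection step is not covered.

```python
import numpy as np


def edge_map(gray, edge_thresh=40.0):
    """Binary edge map from absolute finite differences (assumed detector)."""
    g = gray.astype(np.float64)
    gx = np.zeros_like(g)
    gy = np.zeros_like(g)
    gx[:, 1:] = np.abs(g[:, 1:] - g[:, :-1])  # horizontal intensity change
    gy[1:, :] = np.abs(g[1:, :] - g[:-1, :])  # vertical intensity change
    return (gx + gy) > edge_thresh


def longest_run(mask):
    """Return (start, end) of the longest run of True values, end exclusive."""
    best = None
    start = None
    # A trailing False closes any run that reaches the end of the mask.
    for i, value in enumerate(list(mask) + [False]):
        if value and start is None:
            start = i
        elif not value and start is not None:
            if best is None or i - start > best[1] - best[0]:
                best = (start, i)
            start = None
    return best


def locate_caption(gray, edge_thresh=40.0, ratio=0.6):
    """Estimate a caption box (top, bottom, left, right), or None if no edges.

    Rows are selected by horizontal projection of the edge map; columns are
    selected by vertical projection restricted to the chosen row band.
    """
    edges = edge_map(gray, edge_thresh)
    row_profile = edges.sum(axis=1)  # edge count per row
    if row_profile.max() == 0:
        return None
    top, bottom = longest_run(row_profile >= ratio * row_profile.max())
    col_profile = edges[top:bottom].sum(axis=0)  # edge count per column
    cols = np.flatnonzero(col_profile > 0)
    return top, bottom, int(cols[0]), int(cols[-1]) + 1


if __name__ == "__main__":
    # Synthetic frame: dark background with a band of bright vertical strokes.
    frame = np.zeros((60, 100), dtype=np.uint8)
    frame[40:50, 20:80:2] = 255
    print(locate_caption(frame))
```

The sketch relies on the observation that text regions produce dense edges, so rows and columns crossing a caption stand out in the projection profiles. Both thresholds would need tuning on real video frames.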
Pages: 1051-1059
Page count: 9