A generic video parsing system with a scene description language (SDL)

Cited by: 8
Authors
Gong, YH [1]
Chuan, CH [1]
Zhu, YW [1]
Sakauchi, M [1]
Affiliation
[1] UNIV TOKYO, INST IND SCI, MINATO KU, TOKYO 106, JAPAN
DOI
10.1006/rtim.1996.0005
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Techniques for automatic video parsing and annotation are crucial for turning enormous volumes of video data into a rich and structured data type, and for facilitating content-based video search and retrieval. In this paper, we present a generic video parser with a Scene Description Language (SDL). The SDL enables the human operator to model a video clip at a relatively high level of abstraction. The video parser is equipped with various algorithms that are common and essential to general video analyses. To efficiently handle a video domain with virtually unlimited sets of unanticipated and variable objects and events, an object-oriented, processing-on-demand approach is devised to perform the video parsing. The video parser first interprets the video model defined by the operator, identifies the prominent video properties to be parsed, and then creates an entity for each of the video properties. Each entity knows how to find a match for itself among the video properties extracted from the video image. The video parser interacts with these entities and performs the feature extraction operations on a processing-on-demand basis. Each entity has a self-diagnostic function that can turn the entity into an inert state when it fails to find the necessary matches during the video parsing process. Inert entities are excluded from subsequent operations and no longer consume any system resources. Our experiments have shown that our generic video parser is effective and efficient in handling a large variety of video images. © 1996 Academic Press Limited
Pages: 45-59
Page count: 15
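
The entity scheme outlined in the abstract (one entity per video property, feature extraction only on demand, and a self-diagnostic that makes an entity inert) can be illustrated with a minimal Python sketch. This is not the authors' implementation and does not use SDL syntax: the `Entity` class, the `parse` function, the threshold-based match test, the "inert after `max_misses` consecutive misses" rule, and the toy frames and extractors are all assumptions made only for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class Entity:
    """Hypothetical entity for one video property named in the operator's model."""
    name: str
    feature: str          # name of the extracted feature this entity needs
    threshold: float      # assumed match rule: feature value >= threshold
    max_misses: int = 2   # assumed rule: consecutive misses before going inert
    misses: int = 0
    inert: bool = False
    matches: list = field(default_factory=list)

    def try_match(self, frame_index, features):
        """Look for a match in this frame's features; self-diagnose on failure."""
        if features[self.feature] >= self.threshold:
            self.matches.append(frame_index)
            self.misses = 0
        else:
            self.misses += 1
            if self.misses >= self.max_misses:
                self.inert = True  # skipped in all later frames


def parse(frames, entities, extractors):
    """Run only the extractors that some still-active entity asks for."""
    calls = {name: 0 for name in extractors}
    for index, frame in enumerate(frames):
        active = [e for e in entities if not e.inert]
        needed = {e.feature for e in active}
        features = {}
        for name in needed:
            features[name] = extractors[name](frame)
            calls[name] += 1
        for entity in active:
            entity.try_match(index, features)
    return calls


# Toy "frames": dictionaries standing in for decoded video images.
frames = [
    {"brightness": 0.9, "motion": 0.1},
    {"brightness": 0.8, "motion": 0.0},
    {"brightness": 0.7, "motion": 0.2},
    {"brightness": 0.9, "motion": 0.1},
]
extractors = {
    "brightness": lambda frame: frame["brightness"],
    "motion": lambda frame: frame["motion"],
}
entities = [
    Entity("bright_scene", "brightness", 0.5),
    Entity("fast_motion", "motion", 0.5),
]
calls = parse(frames, entities, extractors)
```

In this toy run the `fast_motion` entity misses on the first two frames and becomes inert, so the `motion` extractor is called for only two of the four frames, while `brightness` is extracted for all four. That is the resource saving the abstract attributes to excluding inert entities from subsequent operations.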