A generic video parsing system with a scene description language (SDL)

Cited by: 8

Authors:

Gong, YH [1]

Chuan, CH [1]

Zhu, YW [1]

Sakauchi, M [1]

Affiliation:

[1] UNIV TOKYO,INST IND SCI,MINATO KU,TOKYO 106,JAPAN

Source:

REAL-TIME IMAGING | 1996 / Vol. 2 / Issue 01

Keywords:

DOI:

10.1006/rtim.1996.0005

Chinese Library Classification (CLC) number:

TP18 [Theory of artificial intelligence];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Techniques for automatic video parsing and annotation are crucial for turning enormous volumes of video data into a rich, structured data type and for facilitating content-based video search and retrieval. In this paper, we present a generic video parser with a Scene Description Language (SDL). The SDL enables the human operator to model a video clip at a relatively high level of abstraction. The video parser is equipped with various algorithms that are common and essential to general video analysis. To efficiently handle a video domain with virtually unlimited sets of unanticipated and variable objects and events, an object-oriented, processing-on-demand approach is devised to perform the video parsing. The video parser first interprets the video model defined by the operator, identifies the prominent video properties to be parsed, and then creates an entity for each of these properties. Each entity knows how to find a match for itself among the video properties extracted from the video image. The video parser interacts with these entities and performs the feature extraction operations on a processing-on-demand basis. Each entity has a self-diagnostic function that can switch it into an inert state when it fails to find the necessary matches during the video parsing process. Inert entities are excluded from subsequent operations and no longer consume any system resources. Our experiments have shown that our generic video parser is effective and efficient in handling a large variety of video images. (C) 1996 Academic Press Limited
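
The entity mechanism described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation or the SDL itself: the `Entity` class, the `parse` function, the feature names, and the rule "go inert after `max_misses` consecutive misses" are all assumptions made for illustration. The sketch only shows the general idea that feature extraction runs on demand for features an active entity still needs, and that an entity that keeps failing to match removes itself from further processing.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class Entity:
    """One video property from the operator's model (hypothetical structure)."""
    name: str
    feature: str                      # feature this entity needs extracted
    matches: Callable[[float], bool]  # test applied to the extracted value
    max_misses: int = 2               # assumed threshold before going inert
    misses: int = 0
    inert: bool = False
    hits: List[int] = field(default_factory=list)

    def observe(self, frame_index: int, value: float) -> None:
        """Record a match, or count a miss and self-diagnose."""
        if self.matches(value):
            self.hits.append(frame_index)
            self.misses = 0
        else:
            self.misses += 1
            # Assumed self-diagnostic rule: too many consecutive misses.
            if self.misses >= self.max_misses:
                self.inert = True


def parse(frames: List[dict],
          extractors: Dict[str, Callable[[dict], float]],
          entities: List[Entity]) -> Dict[str, int]:
    """Run each extractor only when some active entity needs its feature.

    Returns how many times each extractor was actually called.
    """
    calls = {name: 0 for name in extractors}
    for i, frame in enumerate(frames):
        active = [e for e in entities if not e.inert]
        needed = {e.feature for e in active}
        values = {}
        for name in needed:           # processing on demand
            values[name] = extractors[name](frame)
            calls[name] += 1
        for e in active:
            e.observe(i, values[e.feature])
    return calls


# Toy "frames" with precomputed numbers standing in for image data.
frames = [
    {"brightness": 0.9, "motion": 0.1},
    {"brightness": 0.8, "motion": 0.0},
    {"brightness": 0.7, "motion": 0.2},
    {"brightness": 0.9, "motion": 0.1},
]
extractors = {
    "brightness": lambda f: f["brightness"],
    "motion": lambda f: f["motion"],
}
bright = Entity("bright_scene", "brightness", lambda v: v > 0.5)
moving = Entity("fast_motion", "motion", lambda v: v > 0.5)

calls = parse(frames, extractors, [bright, moving])
print(calls, bright.hits, moving.inert)
```

In this toy run the `fast_motion` entity never matches, becomes inert after two frames, and the `motion` extractor is no longer invoked for the remaining frames, which is the resource saving the abstract attributes to inert entities.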


Pages: 45-59

Page count: 15
