Conceptual representations between video signals and natural language descriptions

Cited by: 16

Authors:

Arens, M. [1]

Gerber, R. [1]

Nagel, H.-H. [1]

Affiliation:

[1] Univ Karlsruhe TH, Fak Informat, Inst Algorithmen & Kognit Syst, D-76128 Karlsruhe, Germany

Source:

IMAGE AND VISION COMPUTING | 2008 / Vol. 26 / Issue 01

Keywords:

cognitive vision; knowledge representation;

DOI:

10.1016/j.imavis.2005.07.026

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

An artificial cognitive vision system associates video signals with conceptual descriptions of the depicted time-varying scene. This linkage is mediated by knowledge representation formalisms. An experimental implementation of such an approach yielded initial results for the conceptual description of videos recorded at inner-city traffic scenes, see [M. Haag, H.-H. Nagel, Incremental recognition of traffic situations from video image sequences, Image and Vision Computing 18 (2) (2000) 137-153]. Accumulating experience with this system approach and its extension for the generation of natural language texts from videos caused us to redesign the overall computer vision system as well as the knowledge representation formalisms utilised within that system. (c) 2006 Elsevier B.V. All rights reserved.

Pages: 53-66

Page count: 14

