Conceptual representations between video signals and natural language descriptions

被引：16

作者：

Arens, M. ^{[1
]}

Gerber, R. ^{[1
]}

Nagel, H. -H. ^{[1
]}

机构：

[1] Univ Karlsruhe TH, Fak Informat, Inst Algorithmen & Kognit Syst, D-76128 Karlsruhe, Germany

来源：

IMAGE AND VISION COMPUTING | 2008年 / 26卷 / 01期

关键词：

cognitive vision; knowledge representation;

D O I：

10.1016/j.imavis.2005.07.026

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

An artificial cognitive vision system associates video signals with conceptual descriptions of the depicted time-varying scene. This linkage is mediated by knowledge representation formalisms. An experimental implementation of such an approach yielded initial results for the conceptual description of videos recorded at innercity traffic scenes, see [M. Haag, H.-H. Nagel, Incremental recognition of traffic situations from video image sequences, Image and Vision Computing 18 (2) (2000) 137-153]. Accumulating experience with this system approach and its extension for the generation of natural language texts from videos caused us to redesign the overall computer vision system as well as the knowledge representation formalisms utilised within that system. (c) 2006 Elsevier B.V. All rights reserved.

引用

页码：53 / 66

页数：14

共 50 条

[1] Translating Video Content to Natural Language Descriptions
Rohrbach, Marcus
Qiu, Wei
Titov, Ivan
Thater, Stefan
Pinkal, Manfred
Schiele, Bernt
[J]. 2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, : 433 - 440
[2] Video Event Understanding using Natural Language Descriptions
Ramanathan, Vignesh
Liang, Percy
Li Fei-Fei
[J]. 2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, : 905 - 912
[3] A framework for creating natural language descriptions of video streams
Khan, Muhammad Usman Ghani
Al Harbi, Nouf
Gotoh, Yoshihiko
[J]. INFORMATION SCIENCES, 2015, 303 : 61 - 82
[4] Natural language descriptions of human Behavior from video sequences
Tena, Carles Fernandez
Baiget, Pau
Roca, Xavier
Gonzalez, Jordi
[J]. KI 2007: ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2007, 4667 : 279 - +
[5] Analyzing the Gap Between Workflows and their Natural Language Descriptions
Groth, Paul
Gil, Yolanda
[J]. 2009 IEEE CONGRESS ON SERVICES (SERVICES-1 2009), VOLS 1 AND 2, 2009, : 299 - 305
[6] Summarizing Conceptual Descriptions using Knowledge Representations
Harispe, Sebastien
Montmain, Jacky
Medjkoune, Massissilia
[J]. PROCEEDINGS OF 2016 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2016,
[7] Learning Spatial-Semantic Representations from Natural Language Descriptions and Scene Classifications
Hemachandra, Sachithra
Walter, Matthew R.
Tellex, Stefanie
Teller, Seth
[J]. 2014 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2014, : 2623 - 2630
[8] Learning Unified Video-Language Representations via Joint Modeling and Contrastive Learning for Natural Language Video Localization
Cui, Chenhao
Liang, Xinnian
Wu, Shuangzhi
Li, Zhoujun
[J]. 2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
[9] From Sensory Signals to Modality-Independent Conceptual Representations: A Probabilistic Language of Thought Approach
Erdogan, Goker
Yildirim, Ilker
Jacobs, Robert A.
[J]. PLOS COMPUTATIONAL BIOLOGY, 2015, 11 (11)
[10] Vision signals and the language of vision descriptions in the prophets
Carver, Daniel E.
[J]. JOURNAL FOR THE STUDY OF THE OLD TESTAMENT, 2021, 45 (03) : 371 - 387

← 1 2 3 4 5 →