Video description with subject, verb and object supervision

Cited by: 0
Authors
Yue W. [1 ]
Jinlai L. [1 ]
Xiaojie W. [1 ]
Affiliations
[1] School of Computer Science, Beijing University of Posts and Telecommunications, Beijing
Funding
National Natural Science Foundation of China
Keywords
CNN; DNN; LSTM; VD;
DOI
10.19682/j.cnki.1005-8885.2019.1006
Abstract
Video description aims to generate descriptive natural language for videos. Inspired by the deep neural networks (DNNs) used in machine translation, video description (VD) models apply a convolutional neural network (CNN) to extract video features and a long short-term memory (LSTM) network to generate descriptions. However, some models generate incorrect words and syntax, possibly because previous models rely only on the LSTM to generate sentences and therefore learn insufficient linguistic information. To solve this problem, an end-to-end DNN model incorporating subject, verb and object (SVO) supervision is proposed. Experimental results on a publicly available dataset, i.e. Youtube2Text, show that the model achieves a 58.4% consensus-based image description evaluation (CIDEr) score, outperforming the mean pool and video description with first feed (VD-FF) models and demonstrating the effectiveness of SVO supervision. © 2019, Beijing University of Posts and Telecommunications. All rights reserved.
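The SVO supervision described in the abstract can be sketched as an auxiliary loss term combined with the usual per-word caption loss of the LSTM decoder. The function names, loss form, and weighting below are illustrative assumptions for a minimal sketch, not the paper's exact formulation:

```python
import math

# Hedged sketch (not the authors' code): training is assumed to combine
# the per-word caption cross-entropy from the LSTM decoder with auxiliary
# cross-entropy losses supervising the predicted subject, verb and object.

def cross_entropy(probs, target_index):
    """Negative log-likelihood of the target class."""
    return -math.log(probs[target_index])

def total_loss(word_probs, word_targets, svo_probs, svo_targets, svo_weight=0.5):
    """Caption loss plus a weighted SVO supervision term (assumed form)."""
    caption_loss = sum(cross_entropy(p, t) for p, t in zip(word_probs, word_targets))
    svo_loss = sum(cross_entropy(p, t) for p, t in zip(svo_probs, svo_targets))
    return caption_loss + svo_weight * svo_loss
```

In this sketch the SVO term acts as extra linguistic supervision on the decoder, which is the mechanism the abstract credits for reducing incorrect words and syntax; the 0.5 weight is an assumed hyperparameter.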
Pages: 52-58
Page count: 6