A CASCADED LONG SHORT-TERM MEMORY (LSTM) DRIVEN GENERIC VISUAL QUESTION ANSWERING (VQA)

被引:0
|
作者
Chowdhury, Iqbal [1 ]
Kien Nguyen [1 ]
Fookes, Clinton [1 ]
Sridharan, Sridha [1 ]
机构
[1] Queensland Univ Technol, Brisbane, Qld, Australia
关键词
Visual Question Answering (VQA); Long Short-term Memory (LSTM); scene understanding;
D O I
暂无
中图分类号
TB8 [摄影技术];
学科分类号
0804 ;
摘要
A cascaded long short-term memory (LSTM) architecture with discriminant feature learning is proposed for the task of question answering on real world images. The proposed LSTM architecture jointly learns visual features and parts of speech (POS) tags of question words or tokens. Also, dimensionality of deep visual features is reduced by applying Principal Component Analysis (PCA) technique. In this manner, the proposed question answering model captures the generic pattern of question for a given context of image which is just not constricted within the training dataset. Empirical outcome shows that this kind of approach significantly improves the accuracy. It is believed that this kind of generic learning is a step towards a real-world visual question answering (VQA) system which will perform well for all possible forms of open-ended natural language queries.
引用
收藏
页码:1842 / 1846
页数:5
相关论文
共 50 条
  • [1] A Long Short-Term Memory Model for Answer Sentence Selection in Question Answering
    Wang, Di
    Nyberg, Eric
    [J]. PROCEEDINGS OF THE 53RD ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL) AND THE 7TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (IJCNLP), VOL 2, 2015, : 707 - 712
  • [2] Non-Factoid Answer Selection in Indonesian Science Question Answering System using Long Short-Term Memory (LSTM)
    Hanifah, Alfi Fauzia
    Kusumaningrum, Retno
    [J]. 5TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND COMPUTATIONAL INTELLIGENCE 2020, 2021, 179 : 736 - 746
  • [3] Learning Term Weight with Long Short-Term Memory for Question Retrieval
    Huang, Xifeng
    Dai, Xiang
    [J]. CHINESE LEXICAL SEMANTICS, CLSW 2018, 2018, 11173 : 615 - 622
  • [4] Bi-directional Long Short-Term Memory Model with Semantic Positional Attention for the Question Answering System
    Bi, Mingwen
    Zhang, Qingchuan
    Zuo, Min
    Xu, Zelong
    Jin, Qingyu
    [J]. ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2021, 20 (05)
  • [5] Niederschlags-Abfluss-Modellierung mit Long Short-Term Memory (LSTM)Rainfall-Runoff modeling with Long Short-Term Memory Networks (LSTM)—an overview
    Frederik Kratzert
    Martin Gauch
    Grey Nearing
    Sepp Hochreiter
    Daniel Klotz
    [J]. Österreichische Wasser- und Abfallwirtschaft, 2021, 73 (7-8) : 270 - 280
  • [6] Intrusion detection systems using long short-term memory (LSTM)
    FatimaEzzahra Laghrissi
    Samira Douzi
    Khadija Douzi
    Badr Hssina
    [J]. Journal of Big Data, 8
  • [7] STOCK MARKET PREDICTION USING LONG SHORT-TERM MEMORY (LSTM)
    Abu Nadif, Mohammad
    Samin, Towhidur Rahman
    Islam, Tohedul
    [J]. 2022 SECOND INTERNATIONAL CONFERENCE ON ADVANCES IN ELECTRICAL, COMPUTING, COMMUNICATION AND SUSTAINABLE TECHNOLOGIES (ICAECT), 2022,
  • [8] Intrusion detection systems using long short-term memory (LSTM)
    Laghrissi, FatimaEzzahra
    Douzi, Samira
    Douzi, Khadija
    Hssina, Badr
    [J]. JOURNAL OF BIG DATA, 2021, 8 (01)
  • [9] Lane Position Detection Based on Long Short-Term Memory (LSTM)
    Yang, Wei
    Zhang, Xiang
    Lei, Qian
    Shen, Dengye
    Xiao, Ping
    Huang, Yu
    [J]. SENSORS, 2020, 20 (11)
  • [10] Detecting Android malware using Long Short-term Memory (LSTM)
    Vinayakumar, R.
    Soman, K. P.
    Poornachandran, Prabaharan
    Kumar, S. Sachin
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2018, 34 (03) : 1277 - 1288