Layout-Aware Dreamer for Embodied Referring Expression Grounding

被引:0
|
作者
Li, Mingxiao [1 ]
Wang, Zehao [2 ]
Tuytelaars, Tinne [2 ]
Moens, Marie-Francine [1 ]
机构
[1] Katholieke Univ Leuven, Comp Sci Dept, Leuven, Belgium
[2] Katholieke Univ Leuven, Elect Engn Dept ESAT PSI, Leuven, Belgium
基金
欧洲研究理事会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this work, we study the problem of Embodied Referring Expression Grounding, where an agent needs to navigate in a previously unseen environment and to localize a remote object described by a concise high-level natural language instruction. When facing such a situation, a human tends to imagine what the destination may look like and to explore the environment based on prior knowledge of the environmental layout, such as the fact that a bathroom is more likely to be found near a bedroom than a kitchen. We have de-signed an autonomous agent called Layout-aware Dreamer (LAD), including two novel modules, that is, the Layout Learner and the Goal Dreamer to mimic this cognitive decision process. The Layout Learner learns to infer the room category distribution of neighboring unexplored areas along the path for coarse layout estimation, which effectively introduces layout common sense of room-to-room transitions to our agent. To learn an effective exploration of the environment, the Goal Dreamer imagines the destination before-hand. Our agent achieves new state-of-the-art performance on the public leaderboard of the REVERIE dataset in challenging unseen test environments with improvement in navigation success (SR) by 4.02% and remote grounding success (RGS) by 3.43% compared to the previous state-of-the-art. The code is released at https://github.com/zehao-wang/LAD
引用
收藏
页码:1386 / 1395
页数:10
相关论文
共 50 条
  • [1] PATRON: Perspective-Aware Multitask Model for Referring Expression Grounding Using Embodied Multimodal Cues
    Islam, Md Mofijul
    Gladstone, Alexi
    Iqbal, Tariq
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 1, 2023, : 971 - 979
  • [2] Layout-aware synthesis of arithmetic circuits
    Um, J
    Kim, T
    39TH DESIGN AUTOMATION CONFERENCE, PROCEEDINGS 2002, 2002, : 207 - 212
  • [3] Layout-Aware Optimization of STT MRAMs
    Gupta, Sumeet Kumar
    Park, Sang Phill
    Mojumder, Niladri Narayan
    Roy, Kaushik
    DESIGN, AUTOMATION & TEST IN EUROPE (DATE 2012), 2012, : 1455 - 1458
  • [4] Layout-Aware Embedding for Quantum Annealing Processors
    Pinilla, Jose P.
    Wilton, Steven J. E.
    HIGH PERFORMANCE COMPUTING, ISC HIGH PERFORMANCE 2019, 2019, 11501 : 121 - 139
  • [5] Layout-aware Signal Selection in Reconfigurable Architectures
    Thakyal, Prateek
    Mishra, Prabhat
    18TH INTERNATIONAL SYMPOSIUM ON VLSI DESIGN AND TEST, 2014,
  • [6] A layout-aware synthesis methodology for RF circuits
    Vancorenland, P
    Van der Plas, G
    Steyaert, M
    Gielen, G
    Sansen, W
    ICCAD 2001: IEEE/ACM INTERNATIONAL CONFERENCE ON COMPUTER AIDED DESIGN, DIGEST OF TECHNICAL PAPERS, 2001, : 358 - 362
  • [7] Layout-aware gate duplication and buffer insertion
    Baneres, D.
    Cortadella, J.
    Kishinevsky, A.
    2007 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION, VOLS 1-3, 2007, : 1367 - +
  • [8] On the suitability and development of layout templates for analog layout reuse and layout-aware synthesis
    Castro-López, R
    Fernández, FV
    Vázquez, AR
    VLSI CIRCUITS AND SYSTEMS II, PTS 1 AND 2, 2005, 5837 : 661 - 672
  • [9] Object-aware navigation for remote embodied visual referring expression
    Zhan, Zhaohuan
    Lin, Liang
    Tan, Guang
    NEUROCOMPUTING, 2023, 515 : 68 - 78
  • [10] Incremental Layout-Aware Analog Design Methodology
    Elshawy, Mohannad
    Dessouky, Mohamed
    2015 IEEE CONFERENCE ON ELECTRONICS, CIRCUITS, AND SYSTEMS (ICECS), 2015, : 486 - 489