WinoGAViL: Gamified Association Benchmark to Challenge Vision-and-Language Models

Cited by: 0
Author affiliations:
The Hebrew University of Jerusalem, Israel [1]
Unknown [2]
Unknown [3]
Source: arXiv
DOI: not available
Related papers
50 in total
  • [1] WinoGAViL: Gamified Association Benchmark to Challenge Vision-and-Language Models
    Bitton, Yonatan
    Bitton-Guetta, Nitzan
    Yosef, Ron
    Elovici, Yuval
    Bansal, Mohit
    Stanovsky, Gabriel
    Schwartz, Roy
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022
  • [2] CLiMB: A Continual Learning Benchmark for Vision-and-Language Tasks
    Srinivasan, Tejas
    Chang, Ting-Yun
    Alva, Leticia Pinto
    Chochlakis, Georgios
    Rostami, Mohammad
    Thomason, Jesse
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022
  • [3] Kiki or Bouba? Sound Symbolism in Vision-and-Language Models
    Alper, Morris
    Averbuch-Elor, Hadar
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023
  • [4] Speaker-Follower Models for Vision-and-Language Navigation
    Fried, Daniel
    Hu, Ronghang
    Cirik, Volkan
    Rohrbach, Anna
    Andreas, Jacob
    Morency, Louis-Philippe
    Berg-Kirkpatrick, Taylor
    Saenko, Kate
    Klein, Dan
    Darrell, Trevor
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [5] Effect of Visual Extensions on Natural Language Understanding in Vision-and-Language Models
    Iki, Taichi
    Aizawa, Akiko
    [J]. 2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 2189 - 2196
  • [6] NavGPT: Explicit Reasoning in Vision-and-Language Navigation with Large Language Models
    Zhou, Gengze
    Hong, Yicong
    Wu, Qi
    [J]. THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 7, 2024, : 7641 - 7649
  • [7] Breaking Common Sense: WHOOPS! A Vision-and-Language Benchmark of Synthetic and Compositional Images
    Bitton-Guetta, Nitzan
    Bitton, Yonatan
    Hessel, Jack
    Schmidt, Ludwig
    Elovici, Yuval
    Stanovsky, Gabriel
    Schwartz, Roy
    [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 2616 - 2627
  • [8] Tools Identification By On-Board Adaptation of Vision-and-Language Models
    Hu, Jun
    Miller, Phil
    Lomnitz, Michael
    Farkya, Saurabh
    Yilmaz, Emre
    Raghavan, Aswin
    Zhang, David
    Piacentino, Michael
    [J]. THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 21, 2024, : 23799 - 23801
  • [9] Iterative Vision-and-Language Navigation
    Krantz, Jacob
    Banerjee, Shurjo
    Zhu, Wang
    Corso, Jason
    Anderson, Peter
    Lee, Stefan
    Thomason, Jesse
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 14921 - 14930
  • [10] On the Evaluation of Vision-and-Language Navigation Instructions
    Zhao, Ming
    Anderson, Peter
    Jain, Vihan
    Wang, Su
    Ku, Alexander
    Baldridge, Jason
    Ie, Eugene
    [J]. 16TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EACL 2021), 2021, : 1302 - 1316