Zero-Shot Learning for Computer Vision Applications

被引:0
|
作者
Sarma, Sandipan [1 ]
机构
[1] Indian Inst Technol Guwahati, Gauhati, Assam, India
关键词
Zero-shot learning; seed construction; visual-semantic mining; GAN; triplet loss; transformer; CLIP; object recognition; object detection; action recognition; human-object interaction detection; ATTRIBUTE; DATABASE;
D O I
10.1145/3581783.3613435
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Human beings possess the remarkable ability to recognize unseen concepts by integrating their visual perception of known concepts with some high-level descriptions. However, the best-performing deep learning frameworks today are supervised learners that struggle to recognize concepts without training on their labeled visual samples. Zero-shot learning (ZSL) has recently emerged as a solution that mimics humans and leverages multimodal information to transfer knowledge from seen to unseen concepts. This study aims to emphasize the practicality of ZSL, unlocking its potential across four different applications in computer vision, namely - object recognition, object detection, action recognition, and human-object interaction detection. Several task-specific challenges are identified and addressed in the presented research hypotheses. Zero-shot frameworks are proposed to attain state-of-the-art performance, elucidating some future research directions as well.
引用
收藏
页码:9360 / 9364
页数:5
相关论文
共 50 条
  • [1] Research Progress of Zero-Shot Learning Beyond Computer Vision
    Cao, Weipeng
    Zhou, Cong
    Wu, Yuhao
    Ming, Zhong
    Xu, Zhiwu
    Zhang, Jiyong
    [J]. ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2020, PT II, 2020, 12453 : 538 - 551
  • [2] A Survey of Zero-Shot Learning: Settings, Methods, and Applications
    Wang, Wei
    Zheng, Vincent W.
    Yu, Han
    Miao, Chunyan
    [J]. ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2019, 10 (02)
  • [3] Ordinal Zero-Shot Learning
    Huo, Zengwei
    Geng, Xin
    [J]. PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 1916 - 1922
  • [4] Zero-shot causal learning
    Nilforoshan, Hamed
    Moor, Michael
    Roohani, Yusuf
    Chen, Yining
    Surina, Anja
    Yasunaga, Michihiro
    Oblak, Sara
    Leskovec, Jure
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [5] Zero-Shot Kernel Learning
    Zhang, Hongguang
    Koniusz, Piotr
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 7670 - 7679
  • [6] Rebalanced Zero-Shot Learning
    Ye, Zihan
    Yang, Guanyu
    Jin, Xiaobo
    Liu, Youfa
    Huang, Kaizhu
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 4185 - 4198
  • [7] Incremental Zero-Shot Learning
    Wei, Kun
    Deng, Cheng
    Yang, Xu
    Tao, Dacheng
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (12) : 13788 - 13799
  • [8] Active Zero-Shot Learning
    Xie, Sihong
    Wang, Shaoxiong
    Yu, Philip S.
    [J]. CIKM'16: PROCEEDINGS OF THE 2016 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2016, : 1889 - 1892
  • [9] Zero-shot Metric Learning
    Xu, Xinyi
    Cao, Huanhuan
    Yang, Yanhua
    Yang, Erkun
    Deng, Cheng
    [J]. PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 3996 - 4002
  • [10] Spherical Zero-Shot Learning
    Shen, Jiayi
    Xiao, Zehao
    Zhen, Xiantong
    Zhang, Lei
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (02) : 634 - 645