Efficient multi-attribute image classification through context-driven networks

被引:0
|
作者
Banger, Sean [1 ]
Ceresani, Ryan [1 ]
Twedt, Jason [1 ]
机构
[1] Lockheed Martin AI Ctr, King Of Prussia, PA 19406 USA
来源
关键词
image classification; computer vision; deep learning; neural networks; attention; transformers; multitask; learning; visual question answering;
D O I
10.1117/12.2618977
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Performing many simultaneous tasks on a resource-limited device is challenging due to the limited amount of available computational resources. Efficient and universal model architectures are the key to solving this problem. Existing sub-fields of machine learning, such as Multi-Task Learning (MTL), have proven that learning multiple tasks with a single neural network architecture is possible and even has the potential to improve sample efficiency, memory efficiency, and can be less prone to overfitting. In Visual Question Answering (VQA), a model ingests multi-modal input to produce text-based responses in the context of an image. Our proposed architecture merges the MTL and VQA concepts to form TaskNet. TaskNet solves the visual MTL problem using an input task to provide context to the network and guide its attention mechanism towards providing a relevant response. Our approach saves memory without sacrificing performance relative to naively training independent models. TaskNet efficiently provides multiple fine-grained classifications on a single input image and seamlessly incorporates context-specific metadata to further boost performance in a world of high variance.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] Context-driven hybrid image inpainting
    Cai, Lu
    Kim, Taewhan
    [J]. IET IMAGE PROCESSING, 2015, 9 (10) : 866 - 873
  • [2] COCCI: Context-Driven Clothing Classification Network
    Jiang, Minghua
    Liu, Shuqing
    Shi, Yankang
    Du, Chenghu
    Tang, Guangyu
    Liu, Li
    Peng, Tao
    Hu, Xinrong
    Yu, Feng
    [J]. ADVANCES IN COMPUTER GRAPHICS, CGI 2023, PT I, 2024, 14495 : 69 - 80
  • [3] Affective image recognition with multi-attribute knowledge in deep neural networks
    Hao Zhang
    Gaifang Luo
    Yingying Yue
    Kangjian He
    Dan Xu
    [J]. Multimedia Tools and Applications, 2024, 83 : 18353 - 18379
  • [4] Affective image recognition with multi-attribute knowledge in deep neural networks
    Zhang, Hao
    Luo, Gaifang
    Yue, Yingying
    He, Kangjian
    Xu, Dan
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (06) : 18353 - 18379
  • [5] Efficient Multi-Attribute Query Processing in Heterogeneous Wireless Sensor Networks
    Zhou, ZhangBing
    Zhao, Deng
    Shu, Lei
    Chao, Han-Chieh
    [J]. JOURNAL OF INTERNET TECHNOLOGY, 2014, 15 (05): : 699 - 712
  • [6] Efficient multi-attribute pattern matching
    Ando, K
    Mizobuchi, S
    Shishibori, M
    Aoe, J
    [J]. INTERNATIONAL JOURNAL OF COMPUTER MATHEMATICS, 1998, 66 (1-2) : 21 - 38
  • [7] DECISION FUSION BASED ON EXTENDED MULTI-ATTRIBUTE PROFILES FOR HYPERSPECTRAL IMAGE CLASSIFICATION
    Song, Benqin
    Lie, Jun
    Li, Peijun
    Plaza, Antonio
    [J]. 2013 5TH WORKSHOP ON HYPERSPECTRAL IMAGE AND SIGNAL PROCESSING: EVOLUTION IN REMOTE SENSING (WHISPERS), 2013,
  • [8] Hyperspectral Image Classification by Combination of Active Learning and Extended Multi-Attribute Profile
    Li, Changli
    Zhang, Lin
    [J]. 2016 2ND IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATIONS (ICCC), 2016, : 541 - 544
  • [9] A context-driven Bayesian classification method for eye location
    Koh, Eun Jin
    Nam, Mi Young
    Rhee, Phil Kyu
    [J]. ADAPTIVE AND NATURAL COMPUTING ALGORITHMS, PT 2, 2007, 4432 : 517 - +
  • [10] Multi-Attribute Decision Fusion for Pattern Classification
    Awogbami, Gabriel
    Agana, Norbert
    Homaifar, Abdollah
    [J]. IEEE SOUTHEASTCON 2018, 2018,