Batch construction and multitask learning in visual relationship recognition

被引:0
|
作者
Josias, Shane [1 ,2 ]
Brink, Willie [1 ]
机构
[1] Stellenbosch Univ, Appl Math, Stellenbosch, South Africa
[2] Stellenbosch Univ, CAIR, Stellenbosch, South Africa
关键词
visual relationship recognition; batch construction; multitask learning;
D O I
10.1109/saupec/robmech/prasa48453.2020.9041144
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
An image can be described by the objects within it, as well as interactions between those objects. A pair of object labels together with an interaction label is known as a visual relationship, and is represented as a triplet of the form (subject, predicate, object). Recognising visual relationships in a given image is a challenging task, owing to the combinatorially large number of possible relationship triplets, which leads to an extreme classification problem, as well as a very long tail found typically in the distribution of those possible triplets. We investigate the effects of three strategies that could potentially address these issues. Firstly, instead of predicting the full triplet we opt to predict each element separately. Secondly, we investigate the use of shared network parameters to perform these separate predictions in a multitask setting. Thirdly, we consider a class selective batch construction strategy to expose the network to more of the many rare classes during mini-batch training. Our experiments demonstrate that batch construction can improve performance on the long tail, possibly at the expense of accuracy on the small number of dominating classes. We also find that a multitask model neither improves nor impedes performance in any significant way, but that its smaller size may be beneficial.
引用
收藏
页码:713 / 718
页数:6
相关论文
共 50 条
  • [21] Improved Depression Recognition Using Attention and Multitask Learning of Gender Recognition
    Liu, Yang
    Lu, Xiaoyong
    Hi, Daimin S.
    Yuan, Jingyi
    Pan, Tao
    An, Haizhen
    2021 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP), 2021, : 57 - 61
  • [22] Semi-Supervised Multitask Learning for Scene Recognition
    Lu, Xiaoqiang
    Li, Xuelong
    Mou, Lichao
    IEEE TRANSACTIONS ON CYBERNETICS, 2015, 45 (09) : 1967 - 1976
  • [23] Multitask Learning with Local Attention for Tibetan Speech Recognition
    Wang, Hui
    Gao, Fei
    Zhao, Yue
    Yang, Li
    Yue, Jianjian
    Ma, Huilin
    COMPLEXITY, 2020, 2020
  • [24] MULTITASK LEARNING AND SYSTEM COMBINATION FOR AUTOMATIC SPEECH RECOGNITION
    Siohan, Olivier
    Rybach, David
    2015 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING (ASRU), 2015, : 589 - 595
  • [25] Retinal Abnormalities Recognition Using Regional Multitask Learning
    Wang, Xin
    Ju, Lie
    Zhao, Xin
    Ge, Zongyuan
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2019, PT I, 2019, 11764 : 30 - 38
  • [26] Robust Visual Tracking With Multitask Joint Dictionary Learning
    Fan, Heng
    Xiang, Jinhai
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2017, 27 (05) : 1018 - 1030
  • [27] Multitask Learning with CTC and Segmental CRF for Speech Recognition
    Lu, Liang
    Kong, Lingpeng
    Dyer, Chris
    Smith, Noah A.
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 954 - 958
  • [28] Sparse Bayesian Multitask Learning for Radar Target Recognition
    Xu, Danlei
    Du, Lan
    Liu, Hongwei
    Luo, Dingli
    2016 CIE INTERNATIONAL CONFERENCE ON RADAR (RADAR), 2016,
  • [29] Differentially Private Federated Learning for Multitask Objective Recognition
    Xie, Renyou
    Li, Chaojie
    Zhou, Xiaojun
    Chen, Hongyang
    Dong, Zhaoyang
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2024, 20 (05) : 7269 - 7281
  • [30] MULTITASK LEARNING FOR FRAME-LEVEL INSTRUMENT RECOGNITION
    Hung, Yun-Ning
    Chen, Yi-An
    Yang, Yi-Hsuan
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 381 - 385