Batch construction and multitask learning in visual relationship recognition

被引:0
|
作者
Josias, Shane [1 ,2 ]
Brink, Willie [1 ]
机构
[1] Stellenbosch Univ, Appl Math, Stellenbosch, South Africa
[2] Stellenbosch Univ, CAIR, Stellenbosch, South Africa
关键词
visual relationship recognition; batch construction; multitask learning;
D O I
10.1109/saupec/robmech/prasa48453.2020.9041144
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
An image can be described by the objects within it, as well as interactions between those objects. A pair of object labels together with an interaction label is known as a visual relationship, and is represented as a triplet of the form (subject, predicate, object). Recognising visual relationships in a given image is a challenging task, owing to the combinatorially large number of possible relationship triplets, which leads to an extreme classification problem, as well as a very long tail found typically in the distribution of those possible triplets. We investigate the effects of three strategies that could potentially address these issues. Firstly, instead of predicting the full triplet we opt to predict each element separately. Secondly, we investigate the use of shared network parameters to perform these separate predictions in a multitask setting. Thirdly, we consider a class selective batch construction strategy to expose the network to more of the many rare classes during mini-batch training. Our experiments demonstrate that batch construction can improve performance on the long tail, possibly at the expense of accuracy on the small number of dominating classes. We also find that a multitask model neither improves nor impedes performance in any significant way, but that its smaller size may be beneficial.
引用
收藏
页码:713 / 718
页数:6
相关论文
共 50 条
  • [41] Hierarchical Multitask Learning for Improved Underwater Recognition on Imbalanced Tasks
    Castro, Filipa
    Costa, Pedro
    Marques, Filipe
    Parente, Manuel
    2020 10TH INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND TECHNOLOGY (ICIST), 2020, : 202 - 208
  • [42] Neural Simile Recognition with Cyclic Multitask Learning and Local Attention
    Zeng, Jiali
    Song, Linfeng
    Su, Jinsong
    Xie, Jun
    Song, Wei
    Luo, Jiebo
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 9515 - 9522
  • [43] Visual Social Relationship Recognition
    Junnan Li
    Yongkang Wong
    Qi Zhao
    Mohan S. Kankanhalli
    International Journal of Computer Vision, 2020, 128 : 1750 - 1764
  • [44] Visual Social Relationship Recognition
    Li, Junnan
    Wong, Yongkang
    Zhao, Qi
    Kankanhalli, Mohan S.
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2020, 128 (06) : 1750 - 1764
  • [45] Multitask learning approach for understanding the relationship between two sentences
    Choi, HongSeok
    Lee, Hyunju
    INFORMATION SCIENCES, 2019, 485 : 413 - 426
  • [46] The relationship between visual metaphor comprehension and recognition of similarities in children with learning disabilities
    Mashal, Nira
    Kasirer, Anat
    RESEARCH IN DEVELOPMENTAL DISABILITIES, 2012, 33 (06) : 1741 - 1748
  • [47] A semi-supervised mixture model of visual language multitask for vehicle recognition
    Liu, Wenjin
    Zhang, Shudong
    Zhou, Lijuan
    Luo, Ning
    Xu, Min
    APPLIED SOFT COMPUTING, 2024, 159
  • [48] VLocNet++: Deep Multitask Learning for Semantic Visual Localization and Odometry
    Radwan, Noha
    Valada, Abhinav
    Burgard, Wolfram
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2018, 3 (04): : 4407 - 4414
  • [49] JS']JSUM: A Multitask Learning Speech Recognition Model for Jointly Supervised and Unsupervised Learning
    Yolwas, Nurmemet
    Meng, Weijing
    APPLIED SCIENCES-BASEL, 2023, 13 (09):
  • [50] Depression Severity Level Classification Using Multitask Learning of Gender Recognition
    Liu, Yang
    Lu, Xiaoyong
    Shi, Daimin
    Yuan, Jingyi
    2021 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2021, : 1317 - 1322