Large-Scale Coarse-to-Fine Object Retrieval Ontology and Deep Local Multitask Learning

被引:4
|
作者
Ly, Ngoc Q. [1 ]
Do, Tuong K. [2 ]
Nguyen, Binh X. [1 ]
机构
[1] VNUHCM Univ Sci, Dept Informat Technol, Hcm 70000, Vietnam
[2] AIOZ Pte Ltd, Hcm 70000, Vietnam
关键词
45;
D O I
10.1155/2019/1483294
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Object retrieval plays an increasingly important role in video surveillance, digital marketing, e-commerce, etc. It is facing challenges such as large-scale datasets, imbalanced data, viewpoint, cluster background, and fine-grained details (attributes). This paper has proposed a model to integrate object ontology, a local multitask deep neural network (local MDNN), and an imbalanced data solver to take advantages and overcome the shortcomings of deep learning network models to improve the performance of the large-scale object retrieval system from the coarse-grained level (categories) to the fine-grained level (attributes). Our proposed coarse-to-fine object retrieval (CFOR) system can be robust and resistant to the challenges listed above. To the best of our knowledge, the new main point of our CFOR system is the power of mutual support of object ontology, a local MDNN, and an imbalanced data solver in a unified system. Object ontology supports the exploitation of the inner-group correlations to improve the system performance in category classification, attribute classification, and conducting training flow and retrieval flow to save computational costs in the training stage and retrieval stage on large-scale datasets, respectively. A local MDNN supports linking object ontology to the raw data, and an imbalanced data solver based on Matthews' correlation coefficient (MCC) addresses that the imbalance of data has contributed effectively to increasing the quality of object ontology realization without adjusting network architecture and data augmentation. In order to evaluate the performance of the CFOR system, we experimented on the DeepFashion dataset. This paper has shown that our local MDNN framework based on the pretrained NASNet architecture has achieved better performance (14.2% higher in recall rate) compared to single-task learning (STL) in the attribute learning task; it has also shown that our model with an imbalanced data solver has achieved better performance (5.14% higher in recall rate for fewer data attributes) compared to models that do not take this into account. Moreover, MAP@30 hovers 0.815 in retrieval on an average of 35 imbalanced fashion attributes.
引用
收藏
页数:40
相关论文
共 50 条
  • [1] Coarse-to-Fine Deep Metric Learning for Remote Sensing Image Retrieval
    Yun, Min-Sub
    Nam, Woo-Jeoung
    Lee, Seong-Whan
    REMOTE SENSING, 2020, 12 (02)
  • [2] Learning multi-layer coarse-to-fine representations for large-scale image classification
    Zhang, Ji
    Mei, Kuizhi
    Zheng, Yu
    Fan, Jianping
    PATTERN RECOGNITION, 2019, 91 : 175 - 189
  • [3] Coarse-to-Fine Lane Boundary Extraction for Large-Scale HD Mapping
    Li, Tianyi
    Lai, Chuanbin
    Chai, Xun
    Shen, Lixia
    Wu, Yong
    2022 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2022, : 119 - 126
  • [4] Coarse-to-Fine: Progressive Knowledge Transfer-Based Multitask Convolutional Neural Network for Intelligent Large-Scale Fault Diagnosis
    Wang, Yu
    Liu, Ruonan
    Lin, Di
    Chen, Dongyue
    Li, Ping
    Hu, Qinghua
    Chen, C. L. Philip
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (02) : 761 - 774
  • [5] Is coarse-to-fine tuning in object recognition one of size or scale?
    Fiser, J.
    Subramaniam, S.
    Biederman, I.
    PERCEPTION, 1996, 25 : 49 - 49
  • [6] Dynamic Coarse-to-Fine Learning for Oriented Tiny Object Detection
    Xu, Chang
    Ding, Jian
    Wang, Jinwang
    Yang, Wen
    Yu, Huai
    Yu, Lei
    Xia, Gui-Song
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 7318 - 7328
  • [7] Pattern Retrieval in Large Image Databases Using Multiscale Coarse-to-Fine Cascaded Active Learning
    Blanchart, Pierre
    Ferecatu, Marin
    Cui, Shiyong
    Datcu, Mihai
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2014, 7 (04) : 1127 - 1141
  • [8] Deep Large-Scale Multitask Learning Network for Gene Expression Inference
    Dizaji, Kamran Ghasedi
    Chen, Wei
    Huang, Heng
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2021, 28 (05) : 485 - 500
  • [9] Adaptive Coarse-to-Fine Interactor for Multi-Scale Object Detection
    Li, Zekun
    Liu, Yufan
    Li, Bing
    Hu, Weiming
    Zhou, Xue
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [10] Learning Deep Local Features with Multiple Dynamic Attentions for Large-Scale Image Retrieval
    Wu, Hui
    Wang, Min
    Zhou, Wengang
    Li, Houqiang
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 11396 - 11405