Large-Scale Coarse-to-Fine Object Retrieval Ontology and Deep Local Multitask Learning

被引:4
|
作者
Ly, Ngoc Q. [1 ]
Do, Tuong K. [2 ]
Nguyen, Binh X. [1 ]
机构
[1] VNUHCM Univ Sci, Dept Informat Technol, Hcm 70000, Vietnam
[2] AIOZ Pte Ltd, Hcm 70000, Vietnam
关键词
45;
D O I
10.1155/2019/1483294
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Object retrieval plays an increasingly important role in video surveillance, digital marketing, e-commerce, etc. It is facing challenges such as large-scale datasets, imbalanced data, viewpoint, cluster background, and fine-grained details (attributes). This paper has proposed a model to integrate object ontology, a local multitask deep neural network (local MDNN), and an imbalanced data solver to take advantages and overcome the shortcomings of deep learning network models to improve the performance of the large-scale object retrieval system from the coarse-grained level (categories) to the fine-grained level (attributes). Our proposed coarse-to-fine object retrieval (CFOR) system can be robust and resistant to the challenges listed above. To the best of our knowledge, the new main point of our CFOR system is the power of mutual support of object ontology, a local MDNN, and an imbalanced data solver in a unified system. Object ontology supports the exploitation of the inner-group correlations to improve the system performance in category classification, attribute classification, and conducting training flow and retrieval flow to save computational costs in the training stage and retrieval stage on large-scale datasets, respectively. A local MDNN supports linking object ontology to the raw data, and an imbalanced data solver based on Matthews' correlation coefficient (MCC) addresses that the imbalance of data has contributed effectively to increasing the quality of object ontology realization without adjusting network architecture and data augmentation. In order to evaluate the performance of the CFOR system, we experimented on the DeepFashion dataset. This paper has shown that our local MDNN framework based on the pretrained NASNet architecture has achieved better performance (14.2% higher in recall rate) compared to single-task learning (STL) in the attribute learning task; it has also shown that our model with an imbalanced data solver has achieved better performance (5.14% higher in recall rate for fewer data attributes) compared to models that do not take this into account. Moreover, MAP@30 hovers 0.815 in retrieval on an average of 35 imbalanced fashion attributes.
引用
收藏
页数:40
相关论文
共 50 条
  • [41] CASCADED ACTIVE LEARNING FOR OBJECT RETRIEVAL USING MULTISCALE COARSE TO FINE ANALYSIS
    Blanchart, Pierre
    Ferecatu, Marin
    Datcu, Mihai
    2011 18TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2011,
  • [42] Large-scale Deep Learning at Baidu
    Yu, Kai
    PROCEEDINGS OF THE 22ND ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM'13), 2013, : 2211 - 2211
  • [43] Coarse-to-fine textures retrieval in the JPEG 2000 compressed domain for fast browsing of large image databases
    Descampe, Antonin
    Vandergheynst, Pierre
    De Vleeschouwer, Christophe
    Macq, Benoit
    MULTIMEDIA CONTENT REPRESENTATION, CLASSIFICATION AND SECURITY, 2006, 4105 : 282 - 289
  • [44] Coarse-to-fine evolutionary search for large-scale multi-objective optimization: An application to ratio error estimation of voltage transformers
    Li, Jun
    Zou, Kai
    Xing, Lining
    FRONTIERS IN ENERGY RESEARCH, 2022, 10
  • [45] Deep Product Quantization for Large-Scale Image Retrieval
    Zhai, Qi
    Jiang, Mingyan
    2019 4TH IEEE INTERNATIONAL CONFERENCE ON BIG DATA ANALYTICS (ICBDA 2019), 2019, : 198 - 202
  • [46] Cascaded Deep Hashing for Large-Scale Image Retrieval
    Lu, Jun
    Zhang, Li
    NEURAL INFORMATION PROCESSING (ICONIP 2018), PT VI, 2018, 11306 : 419 - 429
  • [47] Efficient Representation of Local Geometry for Large Scale Object Retrieval
    Perd'och, Michal
    Chum, Ondrej
    Matas, Jiri
    CVPR: 2009 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-4, 2009, : 9 - 16
  • [48] FAST AND COMPACT VISUAL CODEBOOK FOR LARGE-SCALE OBJECT RETRIEVAL
    Cen, Shusheng
    Dong, Yuan
    Bai, Hongliang
    Huang, Chong
    2013 5TH IEEE INTERNATIONAL CONFERENCE ON BROADBAND NETWORK & MULTIMEDIA TECHNOLOGY (IC-BNMT), 2013, : 35 - 38
  • [49] Large-Scale Online Multitask Learning and Decision Making for Flexible Manufacturing
    Wang, JunPing
    Sun, YunChuan
    Zhang, WenSheng
    Thomas, Ian
    Duan, ShiHui
    Shi, YouKang
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2016, 12 (06) : 2139 - 2147
  • [50] Large-scale semantic web image retrieval using bimodal deep learning techniques
    Huang, Changqin
    Xu, Haijiao
    Xie, Liang
    Zhu, Jia
    Xu, Chunyan
    Tang, Yong
    INFORMATION SCIENCES, 2018, 430 : 331 - 348