Jointly Learning Visually Correlated Dictionaries for Large-Scale Visual Recognition Applications

被引:50
|
作者
Zhou, Ning [1 ]
Fan, Jianping [1 ]
机构
[1] Univ N Carolina, Dept Comp Sci, Charlotte, NC 28223 USA
基金
美国国家科学基金会;
关键词
Joint dictionary learning; common visual atoms; category-specific visual atoms; visual tree; large-scale visual recognition; IMAGE; CLASSIFICATION;
D O I
10.1109/TPAMI.2013.189
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Learning discriminative dictionaries for image content representation plays a critical role in visual recognition. In this paper, we present a joint dictionary learning (JDL) algorithm which exploits the inter-category visual correlations to learn more discriminative dictionaries. Given a group of visually correlated categories, JDL simultaneously learns one common dictionary and multiple category-specific dictionaries to explicitly separate the shared visual atoms from the category-specific ones. The problem of JDL is formulated as a joint optimization with a discrimination promotion term according to the Fisher discrimination criterion. A visual tree method is developed to cluster a large number of categories into a set of disjoint groups, so that each of them contains a reasonable number of visually correlated categories. The process of image category clustering helps JDL to learn better dictionaries for classification by ensuring that the categories in the same group are of strong visual correlations. Also, it makes JDL to be computationally affordable in large-scale applications. Three classification schemes are adopted to make full use of the dictionaries learned by JDL for visual content representation in the task of image categorization. The effectiveness of the proposed algorithms has been evaluated using two image databases containing 17 and 1,000 categories, respectively.
引用
收藏
页码:715 / 730
页数:16
相关论文
共 50 条
  • [1] Fast Learning Discriminative Dictionaries for Large-scale Visual Recognition
    Zhao, Tianyi
    Qu, Yanyun
    Fan, Jianping
    [J]. 2015 IEEE 17TH INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2015,
  • [2] Discriminative Learning of Relaxed Hierarchy for Large-scale Visual Recognition
    Gao, Tianshi
    Koller, Daphne
    [J]. 2011 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2011, : 2072 - 2079
  • [3] Three Guidelines of Online Learning for Large-Scale Visual Recognition
    Ushiku, Yoshitaka
    Hidaka, Masatoshi
    Harada, Tatsuya
    [J]. 2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 3574 - 3581
  • [4] Large-Scale Visual Speech Recognition
    Shillingford, Brendan
    Assael, Yannis
    Hoffman, Matthew W.
    Paine, Thomas
    Hughes, Cian
    Prabhu, Utsav
    Liao, Hank
    Sak, Hasim
    Rao, Kanishka
    Bennett, Lorrayne
    Mulville, Marie
    Denil, Misha
    Coppin, Ben
    Laurie, Ben
    Senior, Andrew
    de Freitas, Nando
    [J]. INTERSPEECH 2019, 2019, : 4135 - 4139
  • [5] Large-Scale Visual Font Recognition
    Chen, Guang
    Yang, Jianchao
    Jin, Hailin
    Brandt, Jonathan
    Shechtman, Eli
    Agarwala, Aseem
    Han, Tony X.
    [J]. 2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 3598 - 3605
  • [6] Large-scale Pollen Recognition with Deep Learning
    de Geus, Andre R.
    Barcelos, Celia A. Z.
    Batista, Marcos A.
    da Silva, Sergio F.
    [J]. 2019 27TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2019,
  • [7] Sparse Output Coding for Large-Scale Visual Recognition
    Zhao, Bin
    Xing, Eric P.
    [J]. 2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, : 3350 - 3357
  • [8] Embedding Visual Hierarchy With Deep Networks for Large-Scale Visual Recognition
    Zhao, Tianyi
    Zhang, Baopeng
    He, Ming
    Zhang, Wei
    Zhou, Ning
    Yu, Jun
    Fan, Jianping
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (10) : 4740 - 4755
  • [9] Improvement of the AlexNet Networks for Large-Scale Recognition Applications
    Wu, Zixian
    He, Shuping
    [J]. IRANIAN JOURNAL OF SCIENCE AND TECHNOLOGY-TRANSACTIONS OF ELECTRICAL ENGINEERING, 2021, 45 (02) : 493 - 503
  • [10] Improvement of the AlexNet Networks for Large-Scale Recognition Applications
    Zixian Wu
    Shuping He
    [J]. Iranian Journal of Science and Technology, Transactions of Electrical Engineering, 2021, 45 : 493 - 503