Large-Scale Visual Font Recognition

被引:22
|
作者
Chen, Guang [2 ]
Yang, Jianchao [1 ]
Jin, Hailin [1 ]
Brandt, Jonathan [1 ]
Shechtman, Eli [1 ]
Agarwala, Aseem [1 ]
Han, Tony X. [2 ]
机构
[1] Adobe Res, San Jose, CA USA
[2] Univ Missouri, Columbia, MO 65211 USA
关键词
D O I
10.1109/CVPR.2014.460
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper addresses the large-scale visual font recognition (VFR) problem, which aims at automatic identification of the typeface, weight, and slope of the text in an image or photo without any knowledge of content. Although visual font recognition has many practical applications, it has largely been neglected by the vision community. To address the VFR problem, we construct a large-scale dataset containing 2,420 font classes, which easily exceeds the scale of most image categorization datasets in computer vision. As font recognition is inherently dynamic and open-ended, i.e., new classes and data for existing categories are constantly added to the database over time, we propose a scalable solution based on the nearest class mean classifier (NCM). The core algorithm is built on local feature embedding, local feature metric learning and max-margin template selection, which is naturally amenable to NCM and thus to such open-ended classification problems. The new algorithm can generalize to new classes and new data at little added cost. Extensive experiments demonstrate that our approach is very effective on our synthetic test images, and achieves promising results on real world test images.
引用
收藏
页码:3598 / 3605
页数:8
相关论文
共 50 条
  • [41] Decoupling Sparse Coding with Fusion of Fisher Vectors and Scalable SVMs for Large-scale Visual Recognition
    Ji, Zhengping
    [J]. 2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2013, : 450 - 457
  • [42] Integrating multi-level deep learning and concept ontology for large-scale visual recognition
    Kuang, Zhenzhong
    Yu, Jun
    Li, Zongmin
    Zhang, Baopeng
    Fan, Jianping
    [J]. PATTERN RECOGNITION, 2018, 78 : 198 - 214
  • [43] Dimensionality Reduction using Compressed Sensing and its Application to a Large-Scale Visual Recognition Task
    Yang, Jie
    Bouzerdoum, Abdesselam
    Tivive, Fok Hing Chi
    Phung, Son Lam
    [J]. 2010 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS IJCNN 2010, 2010,
  • [44] Plant Disease Recognition: A Large-Scale Benchmark Dataset and a Visual Region and Loss Reweighting Approach
    Liu, Xinda
    Min, Weiqing
    Mei, Shuhuan
    Wang, Lili
    Jiang, Shuqiang
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 2003 - 2015
  • [45] Visual Representation Learning for Automating Car Part Recognition in a Large-scale Car Sharing Platform
    Park, Kyung Ho
    Kwon, Yunhwan
    Song, Youngin
    Byeon, Seongyun
    [J]. 2021 IEEE 17TH INTERNATIONAL CONFERENCE ON AUTOMATION SCIENCE AND ENGINEERING (CASE), 2021, : 1104 - 1110
  • [46] Visual Place Recognition in Long-term and Large-scale Environment based on CNN Feature
    Zhu, Jianliang
    Ai, Yunfeng
    Tian, Bin
    Cao, Dongpu
    Scherer, Sebastian
    [J]. 2018 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2018, : 1679 - 1685
  • [47] Large-Scale Gaussian Process Inference with Generalized Histogram Intersection Kernels for Visual Recognition Tasks
    Rodner, Erik
    Freytag, Alexander
    Bodesheim, Paul
    Froehlich, Bjoern
    Denzler, Joachim
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2017, 121 (02) : 253 - 280
  • [48] Large-Scale Gaussian Process Inference with Generalized Histogram Intersection Kernels for Visual Recognition Tasks
    Erik Rodner
    Alexander Freytag
    Paul Bodesheim
    Björn Fröhlich
    Joachim Denzler
    [J]. International Journal of Computer Vision, 2017, 121 : 253 - 280
  • [49] Visual Co-occurrence Network: Using Context for Large-Scale Object Recognition in Retail
    Advani, Siddharth
    Smith, Brigid
    Tanabe, Yasuki
    Irick, Kevin
    Cotter, Matthew
    Sampson, Jack
    Narayanan, Vijaykrishnan
    [J]. 2015 13TH IEEE SYMPOSIUM ON EMBEDDED SYSTEMS FOR REAL-TIME MULTIMEDIA, 2015, : 103 - 112
  • [50] FONT FINDER: VISUAL RECOGNITION OF TYPEFACE IN PRINTED DOCUMENTS
    Bui, Tu
    Collomosse, John
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2015, : 3926 - 3930