Large-Scale Visual Font Recognition

被引：22

作者：

Chen, Guang ^{[2
]}

Yang, Jianchao ^{[1
]}

Jin, Hailin ^{[1
]}

Brandt, Jonathan ^{[1
]}

Shechtman, Eli ^{[1
]}

Agarwala, Aseem ^{[1
]}

Han, Tony X. ^{[2
]}

机构：

[1] Adobe Res, San Jose, CA USA

[2] Univ Missouri, Columbia, MO 65211 USA

来源：

2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2014年

关键词：

D O I：

10.1109/CVPR.2014.460

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper addresses the large-scale visual font recognition (VFR) problem, which aims at automatic identification of the typeface, weight, and slope of the text in an image or photo without any knowledge of content. Although visual font recognition has many practical applications, it has largely been neglected by the vision community. To address the VFR problem, we construct a large-scale dataset containing 2,420 font classes, which easily exceeds the scale of most image categorization datasets in computer vision. As font recognition is inherently dynamic and open-ended, i.e., new classes and data for existing categories are constantly added to the database over time, we propose a scalable solution based on the nearest class mean classifier (NCM). The core algorithm is built on local feature embedding, local feature metric learning and max-margin template selection, which is naturally amenable to NCM and thus to such open-ended classification problems. The new algorithm can generalize to new classes and new data at little added cost. Extensive experiments demonstrate that our approach is very effective on our synthetic test images, and achieves promising results on real world test images.

引用

页码：3598 / 3605

页数：8

共 50 条

[41] Decoupling Sparse Coding with Fusion of Fisher Vectors and Scalable SVMs for Large-scale Visual Recognition
Ji, Zhengping
[J]. 2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2013, : 450 - 457
[42] Integrating multi-level deep learning and concept ontology for large-scale visual recognition
Kuang, Zhenzhong
Yu, Jun
Li, Zongmin
Zhang, Baopeng
Fan, Jianping
[J]. PATTERN RECOGNITION, 2018, 78 : 198 - 214
[43] Dimensionality Reduction using Compressed Sensing and its Application to a Large-Scale Visual Recognition Task
Yang, Jie
Bouzerdoum, Abdesselam
Tivive, Fok Hing Chi
Phung, Son Lam
[J]. 2010 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS IJCNN 2010, 2010,
[44] Plant Disease Recognition: A Large-Scale Benchmark Dataset and a Visual Region and Loss Reweighting Approach
Liu, Xinda
Min, Weiqing
Mei, Shuhuan
Wang, Lili
Jiang, Shuqiang
[J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 2003 - 2015
[45] Visual Representation Learning for Automating Car Part Recognition in a Large-scale Car Sharing Platform
Park, Kyung Ho
Kwon, Yunhwan
Song, Youngin
Byeon, Seongyun
[J]. 2021 IEEE 17TH INTERNATIONAL CONFERENCE ON AUTOMATION SCIENCE AND ENGINEERING (CASE), 2021, : 1104 - 1110
[46] Visual Place Recognition in Long-term and Large-scale Environment based on CNN Feature
Zhu, Jianliang
Ai, Yunfeng
Tian, Bin
Cao, Dongpu
Scherer, Sebastian
[J]. 2018 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2018, : 1679 - 1685
[47] Large-Scale Gaussian Process Inference with Generalized Histogram Intersection Kernels for Visual Recognition Tasks
Rodner, Erik
Freytag, Alexander
Bodesheim, Paul
Froehlich, Bjoern
Denzler, Joachim
[J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2017, 121 (02) : 253 - 280
[48] Large-Scale Gaussian Process Inference with Generalized Histogram Intersection Kernels for Visual Recognition Tasks
Erik Rodner
Alexander Freytag
Paul Bodesheim
Björn Fröhlich
Joachim Denzler
[J]. International Journal of Computer Vision, 2017, 121 : 253 - 280
[49] Visual Co-occurrence Network: Using Context for Large-Scale Object Recognition in Retail
Advani, Siddharth
Smith, Brigid
Tanabe, Yasuki
Irick, Kevin
Cotter, Matthew
Sampson, Jack
Narayanan, Vijaykrishnan
[J]. 2015 13TH IEEE SYMPOSIUM ON EMBEDDED SYSTEMS FOR REAL-TIME MULTIMEDIA, 2015, : 103 - 112
[50] FONT FINDER: VISUAL RECOGNITION OF TYPEFACE IN PRINTED DOCUMENTS
Bui, Tu
Collomosse, John
[J]. 2015 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2015, : 3926 - 3930

← 1 2 3 4 5 →