Rosetta: Large Scale System for Text Detection and Recognition in Images

Cited by: 180
Authors
Borisyuk, Fedor [1]
Gordo, Albert [1]
Sivakumar, Viswanath [1]
Affiliations
[1] Facebook Inc, Menlo Pk, CA 94025 USA
Keywords
Optical character recognition; text detection; text recognition
DOI
10.1145/3219819.3219861
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In this paper we present a deployed, scalable optical character recognition (OCR) system, which we call Rosetta, designed to process images uploaded daily at Facebook scale. Sharing of image content has become one of the primary ways to communicate information among internet users within social networks such as Facebook, and the understanding of such media, including its textual information, is of paramount importance to facilitate search and recommendation applications. We present modeling techniques for efficient detection and recognition of text in images and describe Rosetta's system architecture. We perform extensive evaluation of presented technologies, explain useful practical approaches to build an OCR system at scale, and provide insightful intuitions as to why and how certain components work based on the lessons learnt during the development and deployment of the system.
Pages: 71-79
Page count: 9