Rosetta: Large Scale System for Text Detection and Recognition in Images

Cited by: 197
Authors
Borisyuk, Fedor [1]
Gordo, Albert [1]
Sivakumar, Viswanath [1]
Affiliations
[1] Facebook Inc., Menlo Park, CA 94025, USA
Keywords
Optical character recognition; text detection; text recognition
DOI
10.1145/3219819.3219861
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In this paper we present a deployed, scalable optical character recognition (OCR) system, which we call Rosetta, designed to process images uploaded daily at Facebook scale. Sharing of image content has become one of the primary ways to communicate information among internet users within social networks such as Facebook, and the understanding of such media, including its textual information, is of paramount importance to facilitate search and recommendation applications. We present modeling techniques for efficient detection and recognition of text in images and describe Rosetta's system architecture. We perform extensive evaluation of presented technologies, explain useful practical approaches to build an OCR system at scale, and provide insightful intuitions as to why and how certain components work based on the lessons learnt during the development and deployment of the system.
Pages: 71-79
Number of pages: 9