Rosetta: Large Scale System for Text Detection and Recognition in Images

Cited by: 197
Authors
Borisyuk, Fedor [1]
Gordo, Albert [1]
Sivakumar, Viswanath [1]
Affiliations
[1] Facebook Inc, Menlo Pk, CA 94025 USA
Keywords
Optical character recognition; text detection; text recognition
DOI
10.1145/3219819.3219861
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In this paper we present a deployed, scalable optical character recognition (OCR) system, which we call Rosetta, designed to process images uploaded daily at Facebook scale. Sharing of image content has become one of the primary ways to communicate information among internet users within social networks such as Facebook, and the understanding of such media, including its textual information, is of paramount importance to facilitate search and recommendation applications. We present modeling techniques for efficient detection and recognition of text in images and describe Rosetta's system architecture. We perform extensive evaluation of presented technologies, explain useful practical approaches to build an OCR system at scale, and provide insightful intuitions as to why and how certain components work based on the lessons learnt during the development and deployment of the system.
Pages: 71-79
Number of pages: 9
Related papers
50 in total
  • [21] A Database for Urdu Text Detection and Recognition in Natural Scene Images
    Chandio, Asghar Ali
    Leghari, Mehwish
    Memon, Mukhtiar Ahmed
    Leghari, Mehjabeen
    Jalbani, Akhtar Hussain
    MEHRAN UNIVERSITY RESEARCH JOURNAL OF ENGINEERING AND TECHNOLOGY, 2020, 39 (01) : 47 - 54
  • [22] Text Flow: A Unified Text Detection System in Natural Scene Images
    Tian, Shangxuan
    Pan, Yifeng
    Huang, Chang
    Lu, Shijian
    Yu, Kai
    Tan, Chew Lim
    2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015: 4651 - 4659
  • [23] Text Recognition from Images
    Manwatkar, Pratik Madhukar
    Yadav, Shashank H.
    2015 INTERNATIONAL CONFERENCE ON INNOVATIONS IN INFORMATION, EMBEDDED AND COMMUNICATION SYSTEMS (ICIIECS), 2015
  • [24] Recognition as translating images into text
    Barnard, K
    Duygulu, P
    Forsyth, D
    INTERNET IMAGING IV, 2003, 5018 : 168 - 178
  • [25] Automated system for text detection in individual video images
    Du, YZ
    Chang, CI
    Thouin, PD
    JOURNAL OF ELECTRONIC IMAGING, 2003, 12 (03) : 410 - 422
  • [26] Text detection, recognition, and script identification in natural scene images: a Review
    Veronica Naosekpam
    Nilkanta Sahu
    International Journal of Multimedia Information Retrieval, 2022, 11 : 291 - 314
  • [27] A Framework of Text Detection and Recognition from Natural Images for Mobile Device
    Selmi, Zied
    Ben Halima, Mohamed
    Wali, Ali
    Alimi, Adel M.
    NINTH INTERNATIONAL CONFERENCE ON MACHINE VISION (ICMV 2016), 2017, 10341
  • [28] Semiautomatic Ground Truth Generation for Text Detection and Recognition in Video Images
    Trung Quy Phan
    Shivakumara, Palaiahnakote
    Bhowmick, Souvik
    Li, Shimiao
    Tan, Chew Lim
    Pal, Umapada
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2014, 24 (08) : 1277 - 1287
  • [29] Text detection, recognition, and script identification in natural scene images: a Review
    Naosekpam, Veronica
    Sahu, Nilkanta
    INTERNATIONAL JOURNAL OF MULTIMEDIA INFORMATION RETRIEVAL, 2022, 11 (03) : 291 - 314
  • [30] Text Detection and Character Recognition in Scene Images with Unsupervised Feature Learning
    Coates, Adam
    Carpenter, Blake
    Case, Carl
    Satheesh, Sanjeev
    Suresh, Bipin
    Wang, Tao
    Wu, David J.
    Ng, Andrew Y.
    11TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR 2011), 2011: 440 - 445