Rosetta: Large Scale System for Text Detection and Recognition in Images

被引：197

作者：

Borisyuk, Fedor ^{[1
]}

Gordo, Albert ^{[1
]}

Sivakumar, Viswanath ^{[1
]}

机构：

[1] Facebook Inc, Menlo Pk, CA 94025 USA

来源：

KDD'18: PROCEEDINGS OF THE 24TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING | 2018年

关键词：

Optical character recognition; text detection; text recognition;

D O I：

10.1145/3219819.3219861

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper we present a deployed, scalable optical character recognition (OCR) system, which we call Rosetta, designed to process images uploaded daily at Facebook scale. Sharing of image content has become one of the primary ways to communicate information among internet users within social networks such as Facebook, and the understanding of such media, including its textual information, is of paramount importance to facilitate search and recommendation applications. We present modeling techniques for efficient detection and recognition of text in images and describe Rosetta's system architecture. We perform extensive evaluation of presented technologies, explain useful practical approaches to build an OCR system at scale, and provide insightful intuitions as to why and how certain components work based on the lessons learnt during the development and deployment of the system.

引用

页码：71 / 79

页数：9

共 50 条

[21] A Database for Urdu Text Detection and Recognition in Natural Scene Images
Chandio, Asghar Ali
Leghari, Mehwish
Memon, Mukhtiar Ahmed
Leghari, Mehjabeen
Jalbani, Akhtar Hussain
MEHRAN UNIVERSITY RESEARCH JOURNAL OF ENGINEERING AND TECHNOLOGY, 2020, 39 (01) : 47 - 54
[22] Text Flow: A Unified Text Detection System in Natural Scene Images
Tian, Shangxuan
Pan, Yifeng
Huang, Chang
Lu, Shijian
Yu, Kai
Tan, Chew Lim
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 4651 - 4659
[23] Text Recognition from Images
Manwatkar, Pratik Madhukar
Yadav, Shashank H.
2015 INTERNATIONAL CONFERENCE ON INNOVATIONS IN INFORMATION, EMBEDDED AND COMMUNICATION SYSTEMS (ICIIECS), 2015,
[24] Recognition as translating images into text
Barnard, K
Duygulu, P
Forsyth, D
INTERNET IMAGING IV, 2003, 5018 : 168 - 178
[25] Automated system for text detection in individual video images
Du, YZ
Chang, CI
Thouin, PD
JOURNAL OF ELECTRONIC IMAGING, 2003, 12 (03) : 410 - 422
[26] Text detection, recognition, and script identification in natural scene images: a Review
Veronica Naosekpam
Nilkanta Sahu
International Journal of Multimedia Information Retrieval, 2022, 11 : 291 - 314
[27] A Framework of Text Detection and Recognition from Natural Images for Mobile Device
Selmi, Zied
Ben Halima, Mohamed
Wali, Ali
Alimi, Adel M.
NINTH INTERNATIONAL CONFERENCE ON MACHINE VISION (ICMV 2016), 2017, 10341
[28] Semiautomatic Ground Truth Generation for Text Detection and Recognition in Video Images
Trung Quy Phan
Shivakumara, Palaiahnakote
Bhowmick, Souvik
Li, Shimiao
Tan, Chew Lim
Pal, Umapada
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2014, 24 (08) : 1277 - 1287
[29] Text detection, recognition, and script identification in natural scene images: a Review
Naosekpam, Veronica
Sahu, Nilkanta
INTERNATIONAL JOURNAL OF MULTIMEDIA INFORMATION RETRIEVAL, 2022, 11 (03) : 291 - 314
[30] Text Detection and Character Recognition in Scene Images with Unsupervised Feature Learning
Coates, Adam
Carpenter, Blake
Case, Carl
Satheesh, Sanjeev
Suresh, Bipin
Wang, Tao
Wu, David J.
Ng, Andrew Y.
11TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR 2011), 2011, : 440 - 445

← 1 2 3 4 5 →