Towards Visual Words to Words: Text Detection with a General Bag of Words Representation

Cited by: 0

Authors:

Mehta, Rakesh [1]

Chum, Ondrej [2]

Matas, Jiri [2]

Affiliations:

[1] Tampere Univ Technol, Dept Signal Proc, FIN-33101 Tampere, Finland

[2] Czech Tech Univ, Fac Elect Engn, Ctr Machine Percept, Dept Cybernet, Prague, Czech Republic

Source:

2015 13TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR) | 2015

Keywords:

IMAGES;

DOI:

Not available

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

We address the problem of text localization and retrieval in real-world images. We are the first to study the retrieval of text images, i.e., the selection of images containing text from large collections at high speed. We propose a novel representation, textual visual words, which describes text by generic visual words that predict the bottom and top lines of text in a geometrically consistent way. The visual words are discretized SIFT descriptors of Hessian features. The features may correspond to various structures present in the text: character fragments, individual characters, or their arrangements. The textual words representation is invariant to affine transformations of the image and to local linear changes of intensity. Experiments demonstrate that the proposed method outperforms the state of the art on the MS dataset. The proposed method detects blurry, small-font, low-contrast, and noisy text in real-world images.
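The discretization step mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes a precomputed vocabulary of centroids, uses toy 2-D vectors in place of real SIFT descriptors, and omits Hessian feature detection and the geometric prediction of text lines entirely. The names `quantize` and `bag_of_words` are hypothetical.

```python
from collections import Counter
from math import dist


def quantize(descriptor, vocabulary):
    """Return the index of the vocabulary centroid nearest to the descriptor."""
    return min(range(len(vocabulary)), key=lambda i: dist(descriptor, vocabulary[i]))


def bag_of_words(descriptors, vocabulary):
    """Count how often each visual word occurs among the descriptors."""
    counts = Counter(quantize(d, vocabulary) for d in descriptors)
    return [counts.get(i, 0) for i in range(len(vocabulary))]


# Toy 2-D vectors standing in for 128-D SIFT descriptors (illustrative values only).
vocabulary = [(0.0, 0.0), (1.0, 1.0), (5.0, 5.0)]
descriptors = [(0.1, 0.2), (0.9, 1.1), (4.8, 5.3), (1.2, 0.8)]

histogram = bag_of_words(descriptors, vocabulary)
print(histogram)  # [1, 2, 1]
```

In a real pipeline the vocabulary would typically be learned by clustering descriptors from training images, and nearest-centroid search would use an approximate index rather than a linear scan.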


Pages: 641-645

Number of pages: 5
