A novel image text extraction method based on k-means clustering

被引:20
|
作者
Song, Yan [1 ]
Liu, Anan [1 ]
Pang, Lin [1 ]
Lin, Shouxun [1 ]
Zhang, Yongdong [1 ]
Tang, Sheng [1 ]
机构
[1] Chinese Acad Sci, Inst Comp Technol, Key Lab Intelligent Informat Proc, Beijing 100080, Peoples R China
关键词
D O I
10.1109/ICIS.2008.31
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Texts in web pages, images and videos contain important clues for information indexing and retrieval. Most existing text extraction methods depend on the language type and text appearance. In this paper, a novel and universal method of image text extraction is proposed A coarse-to-fine text location method is implemented Firstly, a multi-scale approach is adopted to locate texts with different font sizes. Secondly, projection profiles are used in location refinement step. Color-based k-means clustering is adopted in text segmentation. Compared to grayscale image which is used in most existing methods, color image is more suitable for segmentation based on clustering. It treats corner-points, edge-points and other points equally so that it solves the problem of handling multilingual text. It is demonstrated in experimental results that best performance is obtained when k is 3. Comparative experimental results on a large number of images show that our method is accurate and robust in various conditions.
引用
收藏
页码:185 / 190
页数:6
相关论文
共 50 条
  • [41] Korean Text Extraction by Local Color Quantization and K-means Clustering in Natural Scene
    Lai, Anh-Nga
    Park, KeonHee
    Kumar, Manoj
    Lee, GueeSang
    2009 FIRST ASIAN CONFERENCE ON INTELLIGENT INFORMATION AND DATABASE SYSTEMS, 2009, : 138 - 143
  • [42] Image Steganography Method Using K-Means Clustering and Encryption Techniques
    Pillai, Bhagya
    Mounika, Mundra
    Rao, Pooja J.
    Sriram, Padmamala
    2016 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2016, : 1206 - 1211
  • [43] Image Segmentation Using Gabor Filter and K-Means Clustering Method
    Premana, Agyztia
    Wijaya, Akhmad Pandhu
    Soeleman, Moch Arief
    2017 INTERNATIONAL SEMINAR ON APPLICATION FOR TECHNOLOGY OF INFORMATION AND COMMUNICATION (ISEMANTIC), 2017, : 95 - 99
  • [44] Improved K-Means algorithm in text semantic clustering
    Ma, Junhong
    Open Cybernetics and Systemics Journal, 2014, 8 : 530 - 534
  • [45] An improved K-means clustering method for cDNA microarray image segmentation
    Wang, T. N.
    Li, T. J.
    Shao, G. F.
    Wu, S. X.
    GENETICS AND MOLECULAR RESEARCH, 2015, 14 (03) : 7771 - 7781
  • [46] A novel hierarchical K-means clustering algorithm based on entropy
    Tang, Zhihang
    Li, Rongjun
    Journal of Information and Computational Science, 2010, 7 (14): : 3019 - 3026
  • [47] A novel prediction method of complex univariate time series based on k-means clustering
    Liu, Yunxin
    Ding, Shifei
    Jia, Weikuan
    SOFT COMPUTING, 2020, 24 (21) : 16425 - 16437
  • [48] A novel prediction method of complex univariate time series based on k-means clustering
    Yunxin Liu
    Shifei Ding
    Weikuan Jia
    Soft Computing, 2020, 24 : 16425 - 16437
  • [49] A Novel Supervised Multi-model Modeling Method Based on k-means Clustering
    Liu, Linlin
    Zhou, Lifang
    Xie, Shenggang
    2010 CHINESE CONTROL AND DECISION CONFERENCE, VOLS 1-5, 2010, : 684 - 689
  • [50] A Novel K-Means based Clustering Algorithm for Big Data
    Sinha, Ankita
    Jana, Prasanta K.
    2016 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2016, : 1875 - 1879