A novel image text extraction method based on k-means clustering

被引:20
|
作者
Song, Yan [1 ]
Liu, Anan [1 ]
Pang, Lin [1 ]
Lin, Shouxun [1 ]
Zhang, Yongdong [1 ]
Tang, Sheng [1 ]
机构
[1] Chinese Acad Sci, Inst Comp Technol, Key Lab Intelligent Informat Proc, Beijing 100080, Peoples R China
关键词
D O I
10.1109/ICIS.2008.31
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Texts in web pages, images and videos contain important clues for information indexing and retrieval. Most existing text extraction methods depend on the language type and text appearance. In this paper, a novel and universal method of image text extraction is proposed A coarse-to-fine text location method is implemented Firstly, a multi-scale approach is adopted to locate texts with different font sizes. Secondly, projection profiles are used in location refinement step. Color-based k-means clustering is adopted in text segmentation. Compared to grayscale image which is used in most existing methods, color image is more suitable for segmentation based on clustering. It treats corner-points, edge-points and other points equally so that it solves the problem of handling multilingual text. It is demonstrated in experimental results that best performance is obtained when k is 3. Comparative experimental results on a large number of images show that our method is accurate and robust in various conditions.
引用
收藏
页码:185 / 190
页数:6
相关论文
共 50 条
  • [21] A Novel Method for Clustering using k-means and Apriori Algorithm
    Ali, Syed Zishan
    Tiwari, Nikhil
    Sen, Sushmita
    PROCEEDINGS OF THE 2016 IEEE 2ND INTERNATIONAL CONFERENCE ON ADVANCES IN ELECTRICAL & ELECTRONICS, INFORMATION, COMMUNICATION & BIO INFORMATICS (IEEE AEEICB-2016), 2016, : 59 - 62
  • [22] K-Means Clustering Based on Density for Scene Image Classification
    Xie, Ke
    Wu, Jin
    Yang, Wankou
    Sun, Changyin
    PROCEEDINGS OF THE 2015 CHINESE INTELLIGENT AUTOMATION CONFERENCE: INTELLIGENT INFORMATION PROCESSING, 2015, 336 : 379 - 386
  • [23] Hyperspectral Image Classification: A k-means Clustering Based Approach
    Ranjan, Sameer
    Nayak, Deepak Ranjan
    Kumar, Kallepalli Satish
    Dash, Ratnakar
    Majhi, Banshidhar
    2017 4TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING AND COMMUNICATION SYSTEMS (ICACCS), 2017,
  • [24] Comparative Analysis of Multi-scale Wavelet Decomposition and k-Means Clustering Based Text Extraction
    Ghai, Deepika
    Jain, Neelu
    WIRELESS PERSONAL COMMUNICATIONS, 2019, 109 (01) : 455 - 490
  • [25] Distributed Algorithm for Text Documents Clustering Based on k-Means Approach
    Sarnovsky, Martin
    Carnoka, Noema
    INFORMATION SYSTEMS ARCHITECTURE AND TECHNOLOGY, ISAT 2015, PT II, 2016, 430 : 165 - 174
  • [26] An improved K-Means text clustering algorithm based on Local Search
    Liu, Xiangwei
    2008 4TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, NETWORKING AND MOBILE COMPUTING, VOLS 1-31, 2008, : 11578 - 11581
  • [27] Image Matching Algorithm based on ORB and K-means Clustering
    Zhang, Liye
    Cai, Fudong
    Wang, Jinjun
    Lv, Changfeng
    Liu, Wei
    Guo, Guoxin
    Liu, Huanyun
    Xing, Yixin
    2020 5TH INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE, COMPUTER TECHNOLOGY AND TRANSPORTATION (ISCTT 2020), 2020, : 460 - 464
  • [28] Extracting vein of leaf image based on K-means clustering
    Li, F. (ganguli@126.com), 1600, Chinese Society of Agricultural Engineering (28):
  • [29] Image segmentation based on swarm intelligence and K-means clustering
    Zhao, Bo
    Qi, Lixia
    Mao, Enrong
    Song, Zhenghe
    Zhu, Zhongxiang
    Journal of Information and Computational Science, 2007, 4 (03): : 937 - 945
  • [30] K-Means Clustering for Adaptive Wavelet Based Image Denoising
    Agrawal, Utkarsh
    Roy, Soumava Kumar
    Tiwary, U. S.
    Prashanth, D. S.
    2015 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTER ENGINEERING AND APPLICATIONS (ICACEA), 2015, : 134 - 137