A New Approach to Extract Text from Images based on DWT and K-means Clustering

被引:3
|
作者
Ghai, Deepika [1 ]
Gera, Divya [1 ]
Jain, Neelu [1 ]
机构
[1] PEC Univ Technol, ECE Dept, Sect 12, Chandigarh 160012, UT, India
关键词
Text extraction; Texture features; DWT; K-means clustering; sliding window; voting decision; VIDEO; LOCALIZATION;
D O I
10.1080/18756891.2016.1237189
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Text present in image provides important information for automatic annotation, indexing and retrieval. Therefore, its extraction is a well known research area in computer vision. However, variations of text due to differences in orientation, alignment, font, size, low image contrast and complex background make the problem of text extraction extremely challenging. In this paper, we propose a texture-based text extraction method using DWT with K-means clustering. First, the edges are detected from image by using DWT. Then, a small size overlapped sliding window is used to scan high frequency component sub-bands from which texture features of text and non-text regions are extracted. Based on these features, K-means clustering is employed to classify the image into text, simple background and complex background clusters. Finally, voting decision process and area based filtering are used to locate text regions exactly. Experimentation is carried out using public dataset ICDAR 2013 and our own dataset for English, Hindi and Punjabi text images for different number of clusters. The results show that the proposed method gives promising results with different languages in terms of detection rate (DR), precision rate (PR) and recall rate (RR).
引用
收藏
页码:900 / 916
页数:17
相关论文
共 50 条
  • [31] Hyperspectral Image Classification: A k-means Clustering Based Approach
    Ranjan, Sameer
    Nayak, Deepak Ranjan
    Kumar, Kallepalli Satish
    Dash, Ratnakar
    Majhi, Banshidhar
    2017 4TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING AND COMMUNICATION SYSTEMS (ICACCS), 2017,
  • [32] On K-means clustering-based approach for DDBSs design
    Ali A. Amer
    Journal of Big Data, 7
  • [33] On K-means clustering-based approach for DDBSs design
    Amer, Ali A.
    JOURNAL OF BIG DATA, 2020, 7 (01)
  • [34] K-means clustering-based approach for face recognition
    Xie, Yinggang
    Kuang, Jiaoli
    Ye, Nan
    Journal of Information and Computational Science, 2010, 7 (01): : 169 - 175
  • [35] A New Real-Time FPGA-Based Implementation of K-Means Clustering for Images
    Deng, Tiantai
    Crookes, Danny
    Siddiqui, Fahad
    Woods, Roger
    INTELLIGENT COMPUTING AND INTERNET OF THINGS, PT II, 2018, 924 : 468 - 477
  • [36] A k-means based clustering algorithm
    Bloisi, Domenico Daniele
    Locchi, Luca
    COMPUTER VISION SYSTEMS, PROCEEDINGS, 2008, 5008 : 109 - 118
  • [37] Graph based k-means clustering
    Galluccio, Laurent
    Michel, Olivier
    Comon, Pierre
    Hero, Alfred O., III
    SIGNAL PROCESSING, 2012, 92 (09) : 1970 - 1984
  • [38] A New Approach to Robust k-Means Clustering Based on Fuzzy Principal Component Analysis
    Honda, Katsuhiro
    Araki, Hiromichi
    Matsui, Tomohiro
    Ichihashi, Hidetomo
    2008 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-8, 2008, : 208 - 213
  • [39] Inverted Index based Modified Version of K-Means Algorithm for Text Clustering
    Jo, Taeho
    JOURNAL OF INFORMATION PROCESSING SYSTEMS, 2008, 4 (02): : 67 - 76
  • [40] Cloud Computing K-Means Text Clustering Filtering Algorithm based on Hadoop
    Huang Suyu
    Proceedings of the 2016 4th International Conference on Machinery, Materials and Information Technology Applications, 2016, 71 : 1516 - 1521