A New Approach to Extract Text from Images based on DWT and K-means Clustering

Cited by: 3
Authors
Ghai, Deepika [1]
Gera, Divya [1]
Jain, Neelu [1]
Affiliation
[1] PEC Univ Technol, ECE Dept, Sect 12, Chandigarh 160012, UT, India
Keywords
Text extraction; Texture features; DWT; K-means clustering; sliding window; voting decision; VIDEO; LOCALIZATION
DOI
10.1080/18756891.2016.1237189
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Text present in images provides important information for automatic annotation, indexing and retrieval, and its extraction is therefore a well-known research area in computer vision. However, variations in text orientation, alignment, font and size, together with low image contrast and complex backgrounds, make text extraction extremely challenging. In this paper, we propose a texture-based text extraction method using the discrete wavelet transform (DWT) with K-means clustering. First, edges are detected in the image using the DWT. Then, a small overlapping sliding window scans the high-frequency sub-bands, from which texture features of text and non-text regions are extracted. Based on these features, K-means clustering classifies the image into text, simple-background and complex-background clusters. Finally, a voting decision process and area-based filtering are used to locate the text regions precisely. Experiments are carried out on the public ICDAR 2013 dataset and on our own dataset of English, Hindi and Punjabi text images, for different numbers of clusters. The results show that the proposed method is promising across languages in terms of detection rate (DR), precision rate (PR) and recall rate (RR).
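The pipeline outlined in the abstract (DWT, overlapping sliding window over the high-frequency sub-bands, K-means, voting) can be sketched roughly as follows. This is only an illustrative sketch under several assumptions that the abstract does not confirm: a one-level Haar wavelet with averaging normalization, the mean and standard deviation of absolute coefficients as texture features, a deterministic farthest-point initialization for K-means, and a rule that treats the cluster with the largest mean high-frequency energy as text. All function names, the window size and the step are hypothetical, and the area-based filtering step is omitted.

```python
import numpy as np

def haar_dwt2(img):
    """One-level 2-D Haar DWT (averaging form); returns LL, LH, HL, HH.
    Sub-band naming conventions vary between libraries."""
    img = np.asarray(img, dtype=float)
    img = img[: img.shape[0] // 2 * 2, : img.shape[1] // 2 * 2]
    lo = (img[:, 0::2] + img[:, 1::2]) / 2.0  # low-pass along rows
    hi = (img[:, 0::2] - img[:, 1::2]) / 2.0  # high-pass along rows
    ll = (lo[0::2] + lo[1::2]) / 2.0
    lh = (lo[0::2] - lo[1::2]) / 2.0
    hl = (hi[0::2] + hi[1::2]) / 2.0
    hh = (hi[0::2] - hi[1::2]) / 2.0
    return ll, lh, hl, hh

def window_features(bands, win=4, step=2):
    """Per window: [mean, std] of |coefficients| for each sub-band (assumed features)."""
    h, w = bands[0].shape
    feats, origins = [], []
    for r in range(0, h - win + 1, step):
        for c in range(0, w - win + 1, step):
            vec = []
            for band in bands:
                patch = np.abs(band[r:r + win, c:c + win])
                vec += [patch.mean(), patch.std()]
            feats.append(vec)
            origins.append((r, c))
    return np.array(feats), origins

def farthest_point_init(x, k):
    """Deterministic seeding: lowest-energy point, then farthest points in turn."""
    centers = [x[np.linalg.norm(x, axis=1).argmin()]]
    for _ in range(k - 1):
        dist = np.min([np.linalg.norm(x - c, axis=1) for c in centers], axis=0)
        centers.append(x[dist.argmax()])
    return np.array(centers)

def kmeans(x, k, iters=50):
    """Plain Lloyd iterations; an empty cluster keeps its previous center."""
    centers = farthest_point_init(x, k)
    for _ in range(iters):
        dist = np.linalg.norm(x[:, None] - centers[None], axis=2)
        labels = dist.argmin(axis=1)
        new = np.array([x[labels == j].mean(axis=0) if np.any(labels == j)
                        else centers[j] for j in range(k)])
        if np.allclose(new, centers):
            break
        centers = new
    return labels, centers

def text_mask(img, k=3, win=4, step=2):
    """Boolean text mask at sub-band (half) resolution via majority voting."""
    _, lh, hl, hh = haar_dwt2(img)
    feats, origins = window_features((lh, hl, hh), win, step)
    labels, centers = kmeans(feats, k)
    # Assumption: the text cluster has the largest mean high-frequency energy.
    text_cluster = centers[:, 0::2].sum(axis=1).argmax()
    votes = np.zeros(lh.shape)
    cover = np.zeros(lh.shape)
    for (r, c), lab in zip(origins, labels):
        cover[r:r + win, c:c + win] += 1
        votes[r:r + win, c:c + win] += (lab == text_cluster)
    # A position is text if more than half of the windows covering it vote text.
    return votes > 0.5 * np.maximum(cover, 1)
```

Calling `text_mask(img, k=3)` on a 2-D grayscale array returns a mask with half the image's height and width; with `k=3` the three clusters loosely correspond to the text, simple-background and complex-background classes named in the abstract, although which cluster is which is decided here only by the energy heuristic above.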
Pages: 900-916
Page count: 17