Generation of Synthetic Data for Handwritten Word Alteration Detection

被引:3
|
作者
Dansena, Prabhat [1 ]
Bag, Soumen [1 ]
Pal, Rajarshi [2 ]
机构
[1] Indian Inst Technol ISM Dhanbad, Dept Comp Sci & Engn, Dhanbad 826004, Bihar, India
[2] Inst Dev & Res Banking Technol, Hyderabad 500057, India
来源
IEEE ACCESS | 2021年 / 9卷
关键词
Ink; Feature extraction; Image color analysis; Training; Task analysis; Writing; Strain; Convolution neural network; document forensics; handwritten; ink analysis; synthetic data; BALLPOINT PEN INKS; RAMAN-SPECTROSCOPY; DIFFERENTIATION; IDENTIFICATION; CLASSIFICATION; BLUE;
D O I
10.1109/ACCESS.2021.3059342
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Fraudsters often alter handwritten contents in a document in order to achieve illicit purposes. At times, this may result in financial and mental loss to an individual or an organization. Hence, ink analysis is necessary to identify such an alteration. Convolution Neural Network (CNN) can be used to identify such cases of alteration, as CNN has emerged as a monumental success in the field of computer vision for varieties of classification tasks. But, CNN requires large amount of labeled data for training. Hence, there is a need to generate a large dataset for the experiments relating to handwritten word alteration detection. Collection, digitization, and cropping of a large number of altered and unaltered handwritten words are tedious and time consuming. To overcome such an issue, an approach for synthetic word data generation is presented in this paper for handwritten word alteration detection experiments. This scheme is designed in such a way that the synthetically generated words are very similar to the original ones. In order to achieve this, handwritten character data set is prepared using 10 blue and 10 black pens. These handwritten characters are used for creating synthetic word alteration data set. The presented approach uses relatively less number of handwritten character images to create a huge word alteration data set. Further, deep learning models are trained on the synthetically generated data set for word alteration detection.
引用
收藏
页码:38979 / 38990
页数:12
相关论文
共 50 条
  • [1] Generation of Synthetic Data for Handwritten Word Alteration Detection
    Dansena, Prabhat
    Bag, Soumen
    Pal, Rajarshi
    [J]. IEEE Access, 2021, 9 : 38979 - 38990
  • [2] Generation of synthetic training data for handwritten Indic script recognition
    Gaur, Shivansh
    Sonkar, Siddhant
    Roy, Partha Pratim
    [J]. 2015 13TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2015, : 491 - 495
  • [3] Synthetic Data Generation for Surface Defect Detection
    Lebert, Deborah
    Plouzeau, Jeremy
    Farrugia, Jean-Philippe
    Danglade, Florence
    Merienne, Frederic
    [J]. EXTENDED REALITY, XR SALENTO 2022, PT II, 2022, 13446 : 198 - 208
  • [4] Unsupervised Adaptation for Synthetic-to-Real Handwritten Word Recognition
    Kang, Lei
    Rusinol, Marcal
    Fornes, Alicia
    Riba, Pau
    Villegas, Mauricio
    [J]. 2020 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2020, : 3491 - 3500
  • [5] A study on top-down word image generation for handwritten word recognition
    Ishidera, E
    Nishiwaki, D
    [J]. SEVENTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS I AND II, PROCEEDINGS, 2003, : 1173 - 1177
  • [6] Synthetic Generation of Handwritten Signatures Based on Spectral Analysis
    Galbally, Javier
    Fierrez, Julian
    Martinez-Diaz, Marcos
    Ortega-Garcia, Javier
    [J]. OPTICS AND PHOTONICS IN GLOBAL HOMELAND SECURITY V AND BIOMETRIC TECHNOLOGY FOR HUMAN IDENTIFICATION VI, 2009, 7306
  • [7] Effective synthetic data generation for fake user detection
    Esmaili, Arefeh
    Farzi, Saeed
    [J]. 2021 26TH INTERNATIONAL COMPUTER CONFERENCE, COMPUTER SOCIETY OF IRAN (CSICC), 2021,
  • [8] Generation of Synthetic Training Data for Object Detection in Piles
    Buls, Elvijs
    Kadikis, Roberts
    Cacurs, Ricards
    Arents, Janis
    [J]. ELEVENTH INTERNATIONAL CONFERENCE ON MACHINE VISION (ICMV 2018), 2019, 11041
  • [9] Generation of Synthetic Data for a Radiation Detection Algorithm Competition
    Nicholson A.D.
    Peplow D.E.
    Ghawaly J.M.
    Willis M.J.
    Archer D.E.
    [J]. IEEE Transactions on Nuclear Science, 2020, 67 (08) : 1968 - 1975
  • [10] Skew detection and correction of online bangla handwritten word
    Department of Computer science, National Institute of Technology Patna, Patna, India
    不详
    [J]. Int. J. Comput. Sci. Issues, 4 4-2 (202-205):