Decoupled Representation Learning for Character Glyph Synthesis

Cited by: 6
Authors
Liu, Xiyan [1, 2]
Meng, Gaofeng [1, 2, 3]
Chang, Jianlong [1, 2]
Hu, Ruiguang [4]
Xiang, Shiming [1, 2]
Pan, Chunhong [1]
Affiliations
[1] Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing 100049, Peoples R China
[3] Chinese Acad Sci, HK Inst Sci & Innovat, Ctr Artificial Intelligence & Robot, Hong Kong 999077, Peoples R China
[4] Beijing Aerosp Automat Control Inst, Beijing 100854, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Task analysis; Generative adversarial networks; Gallium nitride; Topology; Standards; Electronic mail; Decoding; Character glyph synthesis; decoupled representation; generative adversarial networks; IMAGE; TEXT;
DOI
10.1109/TMM.2021.3072449
Chinese Library Classification
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Character glyph synthesis remains an open and challenging problem that involves two related aspects: font style transfer and content consistency. In this paper, we propose a novel model named FontGAN, which integrates character structure stylization, de-stylization, and texture transfer into a unified framework. Specifically, we decouple character images into a style representation and a content representation, which offers fine-grained control over these two types of variables and thus improves the quality of the generated results. To capture the style information effectively, a style consistency module (SCM) is introduced. Technically, SCM uses a category-guided Kullback-Leibler divergence to explicitly fit the style representation to different prior distributions. In this way, our model can perform transformations between multiple domains within one framework. In addition, we propose a content prior module (CPM) that provides a content prior to guide the content encoding process and alleviate the problem of stroke deficiency during structure de-stylization. Benefiting from the idea of decoupling and regrouping, FontGAN can perform many-to-many translation tasks for glyph structure. Experimental results demonstrate that the proposed FontGAN achieves state-of-the-art performance in character glyph synthesis.
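As a rough illustration of what a category-guided Kullback-Leibler term could look like, the sketch below computes the KL divergence between a diagonal Gaussian style code and a unit-variance Gaussian prior whose mean depends on the style category. The abstract does not state the form of the priors, so the Gaussian assumption, the names `kl_to_category_prior`, `style_kl_loss`, and `PRIOR_MEANS`, and the example prior means are all hypothetical and not taken from the paper.

```python
import math

def kl_to_category_prior(mu, log_var, prior_mean):
    """KL( N(mu, diag(exp(log_var))) || N(prior_mean, I) ) for one style code.

    Assumption: both distributions are Gaussian; the paper's actual priors
    are not specified in the abstract.
    """
    total = 0.0
    for m, lv, p in zip(mu, log_var, prior_mean):
        # Per-dimension closed form: sigma^2 + (mu - p)^2 - 1 - log(sigma^2)
        total += math.exp(lv) + (m - p) ** 2 - 1.0 - lv
    return 0.5 * total

# Hypothetical prior means, one per font style category.
PRIOR_MEANS = {
    "style_a": [2.0, 0.0],
    "style_b": [-2.0, 0.0],
}

def style_kl_loss(mu, log_var, style_label):
    """Select the prior by style category, then measure the divergence."""
    return kl_to_category_prior(mu, log_var, PRIOR_MEANS[style_label])

if __name__ == "__main__":
    mu, log_var = [2.0, 0.0], [0.0, 0.0]
    print(style_kl_loss(mu, log_var, "style_a"))  # code matches its own prior
    print(style_kl_loss(mu, log_var, "style_b"))  # code is far from the other prior
```

Under these assumptions, a style code that matches the prior of its own category incurs zero penalty, while the same code scored against a different category's prior is penalized. This is one way a single encoder could keep several style domains separated in latent space.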
Pages: 1787-1799
Page count: 13