Font Group Identification Using Reconstructed Fonts

被引:0
|
作者
Cutter, Michael P. [1 ]
van Beusekom, Joost [1 ]
Shafait, Faisal [1 ]
Breuel, Thomas M. [1 ]
机构
[1] Univ Kaiserslautern, D-67663 Kaiserslautern, Germany
来源
关键词
Font Reconstruction; Font Identification; Reconstructed Font; Token Matching;
D O I
10.1117/12.873398
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Ideally, digital versions of scanned documents should be represented in a format that is searchable, compressed, highly readable, and faithful to the original. These goals can theoretically be achieved through OCR and font recognition, re-typesetting the document text with original fonts. However, OCR and font recognition remain hard problems, and many historical documents use fonts that are not available in digital forms. It is desirable to be able to reconstruct fonts with vector glyphs that approximate the shapes of the letters that form a font. In this work, we address the grouping of tokens in a token-compressed document into candidate fonts. This permits us to incorporate font information into token-compressed images even when the original fonts are unknown or unavailable in digital format. This paper extends previous work in font reconstruction by proposing and evaluating an algorithm to assign a font to every character within a document. This is a necessary step to represent a scanned document image with a reconstructed font. Through our evaluation method, we have measured a 98.4% accuracy for the assignment of letters to candidate fonts in multi-font documents.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] Dyslexia and Fonts: Is a Specific Font Useful?
    Bachmann, Christina
    Mengheri, Lauro
    BRAIN SCIENCES, 2018, 8 (05)
  • [2] FontFusionGAN: Refinement of Handwritten Fonts by Font Fusion
    Kumar, Avinash
    Kang, Kyeolhee
    Muhammad, Ammar Ul Hassan
    Choi, Jaeyoung
    ELECTRONICS, 2023, 12 (20)
  • [3] AN INTRODUCTION TO TYPOGRAPHIC FONTS AND DIGITAL FONT RESOURCES
    GRIFFEE, AW
    CASEY, CA
    IBM SYSTEMS JOURNAL, 1988, 27 (02) : 206 - 218
  • [4] Modeling Fonts in Context: Font Prediction on Web Designs
    Zhao, Nanxuan
    Cao, Ying
    Lau, Rynson W. H.
    COMPUTER GRAPHICS FORUM, 2018, 37 (07) : 385 - 395
  • [5] How to boss your fonts around: A primer on font technology and font management on the Macintosh.
    Gillespie, T
    LIBRARY JOURNAL, 1998, 123 (14) : 210 - 210
  • [6] Impressions2Font: Generating Fonts by Specifying Impressions
    Matsuda, Seiya
    Kimura, Akisato
    Uchida, Seiichi
    DOCUMENT ANALYSIS AND RECOGNITION, ICDAR 2021, PT III, 2021, 12823 : 739 - 754
  • [7] AN ANTIALIASING METHOD FOR LOW RESOLUTION FONTS BASED ON FONT STRUCTURE
    NANARD, M
    NANARD, J
    RASTER IMAGING AND DIGITAL TYPOGRAPHY, 1989, : 111 - 122
  • [8] FONTS SET FREE + CUSTOM FONT DESIGN IS REVOLUTIONIZING TYPOGRAPHY
    BARNBROOK, J
    DESIGN, 1991, (514): : 24 - 27
  • [9] Farsi Font Recognition Based On the Fonts of Text Samples Extracted by SOM
    Ziaratban, Majid
    Bagheri, Fatemeh
    JOURNAL OF MATHEMATICS AND COMPUTER SCIENCE-JMCS, 2015, 15 (01): : 40 - 56
  • [10] Standard Font Screening Method for Replicating Ancient Mongolian Kanjur Fonts
    Wurihan
    Biligebatu
    PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON SOCIAL SCIENCE, PUBLIC HEALTH AND EDUCATION (SSPHE 2018), 2018, 196 : 23 - 25