Language Identification for Interactive Handwriting Transcription of Multilingual Documents

被引:0
|
作者
del Agua, Miguel A. [1 ]
Serrano, Nicolas [1 ]
Juan, Alfons [1 ]
机构
[1] Univ Politecn Valencia, DSIC ITI, Valencia, Spain
关键词
Language Identification; Interactive Handwriting Transcription; Multilingual Documents;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
An effective approach to handwriting transcription of (old) documents is to follow a sequential, line-by-line transcription of the whole document, in which a continuously retrained system interacts with the user. In the case of multilingual documents, however, a minor yet important issue for this interactive approach is to first identify the language of the current text line image to be transcribed. In this paper, we propose a probabilistic framework and three techniques for this purpose. Empirical results are reported on an entire 764-page multilingual document for which previous empirical tests were limited to its first 180 pages, written only in Spanish.
引用
收藏
页码:596 / 603
页数:8
相关论文
共 50 条
  • [31] PROGRESS IN PROOF OF HANDWRITING AND DOCUMENTS
    Osborn, Albert S.
    JOURNAL OF CRIMINAL LAW & CRIMINOLOGY, 1933, 24 (01): : 118 - 124
  • [32] Multilingual Speech Identification Framework (MSIF) A Novel Approach in Language Identification
    Sawalkar, Swapnil
    Roy, Pinki
    PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PREMI 2023, 2023, 14301 : 716 - 723
  • [33] Text-based Language Identification of Multilingual Names
    Giwa, Oluwapelumi
    Davel, Marelie H.
    PROCEEDINGS OF THE 2015 PATTERN RECOGNITION ASSOCIATION OF SOUTH AFRICA AND ROBOTICS AND MECHATRONICS INTERNATIONAL CONFERENCE (PRASA-ROBMECH), 2015, : 166 - 171
  • [34] A unified system for multilingual speech recognition and language identification
    Liu, Danyang
    Xu, Ji
    Zhang, Pengyuan
    Yan, Yonghong
    SPEECH COMMUNICATION, 2021, 127 : 17 - 28
  • [35] Enhancing multilingual recognition of emotion in speech by language identification
    Sagha, Hesam
    Matejka, Pavel
    Gavryukova, Maryna
    Povolny, Filip
    Marchi, Erik
    Schuller, Bjoern
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 2949 - 2953
  • [36] Writer identification of Chinese handwriting documents using hidden Markov tree model
    He, Zhenyu
    You, Xinge
    Tang, Yuan Yan
    PATTERN RECOGNITION, 2008, 41 (04) : 1295 - 1307
  • [37] An Interactive Machine Translation Framework for Modernizing the Language of Historical Documents
    Domingo, Miguel
    Casacuberta, Francisco
    PATTERN RECOGNITION AND IMAGE ANALYSIS (IBPRIA 2022), 2022, 13256 : 41 - 53
  • [38] The Language of Handwriting
    Wolfson, Rose
    JOURNAL OF PROJECTIVE TECHNIQUES, 1953, 17 (02): : 234 - 234
  • [39] Evaluating an Interactive-Predictive Paradigm on Handwriting Transcription: A Case Study and Lessons Learned
    Leiva, Luis A.
    Romero, Veronica
    Toselli, Alejandro H.
    Vidal, Enrique
    2011 35TH IEEE ANNUAL INTERNATIONAL COMPUTER SOFTWARE AND APPLICATIONS CONFERENCE (COMPSAC), 2011, : 610 - 617
  • [40] Language Identification: A New Fast Algorithm to Identify the Language of a Text in a Multilingual Corpus
    Gadri, Said
    Moussaoui, Abdelouahab
    Belabdelouahab-Fernini, Linda
    2014 INTERNATIONAL CONFERENCE ON MULTIMEDIA COMPUTING AND SYSTEMS (ICMCS), 2014, : 321 - 326