Language Identification for Interactive Handwriting Transcription of Multilingual Documents

被引:0
|
作者
del Agua, Miguel A. [1 ]
Serrano, Nicolas [1 ]
Juan, Alfons [1 ]
机构
[1] Univ Politecn Valencia, DSIC ITI, Valencia, Spain
关键词
Language Identification; Interactive Handwriting Transcription; Multilingual Documents;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
An effective approach to handwriting transcription of (old) documents is to follow a sequential, line-by-line transcription of the whole document, in which a continuously retrained system interacts with the user. In the case of multilingual documents, however, a minor yet important issue for this interactive approach is to first identify the language of the current text line image to be transcribed. In this paper, we propose a probabilistic framework and three techniques for this purpose. Empirical results are reported on an entire 764-page multilingual document for which previous empirical tests were limited to its first 180 pages, written only in Spanish.
引用
收藏
页码:596 / 603
页数:8
相关论文
共 50 条
  • [21] Language Identification Networks for Multilingual Everyday Recordings
    Praveen, Kiran
    Radhakrishnan, Balaji
    Sabu, Kamini
    Pandey, Abhishek
    Shaik, Mahaboob Ali Basha
    INTERSPEECH 2023, 2023, : 4124 - 4128
  • [22] Multilingual Tandem Bottleneck Feature For Language Identification
    Geng, Wang
    Li, Jie
    Zhang, Shanshan
    Cai, Xinyuan
    Xu, Bo
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 413 - 417
  • [23] Multilingual Grammar Induction with Continuous Language Identification
    Han, Wenjuan
    Wang, Ge
    Jiang, Yong
    Tu, Kewei
    2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 5728 - 5733
  • [24] Tackling the multilingual and heterogeneous documents with the pre-trained language identifiers
    Kanfoud M.R.
    Bouramoul A.
    International Journal of Computers and Applications, 2023, 45 (05) : 391 - 402
  • [25] THE USE OF WEAK ESTIMATORS TO ACHIEVE LANGUAGE DETECTION AND TRACKING IN MULTILINGUAL DOCUMENTS
    Stensby, Aleksander
    Oommen, B. John
    Granmo, Ole-Christoffer
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2013, 27 (04)
  • [26] MIDAS - A visual language for interactive design of multimedia documents
    Goncalves, C
    Jorge, J
    FIFTH INTERNATIONAL CONFERENCE ON COMPUTER-AIDED DESIGN & COMPUTER GRAPHICS, VOLS 1 AND 2, 1997, : 205 - 210
  • [27] Language Identification from Handwritten Documents
    Mioulet, Luc
    Garain, Utpal
    Chatelain, Clement
    Barlas, Philippine
    Paquet, Thierry
    2015 13TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2015, : 676 - 680
  • [28] GMM-based Handwriting Style Identification System for Historical Documents
    Slimane, Fouad
    Schassan, Torsten
    Maergner, Volker
    2014 6TH INTERNATIONAL CONFERENCE OF SOFT COMPUTING AND PATTERN RECOGNITION (SOCPAR), 2014, : 387 - 392
  • [29] Interactive Layout Analysis and Transcription Systems for Historic Handwritten Documents
    Ramos-Terrades, Oriol
    Tose, Alejandro H.
    Serrano, Nicolas
    Romero, Veronica
    Vidal, Enrique
    Juan, Alfons
    DOCENG2010: PROCEEDINGS OF THE 2010 ACM SYMPOSIUM ON DOCUMENT ENGINEERING, 2010, : 219 - 222
  • [30] Interactive Groups: Fostering Collaborative Interactions in an Additional Language in a Multilingual Context
    Ugalde, Leire
    Garcia-Carrion, Rocio
    Intxausti-Intxausti, Nahia
    Zubiri-Esnaola, Harkaitz
    INTERNATIONAL JOURNAL OF SOCIOLOGY OF EDUCATION, 2023, 12 (03): : 273 - 292