A Hybrid Deep Learning Approach to Keyword Spotting in Vietnamese Stele Images

被引：0

作者：

Scius-Bertrand A. ^{[1
]}

Bui M. ^{[2
]}

Fischer A. ^{[1
]}

机构：

[1] University of Fribourg and HES-SO, Fribourg

[2] Ecole Pratique des Hautes Etudes, Paris

来源：

Informatica (Slovenia) | 2023年 / 47卷 / 03期

关键词：

annotation-free; Chu Nom; document image analysis; Hausdorff edit distance; hybrid deep learning; keyword spotting; Vietnamese steles;

D O I：

10.31449/inf.v47i3.4785

中图分类号：

学科分类号：

摘要：

In order to access the rich cultural heritage conveyed in Vietnamese steles, automatic reading of stone engravings would be a great support for historians, who are analyzing tens of thousands of stele images. Approaching the challenging problem with deep learning alone is difficult because the data-driven models require large representative datasets with expert human annotations, which are not available for the steles and costly to obtain. In this article, we present a hybrid approach to spot keywords in stele images that combines data-driven deep learning with knowledge-based structural modeling and matching of Chu Nom characters. The main advantage of the proposed method is that it is annotation-free, i.e. no human data annotation is required. In an experimental evaluation, we demonstrate that keywords can be successfully spotted with a mean average precision of more than 70% when a single engraving style is considered. © 2023 Slovene Society Informatika. All rights reserved.

引用

页码：361 / 372

页数：11

共 50 条

[21] A survey of keyword spotting techniques for printed document images
Murugappan, Abirami
Ramachandran, Baskaran
Dhavachelvan, P.
ARTIFICIAL INTELLIGENCE REVIEW, 2011, 35 (02) : 119 - 136
[22] A survey of keyword spotting techniques for printed document images
Abirami Murugappan
Baskaran Ramachandran
P. Dhavachelvan
Artificial Intelligence Review, 2011, 35 : 119 - 136
[23] Broadcasted Residual Learning for Efficient Keyword Spotting
Kim, Byeonggeun
Chang, Simyung
Lee, Jinkyu
Sung, Dooyong
INTERSPEECH 2021, 2021, : 4538 - 4542
[24] Efficient Learning-Free Keyword Spotting
Retsinas, George
Louloudis, Georgios
Stamatopoulos, Nikolaos
Gatos, Basilis
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2019, 41 (07) : 1587 - 1600
[25] PROGRESSIVE CONTINUAL LEARNING FOR SPOKEN KEYWORD SPOTTING
Huang, Yizheng
Hou, Nana
Chen, Nancy F.
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 7552 - 7556
[26] On-Device Customization of Tiny Deep Learning Models for Keyword Spotting With Few Examples
Rusci, Manuele
Tuytelaars, Tinne
IEEE MICRO, 2023, 43 (06) : 50 - 57
[27] Multitask Learning of Deep Neural Network-Based Keyword Spotting for IoT Devices
Leem, Seong-Gyun
Yoo, In-Chul
Yook, Dongsuk
IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2019, 65 (02) : 188 - 194
[28] Deep Convolutional Spiking Neural Networks for Keyword Spotting
Yilmaz, Emre
Gevrek, Ozgur Bora
Wu, Jibin
Chen, Yuxiang
Meng, Xuanbo
Li, Haizhou
INTERSPEECH 2020, 2020, : 2557 - 2561
[29] A Deep Learning Approach for Arabic Spoken Command Spotting
Salhab, Mahmoud
Harmanani, Haidar
2024 IEEE CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING, CCECE 2024, 2024, : 330 - 334
[30] Annotation-Free Character Detection in Historical Vietnamese Stele Images
Scius-Bertrand, Anna
Jungo, Michael
Wolf, Beat
Fischer, Andreas
Bui, Marc
DOCUMENT ANALYSIS AND RECOGNITION - ICDAR 2021, PT I, 2021, 12821 : 432 - 447

← 1 2 3 4 5 →