Dilated Convolutional Neural Networks for Lightweight Diacritics Restoration

被引:0
|
作者
Csanady, Balint [1 ]
Lukacs, Andras [1 ]
机构
[1] Eotvos Lorand Univ, Inst Math, Dept Comp Sci, AI Res Grp, Pazmany Peter Stny 1-C, H-1036 Budapest, Hungary
关键词
diacritics restoration; 1D convolutional neural network; A-TCN; small footprint; Hungarian;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Diacritics restoration has become a ubiquitous task in the Latin-alphabet-based English-dominated Internet language environment. In this paper, we describe a small footprint 1D dilated convolution-based approach which operates on a character-level. We find that neural networks based on 1D dilated convolutions are competitive alternatives to solutions based on recurrent neural networks or linguistic modeling for the task of diacritics restoration. Our approach surpasses the performance of similarly sized models and is also competitive with larger models. A special feature of our solution is that it even runs locally in a web browser. We also provide a working example of this browser-based implementation. Our model is evaluated on different corpora, with emphasis on the Hungarian language. We performed comparative measurements about the generalization power of the model in relation to three Hungarian corpora. We also analyzed the errors to understand the limitation of corpus-based self-supervised training.
引用
收藏
页码:4253 / 4259
页数:7
相关论文
共 50 条
  • [1] Diacritics Restoration Using Neural Networks
    Naplava, Jakub
    Straka, Milan
    Stranak, Pavel
    Hajic, Jan
    PROCEEDINGS OF THE ELEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2018), 2018, : 1566 - 1573
  • [2] ROMANIAN DIACRITICS RESTORATION USING RECURRENT NEURAL NETWORKS
    Ruseti, Stefan
    Cotet, Teodor-Mihai
    Dascalu, Mihai
    PROCEEDINGS OF THE 13TH INTERNATIONAL CONFERENCE LINGUISTIC RESOURCES AND TOOLS FOR PROCESSING THE ROMANIAN LANGUAGE, 2018, : 61 - 68
  • [3] Quantum Dilated Convolutional Neural Networks
    Chen, Yixiong
    IEEE ACCESS, 2022, 10 : 20240 - 20246
  • [4] RHNet: Lightweight Dilated Convolutional Networks for Dense Objects Counting
    Yu, Rui
    Xu, Xiangyang
    Shen, Yexiong
    PROCEEDINGS OF THE 38TH CHINESE CONTROL CONFERENCE (CCC), 2019, : 8455 - 8459
  • [5] Review of Lightweight Deep Convolutional Neural Networks
    Chen, Fanghui
    Li, Shouliang
    Han, Jiale
    Ren, Fengyuan
    Yang, Zhen
    ARCHIVES OF COMPUTATIONAL METHODS IN ENGINEERING, 2024, 31 (04) : 1915 - 1937
  • [6] Review of Lightweight Deep Convolutional Neural Networks
    Fanghui Chen
    Shouliang Li
    Jiale Han
    Fengyuan Ren
    Zhen Yang
    Archives of Computational Methods in Engineering, 2024, 31 : 1915 - 1937
  • [7] Review of research on lightweight convolutional neural networks
    Zhou, Yan
    Chen, Shaochang
    Wang, Yiming
    Huan, Wenming
    PROCEEDINGS OF 2020 IEEE 5TH INFORMATION TECHNOLOGY AND MECHATRONICS ENGINEERING CONFERENCE (ITOEC 2020), 2020, : 1713 - 1720
  • [8] Restoration of Compressed Picture Based on Lightweight Convolutional Neural Network
    Kuo, Tien-Ying
    Wei, Yu-Jen
    Chao, Chang-Hao
    2019 INTERNATIONAL SYMPOSIUM ON INTELLIGENT SIGNAL PROCESSING AND COMMUNICATION SYSTEMS (ISPACS), 2019,
  • [9] Dilated convolutional neural networks for time series forecasting
    Borovykh, Anastasia
    Bohte, Sander
    Oosterlee, Cornelis W.
    JOURNAL OF COMPUTATIONAL FINANCE, 2019, 22 (04) : 73 - 101
  • [10] Why Dilated Convolutional Neural Networks: A Proof of Their Optimality
    Contreras, Jonatan
    Ceberio, Martine
    Kreinovich, Vladik
    ENTROPY, 2021, 23 (06)