A Noise-tolerant Differentiable Learning Approach for Single Occurrence Regular Expression with Interleaving

Authors
Ye, Rongzhen [1]
Zhuang, Tianqu [1]
Wan, Hai [1]
Du, Jianfeng [2]
Luo, Weilin [1]
Liang, Pingjia [1]
Affiliations
[1] Sun Yat Sen Univ, Sch Comp Sci & Engn, Guangzhou, Peoples R China
[2] Guangdong Univ Foreign Studies, Guangzhou Key Lab Multilingual Intelligent Proc, Guangzhou, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
We study the problem of learning a single occurrence regular expression with interleaving (SOIRE) from a set of text strings that may contain noise. SOIREs fully support interleaving and cover a large portion of the regular expressions used in practice. Learning SOIREs is challenging because it is computationally expensive and because text strings usually contain noise in practice. Most previous studies learn only restricted SOIREs and are not robust to noisy data. To tackle these issues, we propose SOIREDL, a noise-tolerant differentiable learning approach for SOIREs. We design a neural network that simulates SOIRE matching and prove theoretically that certain assignments of the set of parameters learnt by the neural network, called faithful encodings, are in one-to-one correspondence with SOIREs of bounded size. Based on this correspondence, we interpret the target SOIRE from an assignment of the network's parameters by exploring the nearest faithful encodings. Experimental results show that SOIREDL outperforms state-of-the-art approaches, especially on noisy data.
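As a reading aid for the interleaving operator mentioned in the abstract, the following is a minimal sketch of its standard semantics on two plain strings: a string belongs to the interleaving of `u` and `v` if it merges their symbols while preserving the order within each. This is not the SOIREDL method and not code from the paper; the function name `is_interleaving` and the example strings are hypothetical, chosen only for illustration.

```python
from functools import lru_cache


def is_interleaving(w: str, u: str, v: str) -> bool:
    """Return True if w merges u and v while keeping each one's symbol order.

    Illustrative helper only (hypothetical name, not from the paper).
    """
    if len(w) != len(u) + len(v):
        return False

    @lru_cache(maxsize=None)
    def match(i: int, j: int) -> bool:
        # i symbols of u and j symbols of v have been consumed,
        # so the next symbol of w to explain is at position i + j.
        if i == len(u) and j == len(v):
            return True
        k = i + j
        if i < len(u) and u[i] == w[k] and match(i + 1, j):
            return True
        if j < len(v) and v[j] == w[k] and match(i, j + 1):
            return True
        return False

    return match(0, 0)


# "acbd" keeps a before b and c before d, so it is an interleaving of "ab" and "cd".
print(is_interleaving("acbd", "ab", "cd"))
# "bacd" places b before a, so it is not.
print(is_interleaving("bacd", "ab", "cd"))
```

The memoised recursion checks membership for two fixed strings only; a full SOIRE additionally combines interleaving with the usual regular expression operators under the restriction that each symbol occurs at most once in the expression.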
Pages: 4809-4817
Number of pages: 9