Stochastic grammatical inference of text database structure

被引:14
|
作者
Young-Lai, M [1 ]
Tompa, FW [1 ]
机构
[1] Univ Waterloo, Dept Comp Sci, Waterloo, ON N2L 3G1, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
stochastic grammatical inference; text database structure;
D O I
10.1023/A:1007653929870
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
For a document collection in which structural elements are identified with markup, it is often necessary to construct a grammar retrospectively that constrains element nesting and ordering. This has been addressed by others as an application of grammatical inference. We describe an approach based on stochastic grammatical inference which scales more naturally to large data sets and produces models with richer semantics. We adopt an algorithm that produces stochastic finite automata and describe modifications that enable better interactive control of results. Our experimental evaluation uses four document collections with varying structure.
引用
收藏
页码:111 / 137
页数:27
相关论文
共 50 条
  • [31] Characteristic Sets for Polynomial Grammatical Inference
    Colin de la Higuera
    Machine Learning, 1997, 27 : 125 - 138
  • [32] Towards General Algorithms for Grammatical Inference
    Clark, Alexander
    DISCOVERY SCIENCE, DS 2010, 2010, 6332 : 381 - 381
  • [33] Grammatical inference: learning automata and grammars
    Daelemans, Walter
    MACHINE TRANSLATION, 2010, 24 (3-4) : 291 - 293
  • [34] GRAMMATICAL INFERENCE - A REVIEW AND SOME CONTRIBUTIONS
    GARCIA, P
    VIDAL, E
    SEGARRA, E
    REVISTA DE INFORMATICA Y AUTOMATICA, 1989, 22 (01): : 7 - 27
  • [35] Grammatical Inference and Games: Extended Abstract
    Lucas, Simon M.
    GRAMMATICAL INFERENCE: THEORETICAL RESULTS AND APPLICATIONS, ICGI 2010, 2010, 6339 : 1 - 4
  • [36] Towards General Algorithms for Grammatical Inference
    Clark, Alexander
    ALGORITHMIC LEARNING THEORY, ALT 2010, 2010, 6331 : 11 - 30
  • [37] Characteristic sets for polynomial grammatical inference
    DelaHiguera, C
    MACHINE LEARNING, 1997, 27 (02) : 125 - 138
  • [38] Bio-inspired Grammatical Inference
    Becerra-Bonache, Leonor
    FOUNDATIONS ON NATURAL AND ARTIFICIAL COMPUTATION: 4TH INTERNATIONAL WORK-CONFERENCE ON THE INTERPLAY BETWEEN NATURAL AND ARTIFICIAL COMPUTATION, IWINAC 2011, PART I, 2011, 6686 : 313 - 322
  • [39] Introduction to the Special Issue on Grammatical Inference
    Heinz, Jeffrey
    de la Higuera, C.
    Oates, Tim
    MACHINE LEARNING, 2014, 96 (1-2) : 1 - 3
  • [40] Grammatical Inference in the Discovery of Generating Functions
    Wieczorek, Wojciech
    Nowakowski, Arkadiusz
    MAN-MACHINE INTERACTIONS 4, ICMMI 2015, 2016, 391 : 627 - 637