Stochastic grammatical inference of text database structure

被引:14
|
作者
Young-Lai, M [1 ]
Tompa, FW [1 ]
机构
[1] Univ Waterloo, Dept Comp Sci, Waterloo, ON N2L 3G1, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
stochastic grammatical inference; text database structure;
D O I
10.1023/A:1007653929870
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
For a document collection in which structural elements are identified with markup, it is often necessary to construct a grammar retrospectively that constrains element nesting and ordering. This has been addressed by others as an application of grammatical inference. We describe an approach based on stochastic grammatical inference which scales more naturally to large data sets and produces models with richer semantics. We adopt an algorithm that produces stochastic finite automata and describe modifications that enable better interactive control of results. Our experimental evaluation uses four document collections with varying structure.
引用
收藏
页码:111 / 137
页数:27
相关论文
共 50 条
  • [41] Using Tree Transducers for Grammatical Inference
    Sandillon-Rezer, Noemie-Fleur
    Moot, Richard
    LOGICAL ASPECTS OF COMPUTATIONAL LINGUISTICS, LACL 2011, 2011, 6736 : 235 - 250
  • [42] Grammatical inference using an evolutionary technical
    Guse Scos Venske, Sandra Mara
    de Re, Angelita Maria
    Schram, Giovani
    Tosatti, Murilo Augusto
    Kultz, Rene
    ACTA SCIENTIARUM-TECHNOLOGY, 2011, 33 (02) : 163 - 169
  • [43] Introduction to the Special Issue on Grammatical Inference
    Jeffrey Heinz
    C. de la Higuera
    Tim Oates
    Machine Learning, 2014, 96 : 1 - 3
  • [44] Grammatical Inference by Answer Set Programming
    Wieczorek, Wojciech
    Strak, Lukasz
    Nowakowski, Arkadiusz
    Unold, Olgierd
    COMPUTATIONAL SCIENCE - ICCS 2020, PT IV, 2020, 12140 : 45 - 58
  • [45] Grammatical inference using tabu search
    Giordano, J.Y.
    Lecture Notes in Artificial Intelligence (Subseries of Lecture Notes in Computer Science), 1996, 1147
  • [46] Grammatical Inference for the Construction of Opening Books
    Wieczorek, Wojciech
    Nowakowski, Arkadiusz
    2015 SECOND INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE, COMPUTER ENGINEERING, AND SOCIAL MEDIA (CSCESM), 2015, : 19 - 22
  • [47] Grammatical Inference and Language Frameworks for LANGSEC
    Wood, Kerry N.
    Harang, Richard E.
    2015 IEEE SECURITY AND PRIVACY WORKSHOPS (SPW), 2015, : 88 - 98
  • [48] GRAMMATICAL INFERENCE BASED ON HYPEREDGE REPLACEMENT
    JELTSCH, E
    KREOWSKI, HJ
    LECTURE NOTES IN COMPUTER SCIENCE, 1991, 532 : 461 - 474
  • [49] A survey of grammatical inference in software engineering
    Stevenson, Andrew
    Cordy, James R.
    SCIENCE OF COMPUTER PROGRAMMING, 2014, 96 : 444 - 459
  • [50] Grammatical inference using suffix trees
    Geertzen, J
    van Zaanen, M
    GRAMMATICAL INFERENCE: ALGORITHMS AND APPLICATIONS, PROCEEDINGS, 2004, 3264 : 163 - 174