Stochastic grammatical inference of text database structure

被引:14
|
作者
Young-Lai, M [1 ]
Tompa, FW [1 ]
机构
[1] Univ Waterloo, Dept Comp Sci, Waterloo, ON N2L 3G1, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
stochastic grammatical inference; text database structure;
D O I
10.1023/A:1007653929870
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
For a document collection in which structural elements are identified with markup, it is often necessary to construct a grammar retrospectively that constrains element nesting and ordering. This has been addressed by others as an application of grammatical inference. We describe an approach based on stochastic grammatical inference which scales more naturally to large data sets and produces models with richer semantics. We adopt an algorithm that produces stochastic finite automata and describe modifications that enable better interactive control of results. Our experimental evaluation uses four document collections with varying structure.
引用
收藏
页码:111 / 137
页数:27
相关论文
共 50 条
  • [21] Grammatical Inference Algorithms in MATLAB
    Akram, Hasan Ibne
    de la Higuera, Colin
    Xiao, Huang
    Eckert, Claudia
    GRAMMATICAL INFERENCE: THEORETICAL RESULTS AND APPLICATIONS, ICGI 2010, 2010, 6339 : 262 - 266
  • [22] Grammatical Inference as Class Discrimination
    van Zaanen, Menno
    Gaustad, Tanja
    GRAMMATICAL INFERENCE: THEORETICAL RESULTS AND APPLICATIONS, ICGI 2010, 2010, 6339 : 245 - 257
  • [23] ALGORITHM FOR GRAMMATICAL INFERENCE.
    Smith, Howard R.
    1984, : 31 - 37
  • [24] Combination of estimation algorithms and grammatical inference techniques to learn Stochastic Context-Free Grammars
    Nevado, F
    Sánchez, JA
    Benedí, JM
    GRAMMATICAL INFERENCE: ALGORITHMS AND APPLICATIONS, 2000, 1891 : 196 - 206
  • [25] THE IMPLICIT AND THE EXPLICIT IN TRANSLATION - GRAMMATICAL AND NON-GRAMMATICAL INFERENCE
    ZEMB, JM
    REVUE D ESTHETIQUE, 1986, (12): : 55 - 61
  • [26] Using grammatical inference techniques to learn ontologies that describe the structure of domain instances
    Martins, Andre L.
    Pinto, H. Sofia
    Oliveira, Arlindo L.
    APPLIED ARTIFICIAL INTELLIGENCE, 2008, 22 (1-2) : 139 - 167
  • [27] Ten open problems in grammatical inference
    de la Higuera, Colin
    GRAMMATICAL INFERENCE: ALGORITHMS AND APPLICATIONS, PROCEEDINGS, 2006, 4201 : 32 - 44
  • [28] Phase Transitions within Grammatical Inference
    Pernot, Nicolas
    Cornuejols, Antoine
    Sebag, Michele
    19TH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI-05), 2005, : 811 - 816
  • [29] A Local Search Algorithm for Grammatical Inference
    Wieczorek, Wojciech
    GRAMMATICAL INFERENCE: THEORETICAL RESULTS AND APPLICATIONS, ICGI 2010, 2010, 6339 : 217 - 229
  • [30] Protein motif prediction by grammatical inference
    Peris, Piedachu
    Lopez, Damian
    Campos, Marcelino
    Sempere, Jose M.
    GRAMMATICAL INFERENCE: ALGORITHMS AND APPLICATIONS, PROCEEDINGS, 2006, 4201 : 175 - 187