Data Mining for Grammatical Inference with Bioinformatics Criteria

Cited by: 0
Authors
Lopez, Vivian F. [1]
Aguilar, Ramiro [1]
Alonso, Luis [1]
Moreno, Maria N. [1]
Corchado, Juan M. [1]
Affiliations
[1] Univ Salamanca, Dept Informat & Automat, E-37008 Salamanca, Spain
Keywords
Grammatical Inference; Bioinformatics; Context-Free Grammar; DNA; Sequential Patterns
DOI
Not available
Chinese Library Classification
TP18 [Theory of Artificial Intelligence]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In this paper we describe theoretical and practical results of a novel data mining process that combines association analysis with classical sequencing algorithms from genomics to generate the grammatical structures of a specific language. We used a compiler generator system to develop a practical application in the area of grammarware, where language analysis concepts are applied to other disciplines, such as bioinformatics. The tool automatically measures the complexity of the grammar obtained from textual data. We present a technique for the incremental discovery of sequential patterns that yields simplified production rules, which are then compacted using bioinformatics criteria to form a grammar.
Pages: 53-60
Page count: 8