MACROMOLECULAR QUERY LANGUAGE (MMQL) - PROTOTYPE DATA MODEL AND IMPLEMENTATION

被引:6
|
作者
SHINDYALOV, IN
CHANG, W
PU, C
BOURNE, PE
机构
[1] COLUMBIA UNIV,HOWARD HUGHES MED INST,NEW YORK,NY 10032
[2] COLUMBIA UNIV,DEPT BIOCHEM & MOLEC BIOPHYS,NEW YORK,NY 10032
来源
PROTEIN ENGINEERING | 1994年 / 7卷 / 11期
基金
美国国家科学基金会;
关键词
C++ CLASS LIBRARY; MACROMOLECULAR STRUCTURE; PATTERN EXPRESSION; PROTEIN DATA BANK; QUERY LANGUAGE;
D O I
10.1093/protein/7.11.1311
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Macromolecular query language (MMQL) is an extensible interpretive language in which to pose questions concerning the experimental or derived features of the 3-D structure of biological macromolecules. MMQL portends to be intuitive with a simple syntax, so that from a user's perspective complex queries are easily written. A number of basic queries and a more complex query-determination of structures containing a five-strand Greek key motif-are presented to illustrate the strengths and weaknesses of the language. The predominant features of MMQL are a filter and pattern grammar which are combined to express a wide range of interesting biological queries. Filters permit the selection of object attributes, for example, compound name and resolution, whereas the patterns currently implemented query primary sequence, close contacts, hydrogen bonding, secondary structure, conformation and amino acid properties (volume, polarity, isoelectric point, hydrophobicity and different forms of exposure). MMQL queries are processed by MMQLlib; a C++ class library, to which new query methods and pattern types are easily added. The prototype implementation described uses PDBlib, another C++-based class library for representing the features of biological macromolecules at the level of detail parsable from a PDB file. Since PDBlib can represent data stored in relational and object-oriented databases, as well as PDB files, once these data are loaded they too can be queried by MMQL. Performance metrics are given for queries of PDB files for which all derived data are calculated at run time and compared to a preliminary version of OOPDB, a prototype object-oriented database with a schema based on a persistent version of PDBlib which offers more efficient data access and the potential to maintain derived information. MMQLlib, PDBlib and associated software are available via anonymous ftp from cuhhca.hhmi.columbia.edu.
引用
收藏
页码:1311 / 1322
页数:12
相关论文
共 50 条
  • [1] StreamAPAS: Query Language and Data Model
    Gorawski, Marcin
    Chroszcz, Aleksander
    [J]. CISIS: 2009 INTERNATIONAL CONFERENCE ON COMPLEX, INTELLIGENT AND SOFTWARE INTENSIVE SYSTEMS, VOLS 1 AND 2, 2009, : 75 - 82
  • [2] The design and implementation of a Meaning Driven Data Query Language
    Kapetanios, E
    Baer, D
    Groenewoud, P
    Mueller, P
    [J]. 14TH INTERNATIONAL CONFERENCE ON SCIENTIFIC AND STATISTICAL DATABASE MANAGEMENT, PROCEEDINGS, 2002, : 20 - 23
  • [3] MMQL - A language for multi-model linking and filtering
    Fuchs, S.
    Scherer, R. J.
    [J]. EWORK AND EBUSINESS IN ARCHITECTURE, ENGINEERING AND CONSTRUCTION 2014, 2015, : 273 - 280
  • [4] Multidimensional data model and query language for informetrics
    Niemi, T
    Hirvonen, L
    Järvelin, K
    [J]. JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2003, 54 (10): : 939 - 951
  • [5] ALGEBRA AND QUERY LANGUAGE FOR A HISTORICAL DATA MODEL
    SARDA, NL
    [J]. COMPUTER JOURNAL, 1990, 33 (01): : 11 - 18
  • [6] A Survey of Natural Language Processing Implementation for Data Query Systems
    Wong, Albert
    Joiner, Dakota
    Chiu, Chunyin
    Elsayed, Mohamed
    Pereira, Keegan
    Khmelevsky, Youry
    Mahony, Joe
    [J]. IEEE INTERNATIONAL CONFERENCE ON RECENT ADVANCES IN SYSTEMS SCIENCE AND ENGINEERING (IEEE RASSE 2021), 2021,
  • [7] The NITE XML Toolkit: data model and query language
    Carletta, Jean
    Evert, Stefan
    Heid, Ulrich
    Kilgour, Jonathan
    [J]. LANGUAGE RESOURCES AND EVALUATION, 2005, 39 (04) : 313 - 334
  • [8] The NITE XML Toolkit: Data Model and Query Language
    Jean Carletta
    Stefan Evert
    Ulrich Heid
    Jonathan Kilgour
    [J]. Language Resources and Evaluation, 2005, 39 : 313 - 334
  • [9] A data model and a query language for multimedia documents databases
    Hirzalla, N
    Karmouch, A
    [J]. MULTIMEDIA SYSTEMS, 1999, 7 (04) : 338 - 348
  • [10] An Overview of the Deco System: Data Model and Query Language; Query Processing and Optimization
    Park, Hyunjung
    Pang, Richard
    Parameswaran, Aditya
    Garcia-Molina, Hector
    Polyzotis, Neoklis
    Widom, Jennifer
    [J]. SIGMOD RECORD, 2012, 41 (04) : 22 - 27