A LANGUAGE FOR THE DEFINITION AND EXCHANGE OF BIOLOGICAL DATA SETS

被引:4
|
作者
WHITE, RJ [1 ]
ALLKIN, R [1 ]
机构
[1] ROYAL BOT GARDENS,COMP SECT,RICHMOND TW9 3AB,ENGLAND
关键词
D O I
10.1016/0895-7177(92)90163-F
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Increasing numbers of biologists and institutes are becoming involved in taxonomic database projects that for good practical and historical reasons use different hardware, software and data structures. A large number of diverse application programs, mostly with incompatible data formats, is in use by biologists for different purposes. Translating data from one program, project or format to another and developing special-purpose or 'one-off' translation software is becoming a rapidly growing burden on the biological community. A database file exchange medium to handle diverse classes of biological data is required for file transfers between different database projects. 'XDF' (the Exchange Data Format) is such a medium. Data sets prepared in XDF consist of text files that are effectively independent of any particular project. XDF is a high-level language for describing biological data, with its own syntax and command vocabulary, analogous to the high-level programming languages used to describe software algorithms. XDF files may be generated and read automatically by programs to transfer large amounts of data between sophisticated databases. Alternatively, biologists unfamiliar with the terminology, data rules and syntax of the data format required for a particular application program or database can use a text editor to create an XDF data file. We hope the existence of XDF will encourage the development of more sophisticated general-purpose programs for interactive biological data entry. XDF is being used initially with numeric and structured textual descriptive data, but is designed to be extensible to other classes of biological data such as images. Provision is made within XDF for predefined standard definitions of the common core elements of biological data sets such as the taxonomic hierarchy, biological nomenclature, descriptive material and bibliography. Using these, XDF can be used to define specialist transfer formats for particular application areas while providing strict control of the data types and data definitions used.
引用
收藏
页码:199 / 223
页数:25
相关论文
共 50 条
  • [31] Compact Representation of Large RDF Data Sets for Publishing and Exchange
    Fernandez, Javier D.
    Martinez-Prieto, Miguel A.
    Gutierrez, Claudio
    SEMANTIC WEB-ISWC 2010, PT I, 2010, 6496 : 193 - +
  • [32] The Problems of Language Identification within Hugely Multilingual Data Sets
    Xia, Fei
    Lewis, Carrie
    Lewis, William D.
    LREC 2010 - SEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2010, : 2790 - 2797
  • [34] The Expansion of the Definition of Granular Sets
    Guan, Sujie
    FOURTH INTERNATIONAL CONFERENCE ON MACHINE VISION (ICMV 2011): MACHINE VISION, IMAGE PROCESSING, AND PATTERN ANALYSIS, 2012, 8349
  • [35] An Equivalent Definition of Rough Sets
    Liu, Guilong
    Huang, James Kuodo
    ROUGH SETS AND CURRENT TRENDS IN COMPUTING, PROCEEDINGS, 2008, 5306 : 52 - +
  • [36] Exceptional sets for the definition of quasiconformality
    Kallunki, S
    Koskela, P
    AMERICAN JOURNAL OF MATHEMATICS, 2000, 122 (04) : 735 - 743
  • [37] ASPECTS OF BIOLOGICAL EVOLUTION BASED ON MOLECULAR-DATA SETS
    BARNABAS, S
    BARNABAS, J
    PROCEEDINGS OF THE INDIAN ACADEMY OF SCIENCES-ANIMAL SCIENCES, 1984, 93 (03): : 141 - 158
  • [38] Reproducibility of Finding Enriched Gene Sets in Biological Data Analysis
    Zyla, Joanna
    Marczyk, Michal
    Polanska, Joanna
    11TH INTERNATIONAL CONFERENCE ON PRACTICAL APPLICATIONS OF COMPUTATIONAL BIOLOGY & BIOINFORMATICS, 2017, 616 : 146 - 154
  • [39] DEFINITION OF CONNECTED SETS OF A MATROID
    HARTFIEL, DJ
    MAXSON, CJ
    JOURNAL OF COMBINATORIAL THEORY SERIES B, 1978, 25 (02) : 130 - 134
  • [40] Analysing large biological data sets with an improved algorithm for MIC
    Wang, Shuliang
    Zhao, Yiping
    INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2015, 13 (02) : 158 - 170