A LANGUAGE FOR THE DEFINITION AND EXCHANGE OF BIOLOGICAL DATA SETS

被引:4
|
作者
WHITE, RJ [1 ]
ALLKIN, R [1 ]
机构
[1] ROYAL BOT GARDENS,COMP SECT,RICHMOND TW9 3AB,ENGLAND
关键词
D O I
10.1016/0895-7177(92)90163-F
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Increasing numbers of biologists and institutes are becoming involved in taxonomic database projects that for good practical and historical reasons use different hardware, software and data structures. A large number of diverse application programs, mostly with incompatible data formats, is in use by biologists for different purposes. Translating data from one program, project or format to another and developing special-purpose or 'one-off' translation software is becoming a rapidly growing burden on the biological community. A database file exchange medium to handle diverse classes of biological data is required for file transfers between different database projects. 'XDF' (the Exchange Data Format) is such a medium. Data sets prepared in XDF consist of text files that are effectively independent of any particular project. XDF is a high-level language for describing biological data, with its own syntax and command vocabulary, analogous to the high-level programming languages used to describe software algorithms. XDF files may be generated and read automatically by programs to transfer large amounts of data between sophisticated databases. Alternatively, biologists unfamiliar with the terminology, data rules and syntax of the data format required for a particular application program or database can use a text editor to create an XDF data file. We hope the existence of XDF will encourage the development of more sophisticated general-purpose programs for interactive biological data entry. XDF is being used initially with numeric and structured textual descriptive data, but is designed to be extensible to other classes of biological data such as images. Provision is made within XDF for predefined standard definitions of the common core elements of biological data sets such as the taxonomic hierarchy, biological nomenclature, descriptive material and bibliography. Using these, XDF can be used to define specialist transfer formats for particular application areas while providing strict control of the data types and data definitions used.
引用
收藏
页码:199 / 223
页数:25
相关论文
共 50 条
  • [21] PUTTING LEGACY DATA ON THE WEB - A REPOSITORY DEFINITION LANGUAGE
    SHKLAR, L
    SHAH, K
    BASU, C
    COMPUTER NETWORKS AND ISDN SYSTEMS, 1995, 27 (06): : 939 - 951
  • [22] FORMAL SYSTEM OF DEFINITION OF DATA SETS AS A HIGH-LEVEL TOOL OF DATA EXTRACTION
    NURIEV, RM
    CYBERNETICS, 1987, 23 (01): : 21 - 30
  • [23] ASPECTS OF A LANGUAGE FOR UTILIZATION OF LARGE DATA-SETS
    GWEHENBERGER, G
    SCHULTHESS, W
    MANAGEMENT INFORMATICS, 1974, 3 (01): : 37 - 43
  • [24] Data management and extraction of biological information from large data sets
    Mount, David
    IN VITRO CELLULAR & DEVELOPMENTAL BIOLOGY-ANIMAL, 2008, 44 : S3 - S4
  • [25] Performance of an ensemble clustering algorithm on biological data sets
    Pirim, Harun
    Gautam, Dilip
    Bhowmik, Tanmay
    Perkins, Andy D.
    Ekşioglu, Burak
    Alkan, Ahmet
    Mathematical and Computational Applications, 2011, 16 (01) : 87 - 96
  • [26] Classification using small fuzzy biological data sets
    Diederich, J
    Fortuner, R
    1998 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AT THE IEEE WORLD CONGRESS ON COMPUTATIONAL INTELLIGENCE - PROCEEDINGS, VOL 1-2, 1998, : 1429 - 1434
  • [27] CHILDES - CHILD LANGUAGE DATA EXCHANGE SYSTEM
    不详
    PSYCHOLOGIE-SCHWEIZERISCHE ZEITSCHRIFT FUR PSYCHOLOGIE UND IHRE ANDWENDUNGEN, 1986, 45 (04): : 322 - 323
  • [28] Datalog as a Query Language for Data Exchange Systems
    Arenas, Marcelo
    Barcelo, Pablo
    Reutter, Juan L.
    DATALOG RELOADED: FIRST INTERNATIONALWORKSHOP, DATALOG 2010, 2011, 6702 : 302 - 320
  • [29] THE CHILD LANGUAGE DATA EXCHANGE SYSTEM - AN UPDATE
    MACWHINNEY, B
    SNOW, C
    JOURNAL OF CHILD LANGUAGE, 1990, 17 (02) : 457 - 472
  • [30] A STORAGE STRUCTURE DEFINITION LANGUAGE FOR CODASYL DATA-BASES
    NEGRI, M
    ZICARI, R
    INFORMATION SYSTEMS, 1984, 9 (01) : 59 - 68