KAIKObase: An integrated silkworm genome database and data mining tool

被引:100
|
作者
Shimomura, Michihiko [2 ]
Minami, Hiroshi [2 ]
Suetsugu, Yoshitaka [1 ]
Ohyanagi, Hajime [2 ]
Satoh, Chikatada [2 ]
Antonio, Baltazar [3 ]
Nagamura, Yoshiaki [3 ]
Kadono-Okuda, Keiko [1 ]
Kajiwara, Hideyuki [3 ]
Sezutsu, Hideki [1 ]
Nagaraju, Javaregowda [4 ]
Goldsmith, Marian R. [5 ]
Xia, Qingyou [6 ]
Yamamoto, Kimiko [1 ]
Mita, Kazuei [1 ]
机构
[1] Natl Inst Agrobiol Sci, Tsukuba, Ibaraki 3058634, Japan
[2] Mitsubishi Space Software Co Ltd, Tsukuba, Ibaraki 3050032, Japan
[3] Natl Inst Agrobiol Sci, Tsukuba, Ibaraki 3058602, Japan
[4] Ctr DNA Fingerprinting & Diagnost, Hyderabad 500001, Andhra Pradesh, India
[5] Univ Rhode Isl, Dept Biol Sci, Kingston, RI 02881 USA
[6] Chongqing Univ, Inst Agr & Life Sci, Chongqing 400030, Peoples R China
来源
BMC GENOMICS | 2009年 / 10卷
关键词
BOMBYX-MORI; SEQUENCE; MAP;
D O I
10.1186/1471-2164-10-486
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Background: The silkworm, Bombyx mori, is one of the most economically important insects in many developing countries owing to its large-scale cultivation for silk production. With the development of genomic and biotechnological tools, B. mori has also become an important bioreactor for production of various recombinant proteins of biomedical interest. In 2004, two genome sequencing projects for B. mori were reported independently by Chinese and Japanese teams; however, the datasets were insufficient for building long genomic scaffolds which are essential for unambiguous annotation of the genome. Now, both the datasets have been merged and assembled through a joint collaboration between the two groups. Description: Integration of the two data sets of silkworm whole-genome-shotgun sequencing by the Japanese and Chinese groups together with newly obtained fosmid-and BAC-end sequences produced the best continuity (similar to 3.7 Mb in N50 scaffold size) among the sequenced insect genomes and provided a high degree of nucleotide coverage (88%) of all 28 chromosomes. In addition, a physical map of BAC contigs constructed by fingerprinting BAC clones and a SNP linkage map constructed using BAC-end sequences were available. In parallel, proteomic data from two-dimensional polyacrylamide gel electrophoresis in various tissues and developmental stages were compiled into a silkworm proteome database. Finally, a Bombyx trap database was constructed for documenting insertion positions and expression data of transposon insertion lines. Conclusion: For efficient usage of genome information for functional studies, genomic sequences, physical and genetic map information and EST data were compiled into KAIKObase, an integrated silkworm genome database which consists of 4 map viewers, a gene viewer, and sequence, keyword and position search systems to display results and data at the level of nucleotide sequence, gene, scaffold and chromosome. Integration of the silkworm proteome database and the Bombyx trap database with KAIKObase led to a high-grade, user-friendly, and comprehensive silkworm genome database which is now available from URL: http://sgp.dna.affrc.go.jp/KAIKObase/.
引用
收藏
页数:8
相关论文
共 50 条
  • [31] Web Database Based on Data Mining
    Yang-bo, Wu
    [J]. INFORMATION COMPUTING AND APPLICATIONS, ICICA 2013, PT II, 2013, 392 : 76 - 84
  • [32] Data mining with a distributed deductive database
    Maskarinec, M
    Neumann, K
    [J]. INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL I AND II, 1999, : 115 - 121
  • [33] Data mining: a database perspective.
    Sousa, MS
    Mattoso, MLQ
    Ebecken, NFF
    [J]. DATA MINING, 1998, : 413 - 431
  • [34] Spatial data mining: A database approach
    Ester, M
    Kriegel, HP
    Sander, J
    [J]. ADVANCES IN SPATIAL DATABASES, 1997, 1262 : 47 - 66
  • [35] Database compression with data mining methods
    Goh, CL
    Aisaka, K
    Tsukamoto, H
    Nishio, S
    [J]. INFORMATION ORGANIZATION AND DATABASES: FOUNDATIONS OF DATA ORGANIZATION, 2000, 579 : 177 - 190
  • [36] A tool for data mining support
    Hubal, M
    Bednár, P
    [J]. INTELLIGENT TECHNOLOGIES - THEORY AND APPLICATIONS: NEW TRENDS IN INTELLIGENT TECHNOLOGIES, 2002, 76 : 196 - 200
  • [37] A Database for Data Mining Applications in Astronomy
    McConnell, S.
    Henry, G.
    Sturgeon, R.
    Hurley, R.
    [J]. ASTRONOMICAL DATA ANALYSIS SOFTWARE AND SYSTEMS XX, 2011, 442 : 529 - 532
  • [38] A data mining approach to database compression
    Lee, Chin-Feng
    Changchien, S. Wesley
    Wang, Wei-Tse
    Shen, Jau-Ji
    [J]. INFORMATION SYSTEMS FRONTIERS, 2006, 8 (03) : 147 - 161
  • [39] A data mining approach to database compression
    Chin-Feng Lee
    S. Wesley Changchien
    Wei-Tse Wang
    Jau-Ji Shen
    [J]. Information Systems Frontiers, 2006, 8 : 147 - 161
  • [40] Data mining on parallel database systems
    Sousa, M
    Mattoso, M
    Ebecken, N
    [J]. INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED PROCESSING TECHNIQUES AND APPLICATIONS, VOLS I-IV, PROCEEDINGS, 1998, : 1147 - 1154