DRDB: An Online Date Palm Genomic Resource Database

被引:8
|
作者
He, Zilong [1 ]
Zhang, Chengwei [1 ,2 ,3 ]
Liu, Wanfei [1 ,4 ,5 ,6 ]
Lin, Qiang [1 ,5 ,6 ]
Wei, Ting [1 ]
Aljohi, Hasan A. [5 ,6 ]
Chen, Wei-Hua [2 ]
Hu, Songnian [1 ,3 ]
机构
[1] Chinese Acad Sci, Beijing Inst Genom, CAS Key Lab Genome Sci & Informat, Beijing, Peoples R China
[2] Huazhong Univ Sci & Technol, Coll Life Sci & Technol, Wuhan, Hubei, Peoples R China
[3] Univ Chinese Acad Sci, Beijing, Peoples R China
[4] Grail Sci Co Ltd, Shenyang, Liaoning, Peoples R China
[5] King Abdulaziz City Sci & Technol, Joint Ctr Genom Res, Riyadh, Saudi Arabia
[6] Chinese Acad Sci, Riyadh, Saudi Arabia
来源
基金
中国国家自然科学基金;
关键词
date palm; short sequence repeat; single nucleotide polymorphism; genome variation; cultivar classification; PHOENIX-DACTYLIFERA L; MICROSATELLITE MARKERS; DISCOVERY; PROGRAM;
D O I
10.3389/fpls.2017.01889
中图分类号
Q94 [植物学];
学科分类号
071001 ;
摘要
Background: Date palm (Phoenix dactylifera L.) is a cultivated woody plant with agricultural and economic importance in many countries around the world. With the advantages of next generation sequencing technologies, genome sequences for many date palm cultivars have been released recently. Short sequence repeat (SSR) and single nucleotide polymorphism (SNP) can be identified from these genomic data, and have been proven to be very useful biomarkers in plant genome analysis and breeding. Results: Here, we first improved the date palm genome assembly using 130X of HiSeq data generated in our lab. Then 246,445 SSRs (214,901 SSRs and 31,544 compound SSRs) were annotated in this genome assembly; among the SSRs, mononucleotide SSRs (58.92%) were the most abundant, followed by di-(29.92%), tri- (8.14%), tetra-(2.47%), penta-(0.36%), and hexa-nucleotide SSRs (0.19%). The high-quality PCR primer pairs were designed for most (174,497; 70.81% out of total) SSRs. We also annotated 6,375,806 SNPs with raw read depth >= 3 in 90% cultivars. To further reduce false positive SNPs, we only kept 5,572,650 (87.40% out of total) SNPs with at least 20% cultivars support for downstream analyses. The high-quality PCR primer pairs were also obtained for 4,177,778 (65.53%) SNPs. We reconstructed the phylogenetic relationships among the 62 cultivars using these variants and found that they can be divided into three clusters, namely North Africa, Egypt - Sudan, and Middle East - South Asian, with Egypt -Sudan being the admixture of North Africa and Middle East - South Asian cultivars; we further confirmed these clusters using principal component analysis. Moreover, 34,346 SSRs and 4,177,778 SNPs with PCR primers were assigned to shared cultivars for cultivar classification and diversity analysis. All these SSRs, SNPs and their classification are available in our database, and can be used for cultivar identification, comparison, and molecular breeding. Conclusion: DRDB is a comprehensive genomic resource database of date palm. It can serve as a bioinformatics platform for date palm genomics, genetics, and molecular breeding. DRDB is freely available at http://drdb.big.ac.cn/home.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] The Littorina sequence database (LSD) - an online resource for genomic data
    Canback, Bjorn
    Andre, Carl
    Galindo, Juan
    Johannesson, Kerstin
    Johansson, Tomas
    Panova, Marina
    Tunlid, Anders
    Butlin, Roger
    MOLECULAR ECOLOGY RESOURCES, 2012, 12 (01) : 142 - 148
  • [2] Genomic Insights into Date Palm Origins
    Gros-Balthazard, Muriel
    Hazzouri, Khaled Michel
    Flowers, Jonathan Mark
    GENES, 2018, 9 (10)
  • [3] Amaranth Genomic Resource Database: an integrated database resource of Amaranth genes and genomics
    Singh, Akshay
    Mahato, Ajay Kumar
    Maurya, Avantika
    Rajkumar, S.
    Singh, A. K.
    Bhardwaj, Rakesh
    Kaushik, S. K.
    Kumar, Sandeep
    Gupta, Veena
    Singh, Kuldeep
    Singh, Rakesh
    FRONTIERS IN PLANT SCIENCE, 2023, 14
  • [4] The Peptaibiotics Database - A Comprehensive Online Resource
    Neumann, Nora K. N.
    Stoppacher, Norbert
    Zeilinger, Susanne
    Degenkolb, Thomas
    Bruecknerd, Hans
    Schuhmacher, Rainer
    CHEMISTRY & BIODIVERSITY, 2015, 12 (05) : 743 - 751
  • [5] An online potato pedigree database resource
    Van Berloo R.
    Hutten R.C.B.
    Van Eck H.J.
    Visser R.G.F.
    Potato Research, 2007, 50 (1) : 45 - 57
  • [6] GENOMIC AND EVOLUTIONARY DIVERSITY OF LTR RETROTRANSPOSONS IN DATE PALM (PHOENIX DACTYLIFERA)
    Nouroz, Faisal
    Mukaramin
    PAKISTAN JOURNAL OF BOTANY, 2019, 51 (05) : 1637 - 1644
  • [7] SAGER: a database of Symbiodiniaceae and Algal Genomic Resource
    Yu, Liying
    Li, Tangcheng
    Li, Ling
    Lin, Xin
    Li, Hongfei
    Liu, Chichi
    Guo, Chentao
    Lin, Senjie
    DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION, 2020,
  • [8] BBGD: an online database for blueberry genomic data
    Alkharouf, Nadim W.
    Dhanaraj, Anik L.
    Naik, Dhananjay
    Overall, Chris
    Matthews, Benjamin F.
    Rowland, Lisa J.
    BMC PLANT BIOLOGY, 2007, 7 (1)
  • [9] BBGD: an online database for blueberry genomic data
    Nadim W Alkharouf
    Anik L Dhanaraj
    Dhananjay Naik
    Chris Overall
    Benjamin F Matthews
    Lisa J Rowland
    BMC Plant Biology, 7
  • [10] SGR: an online genomic resource for the woodland strawberry
    Darwish, Omar
    Slovin, Janet P.
    Kang, Chunying
    Hollender, Courtney A.
    Geretz, Aviva
    Houston, Sam
    Liu, Zhongchi
    Alkharouf, Nadim W.
    BMC PLANT BIOLOGY, 2013, 13