NCBI Rosaceae gene and mRNA sequences
NCBI Rosaceae gene and mRNA sequences is composed of gene-coding sequence data, not from the whole genome sequences, downloaded from NCBI nr database. Distinct gene symbols for each species are stored as GDR Gene Database dataset and linked to multiple gene/mRNA sequences from NCBI Rosaceae gene and mRNA sequence dataset. Predicted genes from whole genome assemblies are also linked when users submit those data.
Initial Download: Genbank data were downloaded from NCBI nucleotide database on Dec 31 2014. The Genbank file was then parsed into tab-delimited files, and loaded into GDR using the Tripal Genbank Parser developed at the Main Bioinformatics Lab.
2016 Update: the NCBI Data was updated on Apr 5th 2016. This update included 2,094 genes as well as 6,126 mRNA and 6,126 polypeptide sequences.
2021 Update: the NCBI Data was updated on Jun 3rd 2021. This update included 43,692 genes as well as 35,284 mRNA and 35,279 polypeptide sequences.