Prunus avium Whole Genome Assembly v1.0 & Annotation v1 (v1.0.a1)
About the Assembly
The sweet cherry (Prunus avium) genome sequences were determined using next-generation sequencing technology. The total length of the assembled sequences was 272.4 Mb, consisting of 10,148 scaffold sequences with an N50 length of 219.6 kb. The sequences covered 77.8% of the 352.9 Mb sweet cherry genome, as estimated by k-mer analysis, and included >96.0% of the core eukaryotic genes. 43,349 complete and partial protein-encoding genes were predicted. A high-density consensus map with 2,382 loci was constructed using double-digest restriction site–associated DNA sequencing. Comparing the genetic maps of sweet cherry and peach revealed high synteny between the two genomes; thus the scaffolds were integrated into pseudomolecules using map- and synteny-based strategies. Whole-genome resequencing of six modern cultivars found 1,016,866 SNPs and 162,402 insertions/deletions, out of which 0.7% were deleterious. The sequence variants, as well as simple sequence repeats, can be used as DNA markers.
Shirasawa K, Isuzugawa K, Ikenaga M, Saito Y, Yamamoto T, Hirakawa H, Isobe S (2017)
The Prunus avium v1.0.a1 genome assembly files are available in FASTA and GFF3 formats. There are a total of 9 pseudomolecules and 10,148 scaffolds in this assembly.
The Prunus avium v1.0.a1 genome gene prediction files are available in FASTA and GFF3 formats.
Homology of the Prunus avium v1.0.a1 transcript swas determined by pairwise sequence comparison using the blastx algorithm against various protein databases. The results are available for download in Excel format. An expectation value cutoff less than 1e-6 was used for Arabidoposis proteins and 1e-9 for the NCBI nr, Uniprot SwissProt, and Uniprot TrEMBL databases.
Functional annotation for the Prunus avium v1.0.a1 genome are available for download below. The Prunus avium transcripts were analyzed using InterProScan in order to assign InterPro domains and Gene Ontology (GO) terms. Pathways analysis was performed using the KEGG Automatic Annotation Server (KAAS).
All assembly and annotation files are available for download by selecting the desired data type in the left-hand "Resources" side bar. Each data type page will provide a description of the available files and links to download. Alternatively, you can browse all available files on the FTP repository.
The Prunus avium v1.0.a1 genome repeat files are available in GFF3 formats.
The Prunus avium v1.0.a1 genome markers and DNA polymorphisms are downloadable as BED and VCF files (provided by Shirasawa et. al.)