Pyrus communis Bartlett DH Genome v2.0
Gareth Linsmith, Stephane Rombauts, Sara Montanari, Cecilia H. Deng, Jean-Marc Celton, Philippe Guérif, Chang Liu, Rolf Lohaus, Jason D. Zurn, Alessandro Cestaro, Nahla V. Bassil, Linda V. Bakker, Elio Schijlen, Susan E. Gardiner, Yves Lespinasse, Charles-Eric Durel, Riccardo Velasco, David Neale, David Chagné, Yves Van de Peer, Michela Troggio, Luca Bianco. Pseudo-chromosome length genome assembly of a double haploid ‘Bartlett’ pear (Pyrus communis L.). doi: https://doi.org/10.1101/651778
We report an improved assembly and scaffolding of the European pear (Pyrus communis L.) genome (referred to as BartlettDHv2.0), obtained using a combination of Pacific Biosciences RSII Long read sequencing (PacBio), Bionano optical mapping, chromatin interaction capture (Hi-C), and genetic mapping. A total of 496.9 million bases (Mb) corresponding to 97% of the estimated genome size were assembled into 494 scaffolds. Hi-C data and a high-density genetic map allowed us to anchor and orient 87% of the sequence on the 17 chromosomes of the pear genome. About 50% (247 Mb) of the genome consists of repetitive sequences. Comparison with previous assemblies of Pyrus communis and Pyrus x bretschneideri confirmed the presence of 37,445 protein-coding genes, which is 13% fewer than previously predicted.
Homology of the Pyrus communis BartlettDH genome v2.0 proteins was determined by pairwise sequence comparison using the blastp algorithm against various protein databases. An expectation value cutoff less than 1e-9 was used for the NCBI nr (Release 2018-05) and 1e-6 for the Arabidoposis proteins (Araport11), UniProtKB/SwissProt (Release 2019-01), and UniProtKB/TrEMBL (Release 2019-01) databases. The best hit reports are available for download in Excel format.
All assembly and annotation files are available for download by selecting the desired data type in the left-hand side bar. Each data type page will provide a description of the available files and links to download.
The Pyrus communis Bartlett DH v2.0 genome gene prediction files are available in FASTA and GFF3 formats.
Functional annotation files for the Pyrus communis Bartlett DH genome v2.0 are available for download below. The Pyrus communis DH genome v2.0 proteins were analyzed using InterProScan in order to assign InterPro domains and Gene Ontology (GO) terms. Pathways analysis was performed using the KEGG Automatic Annotation Server (KAAS).
Transcript alignments were performed by the GDR Team of Main Bioinformatics Lab at WSU. The alignment tool 'BLAT' was used to map transcripts to the Pyrus communis genome assembly. Alignments with an alignment length of 97% and 97% identify were preserved. The available files are in GFF3 format.