A chromosome-length genome assembly and annotation of blackberry (Rubus argutus, cv. 'Hillquist').

Publication Overview
TitleA chromosome-length genome assembly and annotation of blackberry (Rubus argutus, cv. 'Hillquist').
AuthorsBrůna T, Aryal R, Dudchenko O, Sargent DJ, Mead D, Buti M, Cavallini A, Hytönen T, Andrés J, Pham M, Weisz D, Mascagni F, Usai G, Natali L, Bassil N, Fernandez GE, Lomsadze A, Armour M, Olukolu B, Poorten T, Britton C, Davik J, Ashrafi H, Aiden EL, Borodovsky M, Worthington M
TypeJournal Article
Journal NameG3 (Bethesda, Md.)
Year2022
CitationBrůna T, Aryal R, Dudchenko O, Sargent DJ, Mead D, Buti M, Cavallini A, Hytönen T, Andrés J, Pham M, Weisz D, Mascagni F, Usai G, Natali L, Bassil N, Fernandez GE, Lomsadze A, Armour M, Olukolu B, Poorten T, Britton C, Davik J, Ashrafi H, Aiden EL, Borodovsky M, Worthington M. A chromosome-length genome assembly and annotation of blackberry (Rubus argutus, cv. 'Hillquist').. G3 (Bethesda, Md.). 2022 Nov 04.

Abstract

Blackberries (Rubus spp.) are the fourth most economically important berry crop worldwide. Genome assemblies and annotations have been developed for Rubus species in subgenus Idaeobatus, including black raspberry (R. occidentalis), red raspberry (R. idaeus), and R. chingii, but very few genomic resources exist for blackberries and their relatives in subgenus Rubus. Here we present a chromosome-length assembly and annotation of the diploid blackberry germplasm accession 'Hillquist' (R. argutus). 'Hillquist' is the only known source of primocane-fruiting (annual-fruiting) in tetraploid fresh-market blackberry breeding programs and is represented in the pedigree of many important cultivars worldwide. The 'Hillquist' assembly, generated using PacBio long reads scaffolded with Hi-C sequencing, consisted of 298 Mb, of which 270 Mb (90%) was placed on seven chromosome-length scaffolds with an average length of 38.6 Mb. Approximately 52.8% of the genome was composed of repetitive elements. The genome sequence was highly collinear with a novel maternal haplotype-resolved linkage map of the tetraploid blackberry selection A-2551TN and genome assemblies of R. chingii and red raspberry. A total of 38,503 protein-coding genes were predicted using the assembly and Iso-Seq and RNA-seq data, of which 72% were functionally annotated. Eighteen flowering gene homologs within a previously mapped locus aligning to an 11.2 Mb region on chromosome Ra02 were identified as potential candidate genes for primocane-fruiting. The utility of the 'Hillquist' genome has been demonstrated here by the development of the first genotyping-by-sequencing based linkage map of tetraploid blackberry and the identification of possible candidate genes for primocane-fruiting. This chromosome-length assembly will facilitate future studies in Rubus biology, genetics, and genomics and strengthen applied breeding programs.