Fragaria x ananassa Benihoppe Genome v1.0 Assembly & Annotation

Overview
Analysis Name: Fragaria x ananassa Benihoppe Genome v1.0 Assembly & Annotation
Method: Hifiasm (v0.16.1-r375), Juicer and 3D-DNA (na)
Source: PacBio HiFi, Hi-C and ultra-long ONT reads
Date performed: 2023-12-06

Publication:

Song, Y., Peng, Y., Liu, L., Li, G., Zhao, X., Wang, X., Cao, S., Muyle, A., Zhou, Y., & Zhou, H. (2023). Phased gap-free genome assembly of octoploid cultivated strawberry illustrates the genetic and epigenetic divergence among subgenomes. Horticulture Research, https://doi.org/10.1093/hr/uhad252

Abstract:

The genetic and epigenetic mechanisms underlying the coexistence and coordination of the four diverged subgenomes (ABCD) in octoploid strawberries (Fragaria x ananassa) remain poorly understood. In this study, we have assembled a haplotype-phased gap-free octoploid genome for the strawberry, which allowed us to uncover the sequence, structure, and epigenetic divergences among the subgenomes. The diploid progenitors of the octoploid strawberry, apart from subgenome A (Fragaria vesca), have been a subject of public controversy. Phylogenomic analyses revealed a close relationship between diploid species Fragaria iinumae and subgenomes B, C, and D. Subgenome A, closely related to F. vesca, retains the highest number of genes, exhibits the lowest content of transposable elements (TEs), experiences the strongest purifying selection, shows the lowest DNA methylation levels, and displays the highest expression level compared to the other three subgenomes. Transcriptome and DNA methylome analyses revealed that subgenome A-biased genes were enriched in fruit development biological processes. In contrast, although subgenomes B, C, and D contain equivalent amounts of repetitive sequences, they exhibit diverged methylation levels, particularly for TEs located near genes. Taken together, our findings provide valuable insights into the evolutionary patterns of subgenome structure, divergence and epigenetic dynamics in octoploid strawberries, which could be utilized in strawberry genetics and breeding research.

Table S1. Benchmarking universal single-copy orthologs (BUSCO) v5 analysis of genome assembly and transcripts of 'Benihoppe'

Category                               Genome assembly    Transcriptome assembly
Complete BUSCOs (C)                    1604 (99.4%)       1551 (96.0%)
Complete and single-copy BUSCOs (S)    29 (1.8%)          117 (7.2%)
Complete and duplicated BUSCOs (D)     1575 (97.6%)       1434 (88.8%)
Fragmented BUSCOs (F)                  3 (0.2%)           20 (1.2%)
Missing BUSCOs (M)                     7 (0.4%)           43 (2.8%)
Total BUSCO groups searched            1614 (100.0%)      1614 (100.0%)
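The BUSCO categories in Table S1 are related by simple arithmetic: complete BUSCOs are the sum of single-copy and duplicated ones (C = S + D), and C + F + M equals the total number of groups searched. The sketch below checks those relations using the counts from the table and recomputes the percentages. The function and variable names are illustrative, and recomputed percentages can differ from the published ones in the last decimal place because of rounding.

```python
# Counts copied from Table S1; percentages are recomputed here, not taken from the table.
TOTAL = 1614

busco = {
    "genome": {"S": 29, "D": 1575, "F": 3, "M": 7},
    "transcriptome": {"S": 117, "D": 1434, "F": 20, "M": 43},
}


def summarize(counts, total=TOTAL):
    """Return the complete count and per-category percentages for one BUSCO run."""
    complete = counts["S"] + counts["D"]
    # Every searched group must fall into exactly one of C, F or M.
    if complete + counts["F"] + counts["M"] != total:
        raise ValueError("BUSCO categories do not sum to the total")
    pct = {key: 100.0 * value / total for key, value in counts.items()}
    pct["C"] = 100.0 * complete / total
    return complete, pct


for name, counts in busco.items():
    complete, pct = summarize(counts)
    print(f"{name}: C={complete} ({pct['C']:.1f}%), D={counts['D']} ({pct['D']:.1f}%)")
```

The high share of duplicated BUSCOs in both columns is what one would expect when single-copy orthologs are searched in an octoploid genome with four subgenomes.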

Homology

Homology of the Fragaria x ananassa Benihoppe genome v1.0 proteins was determined by pairwise sequence comparison using the blastp algorithm against various protein databases. An expectation value cutoff of less than 1e-6 was used for the Arabidopsis proteins (Araport11, 2022-09), UniProtKB/SwissProt (Release 2023-07), and UniProtKB/TrEMBL (Release 2023-07) databases. The best hit reports are available for download in Excel format.
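As a rough illustration of the filtering described above, the sketch below keeps one best hit per query from BLAST tabular output while applying the 1e-6 expectation value cutoff. It assumes the search was saved in the standard 12-column tabular format (`-outfmt 6`) and that "best hit" means the highest bit score; neither detail is stated on this page, and the IDs in the example rows are made up.

```python
import csv
import io

EVALUE_CUTOFF = 1e-6  # cutoff stated in the text above


def best_hits(handle, cutoff=EVALUE_CUTOFF):
    """Return {query: (subject, evalue, bitscore)} for hits with evalue < cutoff.

    Assumes 12-column BLAST tabular rows; the best hit is taken to be the one
    with the highest bit score (an assumption, not the documented GDR rule).
    """
    best = {}
    for row in csv.reader(handle, delimiter="\t"):
        if len(row) < 12 or row[0].startswith("#"):
            continue  # skip comments and malformed rows
        query, subject = row[0], row[1]
        evalue, bitscore = float(row[10]), float(row[11])
        if evalue >= cutoff:
            continue
        if query not in best or bitscore > best[query][2]:
            best[query] = (subject, evalue, bitscore)
    return best


# Made-up rows for illustration only; IDs and scores are not real data.
example = io.StringIO(
    "prot1\tsubjA\t62.5\t200\t75\t0\t1\t200\t1\t200\t1e-50\t180\n"
    "prot1\tsubjB\t80.0\t210\t42\t0\t1\t210\t1\t210\t1e-90\t320\n"
    "prot2\tsubjC\t30.0\t50\t35\t0\t1\t50\t1\t50\t0.5\t25\n"
)
hits = best_hits(example)
print(hits)
```

Queries with no hit below the cutoff, such as `prot2` in the example, are simply absent from the result, which mirrors the split into "hit" and "noHit" FASTA files.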

Protein Homologs

Fragaria x ananassa Benihoppe v1.0 proteins with Arabidopsis (Araport11) homologs (EXCEL file) Fananassa_Benihoppe_v1.0_vs_arabidopsis.xlsx.gz
Fragaria x ananassa Benihoppe v1.0 proteins with Arabidopsis (Araport11) homologs (FASTA file) Fananassa_Benihoppe_v1.0_vs_arabidopsis_hit.fasta.gz
Fragaria x ananassa Benihoppe v1.0 proteins without Arabidopsis (Araport11) homologs (FASTA file) Fananassa_Benihoppe_v1.0_vs_arabidopsis_noHit.fasta.gz
Fragaria x ananassa Benihoppe v1.0 proteins with SwissProt homologs (EXCEL file) Fananassa_Benihoppe_v1.0_vs_swissprot.xlsx.gz
Fragaria x ananassa Benihoppe v1.0 proteins with SwissProt homologs (FASTA file) Fananassa_Benihoppe_v1.0_vs_swissprot_hit.fasta.gz
Fragaria x ananassa Benihoppe v1.0 proteins without SwissProt homologs (FASTA file) Fananassa_Benihoppe_v1.0_vs_swissprot_noHit.fasta.gz
Fragaria x ananassa Benihoppe v1.0 proteins with TrEMBL homologs (EXCEL file) Fananassa_Benihoppe_v1.0_vs_trembl.xlsx.gz
Fragaria x ananassa Benihoppe v1.0 proteins with TrEMBL homologs (FASTA file) Fananassa_Benihoppe_v1.0_vs_trembl_hit.fasta.gz
Fragaria x ananassa Benihoppe v1.0 proteins without TrEMBL homologs (FASTA file) Fananassa_Benihoppe_v1.0_vs_trembl_noHit.fasta.gz
Assembly

The Fragaria x ananassa Benihoppe genome v1.0 assembly file is available in FASTA format.

Downloads

Chromosomes (FASTA file) Fananassa_Benihoppe_v1.0.a1.fasta.gz
Chromosomes (Masked FASTA file) Fananassa_Benihoppe_masked.v1.0.a1.fasta.gz
Repeats (FASTA file) Fananassa_Benihoppe_v1.0.a1.fasta.gz
Gene Predictions

The Fragaria x ananassa Benihoppe genome v1.0 gene prediction files are available in GFF3 and FASTA format.

Downloads

Genes (GFF3 file) Fananassa_Benihoppe_v1.0.a1.genes.gff3.gz
Protein sequences (FASTA file) GDR_Fananassa_Benihoppe_v1.0.a1.pep.fasta.gz
CDS sequences (FASTA file) GDR_Fananassa_Benihoppe_v1.0.a1.cds.fasta.gz
cDNA sequences (FASTA file) GDR_Fananassa_Benihoppe_v1.0.a1.cDNA.fasta.gz
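For readers who want a quick look at the gene prediction file, the sketch below counts feature types (column 3) in a GFF3 stream. It relies only on the general GFF3 layout of nine tab-separated columns; which feature types this particular file contains is not stated here, and the helper names are illustrative.

```python
import gzip
from collections import Counter


def count_feature_types(lines):
    """Count GFF3 feature types (column 3) from an iterable of text lines."""
    counts = Counter()
    for line in lines:
        if line.startswith("##FASTA"):
            break  # an optional embedded sequence section follows; stop here
        if line.startswith("#") or not line.strip():
            continue  # skip comments, directives and blank lines
        fields = line.rstrip("\n").split("\t")
        if len(fields) == 9:
            counts[fields[2]] += 1
    return counts


def count_from_gzip(path):
    """Convenience wrapper for a gzip-compressed GFF3 file on disk."""
    with gzip.open(path, "rt") as handle:
        return count_feature_types(handle)
```

After downloading the file, a call such as `count_from_gzip("Fananassa_Benihoppe_v1.0.a1.genes.gff3.gz")` would return a `Counter` keyed by feature type.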
Functional Analysis

Functional annotations for the Fragaria x ananassa Benihoppe genome v1.0 are available for download below. The Fragaria x ananassa Benihoppe genome v1.0 proteins were analyzed using InterProScan to assign InterPro domains and Gene Ontology (GO) terms. Pathway analysis was performed using the KEGG Automatic Annotation Server (KAAS).

Downloads

GO assignments from InterProScan Fananassa_Benihoppe_v1.0_genes2GO.xlsx.gz
IPR assignments from InterProScan Fananassa_Benihoppe_v1.0_genes2IPR.xlsx.gz
Proteins mapped to KEGG Orthologs Fananassa_Benihoppe_v1.0_KEGG-orthologis.xlsx.gz
Proteins mapped to KEGG Pathways Fananassa_Benihoppe_v1.0_KEGG-pathways.xlsx.gz
Transcript Alignments
Transcript alignments were performed by the GDR Team of the Main Bioinformatics Lab at WSU. The alignment tool BLAT was used to map transcripts to the Fragaria x ananassa Benihoppe genome assembly. Alignments with at least 97% alignment length and 97% identity were retained. The available files are in GFF3 format.

Fragaria x ananassa GDR RefTrans v1 Fananassa_Benihoppe_v1.0_f.x.ananassa_GDR_reftransV1
Prunus avium GDR RefTrans v1 Fananassa_Benihoppe_v1.0_p.avium_GDR_reftransV1
Prunus persica GDR RefTrans v1 Fananassa_Benihoppe_v1.0_p.persica_GDR_reftransV1
Rosa GDR RefTrans v1 Fananassa_Benihoppe_v1.0_rosa_GDR_reftransV1
Rubus GDR RefTrans v2 Fananassa_Benihoppe_v1.0_rubus_GDR_reftransV2
Malus x domestica GDR RefTrans v1 Fananassa_Benihoppe_v1.0_m.x.domestica_GDR_reftransV1
Pyrus GDR RefTrans v1 Fananassa_Benihoppe_v1.0_pyrus_GDR_reftransV1
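The 97% alignment length and 97% identity thresholds described for the transcript alignments could be applied to BLAT output roughly as sketched below. This assumes BLAT's 21-column PSL format and uses simplified definitions: coverage is the aligned span of the query divided by the query size, and identity is matching bases divided by aligned bases. These definitions are assumptions for illustration; the exact formulas used to build the GFF3 files are not given here, and this is not BLAT's own percent-identity score.

```python
def psl_passes(line, min_cov=0.97, min_ident=0.97):
    """Return True if one PSL line meets both thresholds (simplified definitions)."""
    f = line.rstrip("\n").split("\t")
    if len(f) < 21 or not f[0].isdigit():
        return False  # header or malformed line
    matches, mismatches, rep_matches = int(f[0]), int(f[1]), int(f[2])
    q_size, q_start, q_end = int(f[10]), int(f[11]), int(f[12])
    aligned = matches + mismatches + rep_matches
    if aligned == 0 or q_size == 0:
        return False
    identity = (matches + rep_matches) / aligned   # assumed definition
    coverage = (q_end - q_start) / q_size          # assumed definition
    return coverage >= min_cov and identity >= min_ident


# Made-up PSL record for illustration only (not real alignment data).
good = "\t".join([
    "980", "10", "0", "0", "0", "0", "0", "0", "+",
    "transcript1", "1000", "5", "995",
    "chr1", "100000", "100", "1090",
    "1", "990,", "5,", "100,",
])
print(psl_passes(good))
```

Lowering `min_cov` or `min_ident` would keep more divergent alignments, which may matter for the transcript sets from other Rosaceae genera.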