Malus x domestica Golden Delicious GDT2T Genome v1.0 Assembly & Annotation

Overview
Analysis NameMalus x domestica Golden Delicious GDT2T Genome v1.0 Assembly & Annotation
MethodHifiasm, Juicer, 3D-DNA, NextDenovo (v.0.16, v1.5, v180922, v2.4.0)
SourcePacBio HiFi reads,ONT reads and Hi-C reads
Date performed2024-07-09

Publication:

Ying Su, Xuanwen Yang, Yuwei Wang, Jialei Li, Qiming Long, Shuo Cao, Xu Wang, Zhenya Liu, Siyang Huang, Zhuyifu Chen, Yanling Peng, Fan Zhang, Hui Xue, Xuejing Cao, Mengyan Zhang, Gulbar Yisilam, Zhenzhou Chu, Yuan Gao, Yongfeng Zhou, Zhongjie Liu, Hua Xiao, Xinmin Tian, Phased telomere-to-telomere reference genome and pangenome reveal an expansion of resistance genes during apple domestication, Plant Physiology, 2024;, kiae258, https://doi.org/10.1093/plphys/kiae258

Abstract

The cultivated apple (Malus domestica Borkh.) is a cross-pollinated perennial fruit tree of great economic importance. Earlier versions of apple reference genomes were unphased, fragmented, and lacked comprehensive insights into the apple's highly heterozygous genome, which impeded advances in genetic studies and breeding programs. In this study, we assembled a haplotype-resolved telomere-to-telomere (T2T) reference genome for the diploid apple cultivar Golden Delicious. Subsequently, we constructed a pangenome based on 12 assemblies from wild and cultivated species to investigate the dynamic changes of functional genes. Our results revealed the gene gain and loss events during apple domestication. Compared with cultivated species, more gene families in wild species were significantly enriched in oxidative phosphorylation, pentose metabolic process, responses to salt, and abscisic acid biosynthesis process. Our analyses also demonstrated a higher prevalence of different types of resistance gene analogs (RGAs) in cultivars than their wild relatives, partially attributed to segmental and tandem duplication events in certain RGAs classes. Structural variations, mainly deletions and insertions, have affected the presence and absence of TIR-NB-ARC-LRR, NB-ARC-LRR, and CC-NB-ARC-LRR genes. Additionally, hybridization/introgression from wild species has also contributed to the expansion of resistance genes in domesticated apples. Our haplotype-resolved T2T genome and pangenome provide important resources for genetic studies of apples, emphasizing the need to study the evolutionary mechanisms of resistance genes in apple breeding.

Homology

Homology of the Malus domestica Golden Delicious GDT2T genome v1.0 proteins was determined by pairwise sequence comparison using the blastp algorithm against various protein databases. An expectation value cutoff less than 1e-6  for the Arabidoposis proteins (Araport11, 2022-09), UniProtKB/SwissProt (Release 2024-03), and UniProtKB/TrEMBL (Release 2024-03) databases. The best hit reports are available for download in Excel format. 

Protein Homologs

M. domestica Golden Delicious GDT2T v1.0 proteins with arabidopsis (Araport11) homologs (EXCEL file) GDT2T_v1.0_vs_arabidopsis.xlsx.gz
M. domestica Golden Delicious GDT2T v1.0 proteins with arabidopsis (Araport11) (FASTA file) GDT2T_v1.0_vs_arabidopsis_hit.fasta.gz
M. domestica Golden Delicious GDT2T v1.0 proteins without arabidopsis (Araport11) (FASTA file) GDT2T_v1.0_vs_arabidopsis_noHit.fasta.gz
M. domestica Golden Delicious GDT2T v1.0 proteins with SwissProt homologs (EXCEL file) GDT2T_v1.0_vs_swissprot.xlsx.gz
M. domestica Golden Delicious GDT2T v1.0 proteins with SwissProt (FASTA file) GDT2T_v1.0_vs_swissprot_hit.fasta.gz
M. domestica Golden Delicious GDT2T v1.0 proteins without SwissProt (FASTA file) GDT2T_v1.0_vs_swissprot_noHit.fasta.gz
M. domestica Golden Delicious GDT2T v1.0 proteins with TrEMBL homologs (EXCEL file) GDT2T_v1.0_vs_trembl.xlsx.gz
M. domestica Golden Delicious GDT2T v1.0 proteins with TrEMBL (FASTA file) GDT2T_v1.0_vs_trembl_hit.fasta.gz
M. domestica Golden Delicious GDT2T v1.0 proteins without TrEMBL (FASTA file) GDT2T_v1.0_vs_trembl_noHit.fasta.gz
Assembly

The Malus x domestica Golden Delicious GDT2T Genome v1.0 assembly files are available in FASTA format.

Downloads

Chromosomes (haplome A) (FASTA file) Golden_Delicious_GDT2T_HAP1_v1.0.a1.fasta.gz
Chromosomes (haplome B) (FASTA file) Golden_Delicious_GDT2T_HAP2_v1.0.a1.fasta.gz
Repeats (haplome A) (FASTA file) Golden_Delicious_GDT2T_HAP1_v1.0.a1.TElib.fasta.gz
Repeats (haplome A) (GFF3 file) Golden_Delicious_GDT2T_HAP2_v1.0.a1.TEanno.gff3.gz
Repeats (haplome B) (FASTA file) Golden_Delicious_GDT2T_HAP2_v1.0.a1.TElib.fasta.gz
Repeats (haplome B) (GFF3 file) Golden_Delicious_GDT2T_HAP1_v1.0.a1.TEanno.gff3.gz
Gene Predictions

The Malus x domestica Golden Delicious GDT2T Genome v1.0 gene prediction file are available in GFF3 and FASTA format.

Downloads

Genes (haplome A) (GFF3 file) Golden_Delicious_GDT2T_HAP1_v1.0.a1.genes.gff3.gz
Genes (haplome B) (GFF3 file) Golden_Delicious_GDT2T_HAP2_v1.0.a1.genes.gff3.gz
Protein sequences (haplome A) (FASTA file) Golden_Delicious_GDT2T_HAP1_v1.0.a1.protein.fa.gz
Protein sequences (haplome B) (FASTA file) Golden_Delicious_GDT2T_HAP2_v1.0.a1.protein.fa.gz
CDS sequences (haplome A) (FASTA file) Golden_Delicious_GDT2T_HAP1_v1.0.a1.cds.fa.gz
CDS sequences (haplome B) (FASTA file) Golden_Delicious_GDT2T_HAP2_v1.0.a1.cds.fa.gz
mRNA sequences (haplome A) (FASTA file) Golden_Delicious_GDT2T_HAP1_v1.0.a1.trans.fa.gz
mRNA sequences (haplome B) (FASTA file) Golden_Delicious_GDT2T_HAP2_v1.0.a1.trans.fa.gz
Functional Analysis

Functional annotation for the Malus x domesticaa Golden Delicious GDT2T genome v1.0 are available for download below. The M. domestica Golden Delicious GDT2T genome v1.0 proteins were analyzed using InterProScan in order to assign InterPro domains and Gene Ontology (GO) terms. Pathways analysis was performed using the KEGG Automatic Annotation Server (KAAS).

Downloads

GO assignments from InterProScan GDT2T_v1.0_genes2GO.xlsx.gz
IPR assignments from InterProScan GDT2T_v1.0_genes2IPR.xlsx.gz
Proteins mapped to KEGG Orthologs GDT2T_v1.0_KEGG-orthologis.xlsx.gz
Proteins mapped to KEGG Pathways GDT2T_v1.0_KEGG-pathways.xlsx.gz
Transcript Alignments
Transcript alignments were performed by the GDR Team of Main Bioinformatics Lab at WSU. The alignment tool 'BLAT' was used to map transcripts to the Prunus domestica Golden Delicious GDT2T genome assembly. Alignments with an alignment length of 97% and 97% identify were preserved. The available files are in GFF3.

 

Fragaria x ananassa GDR RefTrans v1 GDT2T_v1.0_f.x.ananassa_GDR_reftransV1
P. armeniaca GDR RefTrans v1 GDT2T_v1.0_p.avium_GDR_reftransV1
Prunus persica GDR RefTrans v1 GDT2T_v1.0_p.persica_GDR_reftransV1
Rosa GDR RefTrans v1 GDT2T_v1.0_rosa_GDR_reftransV1
Rubus GDR RefTrans v2 GDT2T_v1.0_rubus_GDR_reftransV2
Malus_x_domestica GDR RefTrans v1 GDT2T_v1.0_m.x.domestica_GDR_reftransV1
Pyrus GDR RefTrans v1 GDT2T_v1.0_pyrus_GDR_reftransV1