Malus x domestica 'MM106' Whole Genome v1.0.a2 Assembly & Annotation

Overview
Analysis NameMalus x domestica 'MM106' Whole Genome v1.0.a2 Assembly & Annotation
MethodHifiasm (v0.15_r334)
SourcePacbio v104.0 M106 reads
Date performed2023-03-08

Publication

Li, W., Chu, C., Li, H. et al. Near-gapless and haplotype-resolved apple genomes provide insights into the genetic basis of rootstock-induced dwarfing. Nat Genet (2024) https://doi.org/10.1038/s41588-024-01657-2

Description

Dwarfing rootstocks have transformed the production of cultivated apples; however, the genetic basis of rootstock-induced dwarfing remains largely unclear. We have assembled chromosome-level, near-gapless and haplotype-resolved genomes for the popular dwarfing rootstock ‘M9’, the semi-vigorous rootstock ‘MM106’ and ‘Fuji’, one of the most commonly grown apple cultivars. The apple orthologue of auxin response factor 3 (MdARF3) is in the Dw1 region of ‘M9’, the major locus for rootstock-induced dwarfing. Comparing ‘M9’ and ‘MM106’ genomes revealed a 9,723-bp allele-specific long terminal repeat retrotransposon/gypsy insertion, DwTE, located upstream of MdARF3. DwTE is cosegregated with the dwarfing trait in two segregating populations, suggesting its prospective utility in future dwarfing rootstock breeding. In addition, our pipeline discovered mobile mRNAs that may contribute to the development of dwarfed scion architecture. Our research provides valuable genomic resources and applicable methodology, which have the potential to accelerate breeding dwarfing rootstocks for apple and other perennial woody fruit trees.

Annotation of Whole Genome Assembly 

  Protein
Unphased 50,227
Haplome A 50,247
Haplome B 50,423

Statistics of Genome Assembly

  Unphased Haplome A Haplome B
Genome size (bp) 704,846,311 688,999,876 682,572,451
GC content 38.41% 38.38% 38.39%
Genome sequence No. 1,193 156 116
Maximum genome sequence length (bp) 55,895,690 55,065,909 55,900,941
Minimum genome sequence length (bp) 9,804 22,389 19,530
Average genome sequence length (bp) 590,818 4,416,666 5,884,245
Genome sequence N50 (bp) 38,872,429 38,161,495 37,779,591
Genome sequence N90 (bp) 30,688,119 30,413,969 31,423,387
Homology

Homology of the Malus x domestica MM106 genome v1.0.a2 proteins was determined by pairwise sequence comparison using the blastp algorithm against various protein databases. An expectation value cutoff less than 1e-6  for the Arabidoposis proteins (Araport11, 2022-09), UniProtKB/SwissProt (Release 2023-07), and UniProtKB/TrEMBL (Release 2023-07) databases. The best hit reports are available for download in Excel format. 

Protein Homologs

Malus x domestica v1.0.a2 proteins with arabidopsis (Araport11) homologs (EXCEL file) MM106_v1.0.a2_vs_arabidopsis.xlsx.gz
Malus x domestica v1.0.a2 proteins with arabidopsis (Araport11) (FASTA file) MM106_v1.0.a2_vs_arabidopsis_hit.fasta.gz
Malus x domestica v1.0.a2 proteins without arabidopsis (Araport11) (FASTA file) MM106_v1.0.a2_vs_arabidopsis_noHit.fasta.gz
Malus x domestica v1.0.a2 proteins with SwissProt homologs (EXCEL file) MM106_v1.0.a2_vs_swissprot.xlsx.gz
Malus x domestica v1.0.a2 proteins with SwissProt (FASTA file) MM106_v1.0.a2_vs_swissprot_hit.fasta.gz
Malus x domestica v1.0.a2 proteins without SwissProt (FASTA file) MM106_v1.0.a2_vs_swissprot_noHit.fasta.gz
Malus x domestica v1.0.a2 proteins with TrEMBL homologs (EXCEL file) MM106_v1.0.a2_vs_trembl.xlsx.gz
Malus x domestica v1.0.a2 proteins with TrEMBL (FASTA file) MM106_v1.0.a2_vs_trembl_hit.fasta.gz
Malus x domestica v1.0.a2 proteins without TrEMBL (FASTA file) MM106_v1.0.a2_vs_trembl_noHit.fasta.gz
Assembly

The Malus x domestica MM106 Genome v1.0.a2 assembly files are available in FASTA format.

Downloads

Chromosomes (unphased) (FASTA file) Malus_x_domestica_MM106_unphased_V1.0.a2.fasta.gz
Chromosomes (haplome A) (FASTA file) Malus_x_domestica_MM106_HapA_V1.0.a2.fasta.gz
Chromosomes (haplome B) (FASTA file) Malus_x_domestica_MM106_HapB_V1.0.a2.fasta.gz
Gene Predictions

The Malus x domestica MM106 v1.0.a2 genome gene prediction files are available in GFF3 and FASTA format.

Downloads

Genes (unphased) (GFF3 file) MM106_unphased_V1.0.a2.genes.gff3.gz
Genes (haplome A) (GFF3 file) MM106_HapA_V1.0.a2.genes.gff3.gz
Genes (haplome B) (GFF3 file) MM106_HapB_V1.0.a2.genes.gff3.gz
Protein sequences (unphased) (FASTA file) MM106_unphased_V1.0.a2.pep.fasta.gz
Protein sequences (haplome A) (FASTA file) MM106_HapA_V1.0.a2.pep.fasta.gz
Protein sequences (haplome B) (FASTA file) MM106_HapB_V1.0.a2.pep.fasta.gz
CDS sequences (unphased) (FASTA file) MM106_unphased_V1.0.a2.cds.fasta.gz
CDS sequences (haplome A) (FASTA file) MM106_HapA_V1.0.a2.cds.fasta.gz
CDS sequences (haplome B) (FASTA file) MM106_HapB_V1.0.a2.cds.fasta.gz
Functional Analysis

Functional annotations for the Malus x domestica MM106 genome v1.0.a2 are available for download below. The Malus x domestica MM106 genome v1.0.a2 proteins were analyzed using InterProScan in order to assign InterPro domains and Gene Ontology (GO) terms. Pathways analysis was performed using the KEGG Automatic Annotation Server (KAAS).

Downloads

GO assignments from InterProScan MM106_v1.0.a2_genes2GO.xlsx.gz
IPR assignments from InterProScan MM106_v1.0.a2_genes2IPR.xlsx.gz
Proteins mapped to KEGG Orthologs MM106_v1.0.a2_KEGG-orthologis.xlsx.gz
Proteins mapped to KEGG Pathways MM106_v1.0.a2_KEGG-pathways.xlsx.gz
Transcript Alignments
Transcript alignments were performed by the GDR Team of Main Bioinformatics Lab at WSU. The alignment tool 'BLAT' was used to map transcripts to the Malus x domestica MM106 genome assembly. Alignments with an alignment length of 97% and 97% identify were preserved. The available files are in GFF3 format.

 

Malus x domestica GDR RefTrans v1 MM106_v1.0.a2_f.x.ananassa_GDR_reftransV1
Prunus avium GDR RefTrans v1 MM106_v1.0.a2_p.avium_GDR_reftransV1
Prunus persica GDR RefTrans v1 MM106_v1.0.a2_p.persica_GDR_reftransV1
Rosa GDR RefTrans v1 MM106_v1.0.a2_rosa_GDR_reftransV1
Rubus GDR RefTrans v2 MM106_v1.0.a2_rubus_GDR_reftransV2
Malus_x_domestica GDR RefTrans v1 MM106_v1.0.a2_m.x.domestica_GDR_reftransV1
Pyrus GDR RefTrans v1 MM106_v1.0.a2_pyrus_GDR_reftransV1