Evaluation of SNP Data from the Malus Infinium Array Identifies Challenges for Genetic Analysis of Complex Genomes of Polyploid Origin

Publication Overview
TitleEvaluation of SNP Data from the Malus Infinium Array Identifies Challenges for Genetic Analysis of Complex Genomes of Polyploid Origin
AuthorsTroggio M, Surbanovski N, Bianco L, Moretto M, Giongo L, Banchi E, Viola R, Fernández FF, Costa F, Velasco R, Cestaro A, Sargent DJ
TypeJournal Article
Journal NamePloS one
Volume8
Issue6
Year2013
Page(s)e67407
CitationTroggio M, Surbanovski N, Bianco L, Moretto M, Giongo L, Banchi E, Viola R, Fernández FF, Costa F, Velasco R, Cestaro A, Sargent DJ. Evaluation of SNP Data from the Malus Infinium Array Identifies Challenges for Genetic Analysis of Complex Genomes of Polyploid Origin. PloS one. 2013; 8(6):e67407.

Abstract

High throughput arrays for the simultaneous genotyping of thousands of single-nucleotide polymorphisms (SNPs) have made the rapid genetic characterisation of plant genomes and the development of saturated linkage maps a realistic prospect for many plant species of agronomic importance. However, the correct calling of SNP genotypes in divergent polyploid genomes using array technology can be problematic due to paralogy, and to divergence in probe sequences causing changes in probe binding efficiencies. An Illumina Infinium II whole-genome genotyping array was recently developed for the cultivated apple and used to develop a molecular linkage map for an apple rootstock progeny (M432), but a large proportion of segregating SNPs were not mapped in the progeny, due to unexpected genotype clustering patterns. To investigate the causes of this unexpected clustering we performed BLAST analysis of all probe sequences against the 'Golden Delicious' genome sequence and discovered evidence for paralogous annealing sites and probe sequence divergence for a high proportion of probes contained on the array. Following visual re-evaluation of the genotyping data generated for 8,788 SNPs for the M432 progeny using the array, we manually re-scored genotypes at 818 loci and mapped a further 797 markers to the M432 linkage map. The newly mapped markers included the majority of those that could not be mapped previously, as well as loci that were previously scored as monomorphic, but which segregated due to divergence leading to heterozygosity in probe annealing sites. An evaluation of the 8,788 probes in a diverse collection of Malus germplasm showed that more than half the probes returned genotype clustering patterns that were difficult or impossible to interpret reliably, highlighting implications for the use of the array in genome-wide association studies.

Features
This publication contains information about 818 features:
Feature NameUniquenameType
RosBREEDSNP_SNP_CA_12543851_Lg12_MDP0000210506_MAF20_MDP0000210506_exon2RosBREEDSNP_SNP_CA_12543851_Lg12_MDP0000210506_MAF20_MDP0000210506_exon2genetic_marker
RosBREEDSNP_SNP_GA_35135007_Lg5_62137_MAF20_62137_exon1RosBREEDSNP_SNP_GA_35135007_Lg5_62137_MAF20_62137_exon1genetic_marker
RosBREEDSNP_SNP_GA_5683344_Lg17_00606_MAF10_MDP0000486718_exon1RosBREEDSNP_SNP_GA_5683344_Lg17_00606_MAF10_MDP0000486718_exon1genetic_marker
Pear_Bartlett_RosBREEDSNP_SNP_CT_234_5105104Pear_Bartlett_RosBREEDSNP_SNP_CT_234_5105104genetic_marker
RosBREEDSNP_SNP_TG_21317748_Lg12_01426_MAF20_411066_exon1RosBREEDSNP_SNP_TG_21317748_Lg12_01426_MAF20_411066_exon1genetic_marker
RosBREEDSNP_SNP_TC_21387181_Lg12_01426_MAF40_1680072_exon2RosBREEDSNP_SNP_TC_21387181_Lg12_01426_MAF40_1680072_exon2genetic_marker
RosBREEDSNP_SNP_GA_23976782_Lg12_01798_MAF10_MDP0000146103_exon2RosBREEDSNP_SNP_GA_23976782_Lg12_01798_MAF10_MDP0000146103_exon2genetic_marker
RosBREEDSNP_SNP_CT_29063582_Lg12_00334_MAF40_1650968_exon1RosBREEDSNP_SNP_CT_29063582_Lg12_00334_MAF40_1650968_exon1genetic_marker
RosBREEDSNP_SNP_CA_29070012_Lg12_00334_MAF30_MDP0000163756_exon2RosBREEDSNP_SNP_CA_29070012_Lg12_00334_MAF30_MDP0000163756_exon2genetic_marker
RosBREEDSNP_SNP_CT_29019870_Lg12_01769_MAF20_491569_exon2RosBREEDSNP_SNP_CT_29019870_Lg12_01769_MAF20_491569_exon2genetic_marker
RosBREEDSNP_SNP_GT_13099646_Lg1_01494_MAF40_464452_exon1RosBREEDSNP_SNP_GT_13099646_Lg1_01494_MAF40_464452_exon1genetic_marker
RosBREEDSNP_SNP_AG_21422131_Lg4_01558_MAF30_MDP0000270110_exon4RosBREEDSNP_SNP_AG_21422131_Lg4_01558_MAF30_MDP0000270110_exon4genetic_marker
FFRJ0CP02EF3PU_120FFRJ0CP02EF3PU_120genetic_marker
RosBREEDSNP_SNP_CT_32711249_Lg12_00840_MAF20_573182_exon1RosBREEDSNP_SNP_CT_32711249_Lg12_00840_MAF20_573182_exon1genetic_marker
RosBREEDSNP_SNP_GT_32732927_Lg12_00840_MAF10_MDP0000138835_exon3RosBREEDSNP_SNP_GT_32732927_Lg12_00840_MAF10_MDP0000138835_exon3genetic_marker
RosBREEDSNP_SNP_CT_32461729_Lg12_00362_MAF20_793506_exon1RosBREEDSNP_SNP_CT_32461729_Lg12_00362_MAF20_793506_exon1genetic_marker
RosBREEDSNP_SNP_GA_11024902_Lg15_02869_MAF10_MDP0000792563_exon1RosBREEDSNP_SNP_GA_11024902_Lg15_02869_MAF10_MDP0000792563_exon1genetic_marker
snpCN899883snpCN899883genetic_marker
Pear_Bartlett_RosBREEDSNP_SNP_CT_232_3193584Pear_Bartlett_RosBREEDSNP_SNP_CT_232_3193584genetic_marker
RosBREEDSNP_SNP_AC_35124816_Lg12_01665_MAF40_MDP0000494111_exon1RosBREEDSNP_SNP_AC_35124816_Lg12_01665_MAF40_MDP0000494111_exon1genetic_marker
RosBREEDSNP_SNP_GA_34947963_Lg12_01647_MAF40_517970_exon1RosBREEDSNP_SNP_GA_34947963_Lg12_01647_MAF40_517970_exon1genetic_marker
GDsnp01647GDsnp01647genetic_marker
RosBREEDSNP_SNP_CA_34914300_Lg12_01647_MAF30_1665651_exon1RosBREEDSNP_SNP_CA_34914300_Lg12_01647_MAF30_1665651_exon1genetic_marker
Pear_Bartlett_RosBREEDSNP_SNP_AG_182_5080116Pear_Bartlett_RosBREEDSNP_SNP_AG_182_5080116genetic_marker
RosBREEDSNP_SNP_TC_35445616_Lg12_01793_MAF50_1622722_exon9RosBREEDSNP_SNP_TC_35445616_Lg12_01793_MAF50_1622722_exon9genetic_marker

Pages

Featuremaps
This publication contains information about 2 maps:
Map Name
Apple-M432-2013
Apple-M432-2013-physical-Malus-domestica