Evaluation of SNP Data from the Malus Infinium Array Identifies Challenges for Genetic Analysis of Complex Genomes of Polyploid Origin

Publication Overview
TitleEvaluation of SNP Data from the Malus Infinium Array Identifies Challenges for Genetic Analysis of Complex Genomes of Polyploid Origin
AuthorsTroggio M, Surbanovski N, Bianco L, Moretto M, Giongo L, Banchi E, Viola R, Fernández FF, Costa F, Velasco R, Cestaro A, Sargent DJ
TypeJournal Article
Journal NamePloS one
Volume8
Issue6
Year2013
Page(s)e67407
CitationTroggio M, Surbanovski N, Bianco L, Moretto M, Giongo L, Banchi E, Viola R, Fernández FF, Costa F, Velasco R, Cestaro A, Sargent DJ. Evaluation of SNP Data from the Malus Infinium Array Identifies Challenges for Genetic Analysis of Complex Genomes of Polyploid Origin. PloS one. 2013; 8(6):e67407.

Abstract

High throughput arrays for the simultaneous genotyping of thousands of single-nucleotide polymorphisms (SNPs) have made the rapid genetic characterisation of plant genomes and the development of saturated linkage maps a realistic prospect for many plant species of agronomic importance. However, the correct calling of SNP genotypes in divergent polyploid genomes using array technology can be problematic due to paralogy, and to divergence in probe sequences causing changes in probe binding efficiencies. An Illumina Infinium II whole-genome genotyping array was recently developed for the cultivated apple and used to develop a molecular linkage map for an apple rootstock progeny (M432), but a large proportion of segregating SNPs were not mapped in the progeny, due to unexpected genotype clustering patterns. To investigate the causes of this unexpected clustering we performed BLAST analysis of all probe sequences against the 'Golden Delicious' genome sequence and discovered evidence for paralogous annealing sites and probe sequence divergence for a high proportion of probes contained on the array. Following visual re-evaluation of the genotyping data generated for 8,788 SNPs for the M432 progeny using the array, we manually re-scored genotypes at 818 loci and mapped a further 797 markers to the M432 linkage map. The newly mapped markers included the majority of those that could not be mapped previously, as well as loci that were previously scored as monomorphic, but which segregated due to divergence leading to heterozygosity in probe annealing sites. An evaluation of the 8,788 probes in a diverse collection of Malus germplasm showed that more than half the probes returned genotype clustering patterns that were difficult or impossible to interpret reliably, highlighting implications for the use of the array in genome-wide association studies.

Features
This publication contains information about 818 features:
Feature NameUniquenameType
RosBREEDSNP_SNP_AG_7254175_Lg15_00274_MAF40_1678650_exon1RosBREEDSNP_SNP_AG_7254175_Lg15_00274_MAF40_1678650_exon1genetic_marker
RosBREEDSNP_SNP_GA_6942588_Lg15_snpDT001928_MAF20_1676443_exon1RosBREEDSNP_SNP_GA_6942588_Lg15_snpDT001928_MAF20_1676443_exon1genetic_marker
RosBREEDSNP_SNP_GA_8229724_Lg15_00796_MAF50_614785_exon1RosBREEDSNP_SNP_GA_8229724_Lg15_00796_MAF50_614785_exon1genetic_marker
RosBREEDSNP_SNP_TC_8963216_Lg15_00894_MAF30_1684474_exon10RosBREEDSNP_SNP_TC_8963216_Lg15_00894_MAF30_1684474_exon10genetic_marker
RosBREEDSNP_SNP_GA_13677933_Lg15_01587_MAF50_1626292_exon1RosBREEDSNP_SNP_GA_13677933_Lg15_01587_MAF50_1626292_exon1genetic_marker
RosBREEDSNP_SNP_CT_22112531_Lg15_01813_MAF50_MDP0000313840_exon1RosBREEDSNP_SNP_CT_22112531_Lg15_01813_MAF50_MDP0000313840_exon1genetic_marker
RosBREEDSNP_SNP_AG_15835969_Lg15_00273_MAF10_MDP0000163438_exon3RosBREEDSNP_SNP_AG_15835969_Lg15_00273_MAF10_MDP0000163438_exon3genetic_marker
RosBREEDSNP_SNP_AG_16444112_Lg15_01229_MAF10_121480_exon5RosBREEDSNP_SNP_AG_16444112_Lg15_01229_MAF10_121480_exon5genetic_marker
RosBREEDSNP_SNP_AG_21316906_Lg15_MDP0000677258__MDP0000677258_exon4RosBREEDSNP_SNP_AG_21316906_Lg15_MDP0000677258__MDP0000677258_exon4genetic_marker
RosBREEDSNP_SNP_CT_21317002_Lg15_MDP0000677258__MDP0000677258_exon4RosBREEDSNP_SNP_CT_21317002_Lg15_MDP0000677258__MDP0000677258_exon4genetic_marker
RosBREEDSNP_SNP_AG_21320047_Lg15_MDP0000677258_MAF50_MDP0000677258_exon1RosBREEDSNP_SNP_AG_21320047_Lg15_MDP0000677258_MAF50_MDP0000677258_exon1genetic_marker
RosBREEDSNP_SNP_GA_24689342_Lg15_02859_MAF40_MDP0000150372_exon1RosBREEDSNP_SNP_GA_24689342_Lg15_02859_MAF40_MDP0000150372_exon1genetic_marker
RosBREEDSNP_SNP_GT_11554302_Lg2_01005_MAF20_1643422_exon1RosBREEDSNP_SNP_GT_11554302_Lg2_01005_MAF20_1643422_exon1genetic_marker
RosBREEDSNP_SNP_AG_27126107_Lg15_182111_MAF40_182111_exon1RosBREEDSNP_SNP_AG_27126107_Lg15_182111_MAF40_182111_exon1genetic_marker
GDsnp00429GDsnp00429genetic_marker
GDsnp02003GDsnp02003genetic_marker
GDsnp00816GDsnp00816genetic_marker
RosBREEDSNP_SNP_TC_27876981_Lg15_00208_MAF20_1638877_exon1RosBREEDSNP_SNP_TC_27876981_Lg15_00208_MAF20_1638877_exon1genetic_marker
RosBREEDSNP_SNP_CT_2485699_Lg15_02844_MAF50_MDP0000321199_exon4RosBREEDSNP_SNP_CT_2485699_Lg15_02844_MAF50_MDP0000321199_exon4genetic_marker
RosBREEDSNP_SNP_GA_33700351_Lg15_01897_MAF40_MDP0000170511_exon4RosBREEDSNP_SNP_GA_33700351_Lg15_01897_MAF40_MDP0000170511_exon4genetic_marker
RosBREEDSNP_SNP_AG_3603469_Lg15_01047_MAF10_MDP0000283970_exon1RosBREEDSNP_SNP_AG_3603469_Lg15_01047_MAF10_MDP0000283970_exon1genetic_marker
RosBREED_RosBREEDSNP_SNP_AG_64597_Lg10RosBREED_RosBREEDSNP_SNP_AG_64597_Lg10genetic_marker
RosBREEDSNP_SNP_GA_42727752_Lg15_01133_MAF40_1636710_exon4RosBREEDSNP_SNP_GA_42727752_Lg15_01133_MAF40_1636710_exon4genetic_marker
RosBREEDSNP_SNP_TC_45022535_Lg15_ACS_MAF50_1618441_exon1RosBREEDSNP_SNP_TC_45022535_Lg15_ACS_MAF50_1618441_exon1genetic_marker
RosBREEDSNP_SNP_GA_45058256_Lg15_ACS_MAF50_MDP0000318443_exon5RosBREEDSNP_SNP_GA_45058256_Lg15_ACS_MAF50_MDP0000318443_exon5genetic_marker

Pages

Featuremaps
This publication contains information about 2 maps:
Map Name
Apple-M432-2013
Apple-M432-2013-physical-Malus-domestica