Evaluation of SNP Data from the Malus Infinium Array Identifies Challenges for Genetic Analysis of Complex Genomes of Polyploid Origin

Publication Overview
TitleEvaluation of SNP Data from the Malus Infinium Array Identifies Challenges for Genetic Analysis of Complex Genomes of Polyploid Origin
AuthorsTroggio M, Surbanovski N, Bianco L, Moretto M, Giongo L, Banchi E, Viola R, Fernández FF, Costa F, Velasco R, Cestaro A, Sargent DJ
TypeJournal Article
Journal NamePloS one
Volume8
Issue6
Year2013
Page(s)e67407
CitationTroggio M, Surbanovski N, Bianco L, Moretto M, Giongo L, Banchi E, Viola R, Fernández FF, Costa F, Velasco R, Cestaro A, Sargent DJ. Evaluation of SNP Data from the Malus Infinium Array Identifies Challenges for Genetic Analysis of Complex Genomes of Polyploid Origin. PloS one. 2013; 8(6):e67407.

Abstract

High throughput arrays for the simultaneous genotyping of thousands of single-nucleotide polymorphisms (SNPs) have made the rapid genetic characterisation of plant genomes and the development of saturated linkage maps a realistic prospect for many plant species of agronomic importance. However, the correct calling of SNP genotypes in divergent polyploid genomes using array technology can be problematic due to paralogy, and to divergence in probe sequences causing changes in probe binding efficiencies. An Illumina Infinium II whole-genome genotyping array was recently developed for the cultivated apple and used to develop a molecular linkage map for an apple rootstock progeny (M432), but a large proportion of segregating SNPs were not mapped in the progeny, due to unexpected genotype clustering patterns. To investigate the causes of this unexpected clustering we performed BLAST analysis of all probe sequences against the 'Golden Delicious' genome sequence and discovered evidence for paralogous annealing sites and probe sequence divergence for a high proportion of probes contained on the array. Following visual re-evaluation of the genotyping data generated for 8,788 SNPs for the M432 progeny using the array, we manually re-scored genotypes at 818 loci and mapped a further 797 markers to the M432 linkage map. The newly mapped markers included the majority of those that could not be mapped previously, as well as loci that were previously scored as monomorphic, but which segregated due to divergence leading to heterozygosity in probe annealing sites. An evaluation of the 8,788 probes in a diverse collection of Malus germplasm showed that more than half the probes returned genotype clustering patterns that were difficult or impossible to interpret reliably, highlighting implications for the use of the array in genome-wide association studies.

Features
This publication contains information about 818 features:
Feature NameUniquenameType
RosBREEDSNP_SNP_AG_4025888_Lg2_01476_MAF10_203524_exon1RosBREEDSNP_SNP_AG_4025888_Lg2_01476_MAF10_203524_exon1genetic_marker
RosBREEDSNP_SNP_GA_4048746_Lg2_01476_MAF30_336678_exon1RosBREEDSNP_SNP_GA_4048746_Lg2_01476_MAF30_336678_exon1genetic_marker
RosBREEDSNP_SNP_CT_4326366_Lg2_01945_MAF50_1633451_exon1RosBREEDSNP_SNP_CT_4326366_Lg2_01945_MAF50_1633451_exon1genetic_marker
RosBREEDSNP_SNP_CT_4328900_Lg2_01945_MAF30_MDP0000137471_exon4RosBREEDSNP_SNP_CT_4328900_Lg2_01945_MAF30_MDP0000137471_exon4genetic_marker
RosBREEDSNP_SNP_TC_4335862_Lg2_01945_MAF40_MDP0000281368_exon1RosBREEDSNP_SNP_TC_4335862_Lg2_01945_MAF40_MDP0000281368_exon1genetic_marker
RosBREEDSNP_SNP_AG_3257068_Lg2_RosCOS1998_MAF20_110688_exon1RosBREEDSNP_SNP_AG_3257068_Lg2_RosCOS1998_MAF20_110688_exon1genetic_marker
RosBREEDSNP_SNP_GT_4923623_Lg2_RosCOS3565_MAF40_1644877_exon3RosBREEDSNP_SNP_GT_4923623_Lg2_RosCOS3565_MAF40_1644877_exon3genetic_marker
Pear_Bartlett_RosBREEDSNP_SNP_TC_293_5055608Pear_Bartlett_RosBREEDSNP_SNP_TC_293_5055608genetic_marker
RosBREEDSNP_SNP_AG_5329429_Lg2_00158_MAF40_MDP0000602219_exon1RosBREEDSNP_SNP_AG_5329429_Lg2_00158_MAF40_MDP0000602219_exon1genetic_marker
RosBREEDSNP_SNP_TC_5370279_Lg2_00158_MAF10_MDP0000276264_exon1RosBREEDSNP_SNP_TC_5370279_Lg2_00158_MAF10_MDP0000276264_exon1genetic_marker
RosBREEDSNP_SNP_CT_6263352_Lg2_00308_MAF30_MDP0000292425_exon1RosBREEDSNP_SNP_CT_6263352_Lg2_00308_MAF30_MDP0000292425_exon1genetic_marker
RosBREEDSNP_SNP_CT_12901428_Lg2_01552_MAF30_525812_exon2RosBREEDSNP_SNP_CT_12901428_Lg2_01552_MAF30_525812_exon2genetic_marker
RosBREEDSNP_SNP_AG_12942637_Lg2_01552_MAF50_MDP0000683486_exon1RosBREEDSNP_SNP_AG_12942637_Lg2_01552_MAF50_MDP0000683486_exon1genetic_marker
RosBREEDSNP_SNP_CT_28093574_Lg7_135556_MAF30_135556_exon1RosBREEDSNP_SNP_CT_28093574_Lg7_135556_MAF30_135556_exon1genetic_marker
RosBREEDSNP_SNP_GA_10695663_Lg2_01223_MAF50_265612_exon1RosBREEDSNP_SNP_GA_10695663_Lg2_01223_MAF50_265612_exon1genetic_marker
RosBREEDSNP_SNP_GA_8931588_Lg2_00407_MAF20_1644091_exon1RosBREEDSNP_SNP_GA_8931588_Lg2_00407_MAF20_1644091_exon1genetic_marker
RosBREEDSNP_SNP_TC_47073089_Lg15_167070_MAF20_167070_exon1RosBREEDSNP_SNP_TC_47073089_Lg15_167070_MAF20_167070_exon1genetic_marker
Pear_Bartlett_RosBREEDSNP_SNP_TC_166_5101067Pear_Bartlett_RosBREEDSNP_SNP_TC_166_5101067genetic_marker
RosBREEDSNP_SNP_TG_47073324_Lg15_167070_MAF40_167070_exon1RosBREEDSNP_SNP_TG_47073324_Lg15_167070_MAF40_167070_exon1genetic_marker
Pear_Bartlett_RosBREEDSNP_SNP_CT_232_5069871Pear_Bartlett_RosBREEDSNP_SNP_CT_232_5069871genetic_marker
RosBREEDSNP_SNP_TC_11380347_Lg2_00159_MAF30_490743_exon4RosBREEDSNP_SNP_TC_11380347_Lg2_00159_MAF30_490743_exon4genetic_marker
RosBREEDSNP_SNP_TC_11503103_Lg2_01005_MAF40_254568_exon1RosBREEDSNP_SNP_TC_11503103_Lg2_01005_MAF40_254568_exon1genetic_marker
RosBREEDSNP_SNP_TC_47073227_Lg15_167070__167070_exon1RosBREEDSNP_SNP_TC_47073227_Lg15_167070__167070_exon1genetic_marker
Pear_Bartlett_RosBREEDSNP_SNP_TC_317_407017Pear_Bartlett_RosBREEDSNP_SNP_TC_317_407017genetic_marker
GDsnp00159GDsnp00159genetic_marker

Pages

Featuremaps
This publication contains information about 2 maps:
Map Name
Apple-M432-2013
Apple-M432-2013-physical-Malus-domestica