Evaluation of SNP Data from the Malus Infinium Array Identifies Challenges for Genetic Analysis of Complex Genomes of Polyploid Origin

Publication Overview
TitleEvaluation of SNP Data from the Malus Infinium Array Identifies Challenges for Genetic Analysis of Complex Genomes of Polyploid Origin
AuthorsTroggio M, Surbanovski N, Bianco L, Moretto M, Giongo L, Banchi E, Viola R, Fernández FF, Costa F, Velasco R, Cestaro A, Sargent DJ
TypeJournal Article
Journal NamePloS one
Volume8
Issue6
Year2013
Page(s)e67407
CitationTroggio M, Surbanovski N, Bianco L, Moretto M, Giongo L, Banchi E, Viola R, Fernández FF, Costa F, Velasco R, Cestaro A, Sargent DJ. Evaluation of SNP Data from the Malus Infinium Array Identifies Challenges for Genetic Analysis of Complex Genomes of Polyploid Origin. PloS one. 2013; 8(6):e67407.

Abstract

High throughput arrays for the simultaneous genotyping of thousands of single-nucleotide polymorphisms (SNPs) have made the rapid genetic characterisation of plant genomes and the development of saturated linkage maps a realistic prospect for many plant species of agronomic importance. However, the correct calling of SNP genotypes in divergent polyploid genomes using array technology can be problematic due to paralogy, and to divergence in probe sequences causing changes in probe binding efficiencies. An Illumina Infinium II whole-genome genotyping array was recently developed for the cultivated apple and used to develop a molecular linkage map for an apple rootstock progeny (M432), but a large proportion of segregating SNPs were not mapped in the progeny, due to unexpected genotype clustering patterns. To investigate the causes of this unexpected clustering we performed BLAST analysis of all probe sequences against the 'Golden Delicious' genome sequence and discovered evidence for paralogous annealing sites and probe sequence divergence for a high proportion of probes contained on the array. Following visual re-evaluation of the genotyping data generated for 8,788 SNPs for the M432 progeny using the array, we manually re-scored genotypes at 818 loci and mapped a further 797 markers to the M432 linkage map. The newly mapped markers included the majority of those that could not be mapped previously, as well as loci that were previously scored as monomorphic, but which segregated due to divergence leading to heterozygosity in probe annealing sites. An evaluation of the 8,788 probes in a diverse collection of Malus germplasm showed that more than half the probes returned genotype clustering patterns that were difficult or impossible to interpret reliably, highlighting implications for the use of the array in genome-wide association studies.

Features
This publication contains information about 818 features:
Feature NameUniquenameType
RosBREEDSNP_SNP_GA_23370407_Lg4_01567_MAF40_MDP0000821137_exon3RosBREEDSNP_SNP_GA_23370407_Lg4_01567_MAF40_MDP0000821137_exon3genetic_marker
Pear_Bartlett_RosBREEDSNP_SNP_AG_476_5095891Pear_Bartlett_RosBREEDSNP_SNP_AG_476_5095891genetic_marker
GDsnp00283GDsnp00283genetic_marker
RosBREEDSNP_SNP_CT_12888634_Lg2_01552_MAF40_1644276_exon3RosBREEDSNP_SNP_CT_12888634_Lg2_01552_MAF40_1644276_exon3genetic_marker
RosBREEDSNP_SNP_TC_477752_Lg5_MDP0000217438_MAF10_MDP0000217438_exon2RosBREEDSNP_SNP_TC_477752_Lg5_MDP0000217438_MAF10_MDP0000217438_exon2genetic_marker
RosBREEDSNP_SNP_AG_13033111_Lg10_RosCOS724_MAF40_MDP0000193868_exon3RosBREEDSNP_SNP_AG_13033111_Lg10_RosCOS724_MAF40_MDP0000193868_exon3genetic_marker
RosBREEDSNP_SNP_CT_33490066_Lg10_01683_MAF20_1641026_exon8RosBREEDSNP_SNP_CT_33490066_Lg10_01683_MAF20_1641026_exon8genetic_marker
RosBREEDSNP_SNP_CA_6171980_Lg5_01333_MAF10_1671392_exon1RosBREEDSNP_SNP_CA_6171980_Lg5_01333_MAF10_1671392_exon1genetic_marker
RosBREEDSNP_SNP_AG_22015716_Lg4_MDP0000157871__MDP0000157871_exon1RosBREEDSNP_SNP_AG_22015716_Lg4_MDP0000157871__MDP0000157871_exon1genetic_marker
RosBREEDSNP_SNP_AC_5215656_Lg5_00544_MAF40_922834_exon1RosBREEDSNP_SNP_AC_5215656_Lg5_00544_MAF40_922834_exon1genetic_marker
RosBREEDSNP_SNP_GA_7095813_Lg5_01151_MAF50_1629285_exon8RosBREEDSNP_SNP_GA_7095813_Lg5_01151_MAF50_1629285_exon8genetic_marker
Pear_Bartlett_RosBREEDSNP_SNP_CA_255_5066108Pear_Bartlett_RosBREEDSNP_SNP_CA_255_5066108genetic_marker
RosBREEDSNP_SNP_GT_14442489_Lg5_245169_MAF30_245169_exon2RosBREEDSNP_SNP_GT_14442489_Lg5_245169_MAF30_245169_exon2genetic_marker
RosBREEDSNP_SNP_TC_14325682_Lg5_02834_MAF10_1685287_exon3RosBREEDSNP_SNP_TC_14325682_Lg5_02834_MAF10_1685287_exon3genetic_marker
Pear_Bartlett_RosBREEDSNP_SNP_CT_155_5054185Pear_Bartlett_RosBREEDSNP_SNP_CT_155_5054185genetic_marker
RosBREEDSNP_SNP_CT_21981726_Lg4_01965_MAF20_MDP0000589895_exon1RosBREEDSNP_SNP_CT_21981726_Lg4_01965_MAF20_MDP0000589895_exon1genetic_marker
RosBREEDSNP_SNP_TC_9223583_Lg5_01896_MAF20_209995_exon1RosBREEDSNP_SNP_TC_9223583_Lg5_01896_MAF20_209995_exon1genetic_marker
GDsnp00971GDsnp00971genetic_marker
RosBREEDSNP_SNP_TC_9228815_Lg5_01896_MAF30_MDP0000212691_exon8RosBREEDSNP_SNP_TC_9228815_Lg5_01896_MAF30_MDP0000212691_exon8genetic_marker
RosBREEDSNP_SNP_AG_9253841_Lg5_01896_MAF10_1654111_exon1RosBREEDSNP_SNP_AG_9253841_Lg5_01896_MAF10_1654111_exon1genetic_marker
GDsnp00373GDsnp00373genetic_marker
RosBREEDSNP_SNP_AG_9226772_Lg5_01896_MAF40_1643090_exon1RosBREEDSNP_SNP_AG_9226772_Lg5_01896_MAF40_1643090_exon1genetic_marker
GDsnp01896GDsnp01896genetic_marker
Pear_Bartlett_RosBREEDSNP_SNP_CT_251_5087097Pear_Bartlett_RosBREEDSNP_SNP_CT_251_5087097genetic_marker
RosBREEDSNP_SNP_GA_8408827_Lg11_00185_MAF20_1632007_exon3RosBREEDSNP_SNP_GA_8408827_Lg11_00185_MAF20_1632007_exon3genetic_marker

Pages

Featuremaps
This publication contains information about 2 maps:
Map Name
Apple-M432-2013
Apple-M432-2013-physical-Malus-domestica