Evaluation of SNP Data from the Malus Infinium Array Identifies Challenges for Genetic Analysis of Complex Genomes of Polyploid Origin

Publication Overview
TitleEvaluation of SNP Data from the Malus Infinium Array Identifies Challenges for Genetic Analysis of Complex Genomes of Polyploid Origin
AuthorsTroggio M, Surbanovski N, Bianco L, Moretto M, Giongo L, Banchi E, Viola R, Fernández FF, Costa F, Velasco R, Cestaro A, Sargent DJ
TypeJournal Article
Journal NamePloS one
Volume8
Issue6
Year2013
Page(s)e67407
CitationTroggio M, Surbanovski N, Bianco L, Moretto M, Giongo L, Banchi E, Viola R, Fernández FF, Costa F, Velasco R, Cestaro A, Sargent DJ. Evaluation of SNP Data from the Malus Infinium Array Identifies Challenges for Genetic Analysis of Complex Genomes of Polyploid Origin. PloS one. 2013; 8(6):e67407.

Abstract

High throughput arrays for the simultaneous genotyping of thousands of single-nucleotide polymorphisms (SNPs) have made the rapid genetic characterisation of plant genomes and the development of saturated linkage maps a realistic prospect for many plant species of agronomic importance. However, the correct calling of SNP genotypes in divergent polyploid genomes using array technology can be problematic due to paralogy, and to divergence in probe sequences causing changes in probe binding efficiencies. An Illumina Infinium II whole-genome genotyping array was recently developed for the cultivated apple and used to develop a molecular linkage map for an apple rootstock progeny (M432), but a large proportion of segregating SNPs were not mapped in the progeny, due to unexpected genotype clustering patterns. To investigate the causes of this unexpected clustering we performed BLAST analysis of all probe sequences against the 'Golden Delicious' genome sequence and discovered evidence for paralogous annealing sites and probe sequence divergence for a high proportion of probes contained on the array. Following visual re-evaluation of the genotyping data generated for 8,788 SNPs for the M432 progeny using the array, we manually re-scored genotypes at 818 loci and mapped a further 797 markers to the M432 linkage map. The newly mapped markers included the majority of those that could not be mapped previously, as well as loci that were previously scored as monomorphic, but which segregated due to divergence leading to heterozygosity in probe annealing sites. An evaluation of the 8,788 probes in a diverse collection of Malus germplasm showed that more than half the probes returned genotype clustering patterns that were difficult or impossible to interpret reliably, highlighting implications for the use of the array in genome-wide association studies.

Features
This publication contains information about 818 features:
Feature NameUniquenameType
RosBREEDSNP_SNP_GA_12108562_Lg13_RosCOS3331_MAF20_1676265_exon1RosBREEDSNP_SNP_GA_12108562_Lg13_RosCOS3331_MAF20_1676265_exon1genetic_marker
RosBREEDSNP_SNP_AG_20455509_Lg16_01237_MAF50_484960_exon3RosBREEDSNP_SNP_AG_20455509_Lg16_01237_MAF50_484960_exon3genetic_marker
RosBREEDSNP_SNP_CT_6824928_Lg16_02194_MAF20_1633886_exon1RosBREEDSNP_SNP_CT_6824928_Lg16_02194_MAF20_1633886_exon1genetic_marker
RosBREEDSNP_SNP_TC_20351261_Lg13_02063_MAF30_1627812_exon2RosBREEDSNP_SNP_TC_20351261_Lg13_02063_MAF30_1627812_exon2genetic_marker
RosBREEDSNP_SNP_TC_20185279_Lg13_00532_MAF50_MDP0000220963_exon2RosBREEDSNP_SNP_TC_20185279_Lg13_00532_MAF50_MDP0000220963_exon2genetic_marker
Pear_Bartlett_RosBREEDSNP_SNP_AC_203_5052677Pear_Bartlett_RosBREEDSNP_SNP_AC_203_5052677genetic_marker
RosBREEDSNP_SNP_TC_19065297_Lg7_02875_MAF30_MDP0000449918_exon1RosBREEDSNP_SNP_TC_19065297_Lg7_02875_MAF30_MDP0000449918_exon1genetic_marker
Pear_Bartlett_RosBREEDSNP_SNP_GA_221_5067150Pear_Bartlett_RosBREEDSNP_SNP_GA_221_5067150genetic_marker
RosBREEDSNP_SNP_GA_826847_Lg4_01597_MAF50_175965_exon1RosBREEDSNP_SNP_GA_826847_Lg4_01597_MAF50_175965_exon1genetic_marker
GDsnp02487GDsnp02487genetic_marker
RosBREEDSNP_SNP_CT_2466886_Lg2_00959_MAF20_MDP0000210239_exon5RosBREEDSNP_SNP_CT_2466886_Lg2_00959_MAF20_MDP0000210239_exon5genetic_marker
RosBREEDSNP_SNP_CT_19548750_Lg2_01840_MAF20_MDP0000736846_exon2RosBREEDSNP_SNP_CT_19548750_Lg2_01840_MAF20_MDP0000736846_exon2genetic_marker
RosBREEDSNP_SNP_CT_238231_Lg14_01888_MAF50_1669462_exon1RosBREEDSNP_SNP_CT_238231_Lg14_01888_MAF50_1669462_exon1genetic_marker
RosBREEDSNP_SNP_AG_635156_Lg14_AT2_MAF20_MDP0000475891_exon1RosBREEDSNP_SNP_AG_635156_Lg14_AT2_MAF20_MDP0000475891_exon1genetic_marker
RosBREEDSNP_SNP_CT_15154776_Lg6_RosCOS1295_MAF20_MDP0000530476_exon1RosBREEDSNP_SNP_CT_15154776_Lg6_RosCOS1295_MAF20_MDP0000530476_exon1genetic_marker
RosBREEDSNP_SNP_CT_2110465_Lg14_01846_MAF30_1669065_exon1RosBREEDSNP_SNP_CT_2110465_Lg14_01846_MAF30_1669065_exon1genetic_marker
RosBREEDSNP_SNP_CT_2138918_Lg14_01846_MAF30_506172_exon1RosBREEDSNP_SNP_CT_2138918_Lg14_01846_MAF30_506172_exon1genetic_marker
RosBREEDSNP_SNP_TC_20557590_Lg11_01187_MAF30_MDP0000647595_exon1RosBREEDSNP_SNP_TC_20557590_Lg11_01187_MAF30_MDP0000647595_exon1genetic_marker
RosBREEDSNP_SNP_TC_20561104_Lg11_01187_MAF40_MDP0000647595_exon3RosBREEDSNP_SNP_TC_20561104_Lg11_01187_MAF40_MDP0000647595_exon3genetic_marker
RosBREEDSNP_SNP_GA_2970938_Lg12_RosCOS1934_MAF30_1681178_exon3RosBREEDSNP_SNP_GA_2970938_Lg12_RosCOS1934_MAF30_1681178_exon3genetic_marker
RosBREEDSNP_SNP_AG_6473268_Lg14_01401_MAF30_1657233_exon1RosBREEDSNP_SNP_AG_6473268_Lg14_01401_MAF30_1657233_exon1genetic_marker
GDsnp02214GDsnp02214genetic_marker
GDsnp02706GDsnp02706genetic_marker
RosBREEDSNP_SNP_GA_5852068_Lg1_00152_MAF20_MDP0000220135_exon3RosBREEDSNP_SNP_GA_5852068_Lg1_00152_MAF20_MDP0000220135_exon3genetic_marker
RosBREEDSNP_SNP_CT_5860779_Lg1_00152_MAF10_MDP0000483599_exon1RosBREEDSNP_SNP_CT_5860779_Lg1_00152_MAF10_MDP0000483599_exon1genetic_marker

Pages

Featuremaps
This publication contains information about 2 maps:
Map Name
Apple-M432-2013
Apple-M432-2013-physical-Malus-domestica