Evaluation of SNP Data from the Malus Infinium Array Identifies Challenges for Genetic Analysis of Complex Genomes of Polyploid Origin

Publication Overview
TitleEvaluation of SNP Data from the Malus Infinium Array Identifies Challenges for Genetic Analysis of Complex Genomes of Polyploid Origin
AuthorsTroggio M, Surbanovski N, Bianco L, Moretto M, Giongo L, Banchi E, Viola R, Fernández FF, Costa F, Velasco R, Cestaro A, Sargent DJ
TypeJournal Article
Journal NamePloS one
Volume8
Issue6
Year2013
Page(s)e67407
CitationTroggio M, Surbanovski N, Bianco L, Moretto M, Giongo L, Banchi E, Viola R, Fernández FF, Costa F, Velasco R, Cestaro A, Sargent DJ. Evaluation of SNP Data from the Malus Infinium Array Identifies Challenges for Genetic Analysis of Complex Genomes of Polyploid Origin. PloS one. 2013; 8(6):e67407.

Abstract

High throughput arrays for the simultaneous genotyping of thousands of single-nucleotide polymorphisms (SNPs) have made the rapid genetic characterisation of plant genomes and the development of saturated linkage maps a realistic prospect for many plant species of agronomic importance. However, the correct calling of SNP genotypes in divergent polyploid genomes using array technology can be problematic due to paralogy, and to divergence in probe sequences causing changes in probe binding efficiencies. An Illumina Infinium II whole-genome genotyping array was recently developed for the cultivated apple and used to develop a molecular linkage map for an apple rootstock progeny (M432), but a large proportion of segregating SNPs were not mapped in the progeny, due to unexpected genotype clustering patterns. To investigate the causes of this unexpected clustering we performed BLAST analysis of all probe sequences against the 'Golden Delicious' genome sequence and discovered evidence for paralogous annealing sites and probe sequence divergence for a high proportion of probes contained on the array. Following visual re-evaluation of the genotyping data generated for 8,788 SNPs for the M432 progeny using the array, we manually re-scored genotypes at 818 loci and mapped a further 797 markers to the M432 linkage map. The newly mapped markers included the majority of those that could not be mapped previously, as well as loci that were previously scored as monomorphic, but which segregated due to divergence leading to heterozygosity in probe annealing sites. An evaluation of the 8,788 probes in a diverse collection of Malus germplasm showed that more than half the probes returned genotype clustering patterns that were difficult or impossible to interpret reliably, highlighting implications for the use of the array in genome-wide association studies.

Features
This publication contains information about 818 features:
Feature NameUniquenameType
RosBREEDSNP_SNP_CA_20361226_Lg6_225863_MAF40_225863_exon1RosBREEDSNP_SNP_CA_20361226_Lg6_225863_MAF40_225863_exon1genetic_marker
RosBREEDSNP_SNP_TG_5226467_Lg7_MDP0000152302_MAF40_MDP0000152302_exon3RosBREEDSNP_SNP_TG_5226467_Lg7_MDP0000152302_MAF40_MDP0000152302_exon3genetic_marker
RosBREEDSNP_SNP_AC_22216427_Lg6_02165_MAF50_1684365_exon3RosBREEDSNP_SNP_AC_22216427_Lg6_02165_MAF50_1684365_exon3genetic_marker
RosBREEDSNP_SNP_GA_5404705_Lg7_324951_MAF20_324951_exon1RosBREEDSNP_SNP_GA_5404705_Lg7_324951_MAF20_324951_exon1genetic_marker
RosBREEDSNP_SNP_CT_23527470_Lg6_01502_MAF10_1672298_exon1RosBREEDSNP_SNP_CT_23527470_Lg6_01502_MAF10_1672298_exon1genetic_marker
RosBREEDSNP_SNP_GA_23443053_Lg14_00062_MAF40_1688347_exon5RosBREEDSNP_SNP_GA_23443053_Lg14_00062_MAF40_1688347_exon5genetic_marker
RosBREEDSNP_SNP_GA_27789755_Lg6_RosCOS750_MAF10_MDP0000600124_exon3RosBREEDSNP_SNP_GA_27789755_Lg6_RosCOS750_MAF10_MDP0000600124_exon3genetic_marker
RosBREEDSNP_SNP_GA_30104465_Lg6_02138_MAF30_820238_exon1RosBREEDSNP_SNP_GA_30104465_Lg6_02138_MAF30_820238_exon1genetic_marker
RosBREEDSNP_SNP_GA_27356847_Lg6_RosCOS1881_MAF40_1653566_exon2RosBREEDSNP_SNP_GA_27356847_Lg6_RosCOS1881_MAF40_1653566_exon2genetic_marker
RosBREEDSNP_SNP_AG_2832830_Lg7_00802_MAF20_MDP0000388416_exon2RosBREEDSNP_SNP_AG_2832830_Lg7_00802_MAF20_MDP0000388416_exon2genetic_marker
RosBREEDSNP_SNP_CT_3581379_Lg7_01717_MAF50_1654084_exon3RosBREEDSNP_SNP_CT_3581379_Lg7_01717_MAF50_1654084_exon3genetic_marker
RosBREEDSNP_SNP_TC_3587218_Lg7_01717_MAF40_562136_exon1RosBREEDSNP_SNP_TC_3587218_Lg7_01717_MAF40_562136_exon1genetic_marker
RosBREEDSNP_SNP_AC_35533813_Lg2_00267_MAF10_MDP0000465049_exon1RosBREEDSNP_SNP_AC_35533813_Lg2_00267_MAF10_MDP0000465049_exon1genetic_marker
RosBREEDSNP_SNP_TG_31146800_Lg12_01759_MAF10_MDP0000663289_exon1RosBREEDSNP_SNP_TG_31146800_Lg12_01759_MAF10_MDP0000663289_exon1genetic_marker
RosBREEDSNP_SNP_AG_3860536_Lg7_RosCOS418_MAF40_MDP0000281358_exon1RosBREEDSNP_SNP_AG_3860536_Lg7_RosCOS418_MAF40_MDP0000281358_exon1genetic_marker
RosBREEDSNP_SNP_CT_28113435_Lg1_108136_MAF30_108136_exon1RosBREEDSNP_SNP_CT_28113435_Lg1_108136_MAF30_108136_exon1genetic_marker
RosBREEDSNP_SNP_TC_4180170_Lg7_RosCOS3240_MAF50_MDP0000127900_exon4RosBREEDSNP_SNP_TC_4180170_Lg7_RosCOS3240_MAF50_MDP0000127900_exon4genetic_marker
RosBREEDSNP_SNP_AC_4393940_Lg7_01170_MAF30_MDP0000696109_exon3RosBREEDSNP_SNP_AC_4393940_Lg7_01170_MAF30_MDP0000696109_exon3genetic_marker
RosBREEDSNP_SNP_TC_6102142_Lg7_01988_MAF30_MDP0000154685_exon3RosBREEDSNP_SNP_TC_6102142_Lg7_01988_MAF30_MDP0000154685_exon3genetic_marker
RosBREEDSNP_SNP_CT_5721533_Lg17_00606_MAF30_10746_exon1RosBREEDSNP_SNP_CT_5721533_Lg17_00606_MAF30_10746_exon1genetic_marker
RosBREEDSNP_SNP_AG_4477859_Lg7_8384__8384_exon1RosBREEDSNP_SNP_AG_4477859_Lg7_8384__8384_exon1genetic_marker
RosBREEDSNP_SNP_TC_6098706_Lg7_01988_MAF30_MDP0000220903_exon1RosBREEDSNP_SNP_TC_6098706_Lg7_01988_MAF30_MDP0000220903_exon1genetic_marker
RosBREEDSNP_SNP_GA_35134996_Lg5_62137_MAF20_62137_exon1RosBREEDSNP_SNP_GA_35134996_Lg5_62137_MAF20_62137_exon1genetic_marker
Pear_Bartlett_RosBREEDSNP_SNP_CT_423_5093752Pear_Bartlett_RosBREEDSNP_SNP_CT_423_5093752genetic_marker
Pear_Bartlett_RosBREEDSNP_SNP_GA_154_5103263Pear_Bartlett_RosBREEDSNP_SNP_GA_154_5103263genetic_marker

Pages

Featuremaps
This publication contains information about 2 maps:
Map Name
Apple-M432-2013
Apple-M432-2013-physical-Malus-domestica