Evaluation of SNP Data from the Malus Infinium Array Identifies Challenges for Genetic Analysis of Complex Genomes of Polyploid Origin

Publication Overview
TitleEvaluation of SNP Data from the Malus Infinium Array Identifies Challenges for Genetic Analysis of Complex Genomes of Polyploid Origin
AuthorsTroggio M, Surbanovski N, Bianco L, Moretto M, Giongo L, Banchi E, Viola R, Fernández FF, Costa F, Velasco R, Cestaro A, Sargent DJ
TypeJournal Article
Journal NamePloS one
Volume8
Issue6
Year2013
Page(s)e67407
CitationTroggio M, Surbanovski N, Bianco L, Moretto M, Giongo L, Banchi E, Viola R, Fernández FF, Costa F, Velasco R, Cestaro A, Sargent DJ. Evaluation of SNP Data from the Malus Infinium Array Identifies Challenges for Genetic Analysis of Complex Genomes of Polyploid Origin. PloS one. 2013; 8(6):e67407.

Abstract

High throughput arrays for the simultaneous genotyping of thousands of single-nucleotide polymorphisms (SNPs) have made the rapid genetic characterisation of plant genomes and the development of saturated linkage maps a realistic prospect for many plant species of agronomic importance. However, the correct calling of SNP genotypes in divergent polyploid genomes using array technology can be problematic due to paralogy, and to divergence in probe sequences causing changes in probe binding efficiencies. An Illumina Infinium II whole-genome genotyping array was recently developed for the cultivated apple and used to develop a molecular linkage map for an apple rootstock progeny (M432), but a large proportion of segregating SNPs were not mapped in the progeny, due to unexpected genotype clustering patterns. To investigate the causes of this unexpected clustering we performed BLAST analysis of all probe sequences against the 'Golden Delicious' genome sequence and discovered evidence for paralogous annealing sites and probe sequence divergence for a high proportion of probes contained on the array. Following visual re-evaluation of the genotyping data generated for 8,788 SNPs for the M432 progeny using the array, we manually re-scored genotypes at 818 loci and mapped a further 797 markers to the M432 linkage map. The newly mapped markers included the majority of those that could not be mapped previously, as well as loci that were previously scored as monomorphic, but which segregated due to divergence leading to heterozygosity in probe annealing sites. An evaluation of the 8,788 probes in a diverse collection of Malus germplasm showed that more than half the probes returned genotype clustering patterns that were difficult or impossible to interpret reliably, highlighting implications for the use of the array in genome-wide association studies.

Features
This publication contains information about 818 features:
Feature NameUniquenameType
RosBREEDSNP_SNP_TG_45884863_Lg15_RosCOS1627_MAF30_362819_exon3RosBREEDSNP_SNP_TG_45884863_Lg15_RosCOS1627_MAF30_362819_exon3genetic_marker
RosBREEDSNP_SNP_TC_49247422_Lg15_01707_MAF30_509521_exon1RosBREEDSNP_SNP_TC_49247422_Lg15_01707_MAF30_509521_exon1genetic_marker
RosBREEDSNP_SNP_AG_49267247_Lg15_01707_MAF50_MDP0000158447_exon6RosBREEDSNP_SNP_AG_49267247_Lg15_01707_MAF50_MDP0000158447_exon6genetic_marker
RosBREEDSNP_SNP_TG_50064602_Lg15_01778_MAF40_MDP0000266480_exon3RosBREEDSNP_SNP_TG_50064602_Lg15_01778_MAF40_MDP0000266480_exon3genetic_marker
Pear_Bartlett_RosBREEDSNP_SNP_AG_252_5129042Pear_Bartlett_RosBREEDSNP_SNP_AG_252_5129042genetic_marker
RosBREEDSNP_SNP_TC_55610693_Lg15_327586_MAF20_327586_exon1RosBREEDSNP_SNP_TC_55610693_Lg15_327586_MAF20_327586_exon1genetic_marker
RosBREEDSNP_SNP_GA_51301407_Lg15_00236_MAF30_1651413_exon2RosBREEDSNP_SNP_GA_51301407_Lg15_00236_MAF30_1651413_exon2genetic_marker
RosBREEDSNP_SNP_AG_54518780_Lg15_01774_MAF30_1645728_exon1RosBREEDSNP_SNP_AG_54518780_Lg15_01774_MAF30_1645728_exon1genetic_marker
RosBREEDSNP_SNP_CT_55338420_Lg15_01392_MAF30_732277_exon1RosBREEDSNP_SNP_CT_55338420_Lg15_01392_MAF30_732277_exon1genetic_marker
RosBREEDSNP_SNP_GA_55332967_Lg15_01392_MAF20_MDP0000135349_exon8RosBREEDSNP_SNP_GA_55332967_Lg15_01392_MAF20_MDP0000135349_exon8genetic_marker
GDsnp01600GDsnp01600genetic_marker
RosBREEDSNP_SNP_AC_1548240_Lg16_LAR1_MAF30_287180_exon1RosBREEDSNP_SNP_AC_1548240_Lg16_LAR1_MAF30_287180_exon1genetic_marker
RosBREEDSNP_SNP_TC_3437104_Lg16_00353_MAF10_495480_exon1RosBREEDSNP_SNP_TC_3437104_Lg16_00353_MAF10_495480_exon1genetic_marker
RosBREEDSNP_SNP_CT_3066832_Lg16_01116_MAF50_MDP0000316569_exon2RosBREEDSNP_SNP_CT_3066832_Lg16_01116_MAF50_MDP0000316569_exon2genetic_marker
RosBREEDSNP_SNP_GA_4020543_Lg16_01179_MAF30_1659407_exon5RosBREEDSNP_SNP_GA_4020543_Lg16_01179_MAF30_1659407_exon5genetic_marker
GDsnp00555GDsnp00555genetic_marker
RosBREEDSNP_SNP_GA_3756948_Lg16_01186_MAF30_271874_exon1RosBREEDSNP_SNP_GA_3756948_Lg16_01186_MAF30_271874_exon1genetic_marker
RosBREEDSNP_SNP_CA_4428275_Lg16_182011_MAF10_182011_exon1RosBREEDSNP_SNP_CA_4428275_Lg16_182011_MAF10_182011_exon1genetic_marker
RosBREEDSNP_SNP_TC_4428268_Lg16_182011_MAF10_182011_exon1RosBREEDSNP_SNP_TC_4428268_Lg16_182011_MAF10_182011_exon1genetic_marker
RosBREEDSNP_SNP_TC_5365087_Lg16_00030_MAF40_1630198_exon1RosBREEDSNP_SNP_TC_5365087_Lg16_00030_MAF40_1630198_exon1genetic_marker
RosBREEDSNP_SNP_TC_6025455_Lg16_01003_MAF40_MDP0000121918_exon1RosBREEDSNP_SNP_TC_6025455_Lg16_01003_MAF40_MDP0000121918_exon1genetic_marker
RosBREEDSNP_SNP_CT_6047197_Lg16_01003_MAF50_153552_exon2RosBREEDSNP_SNP_CT_6047197_Lg16_01003_MAF50_153552_exon2genetic_marker
RosBREEDSNP_SNP_TC_17544043_Lg13_00390_MAF30_MDP0000427027_exon1RosBREEDSNP_SNP_TC_17544043_Lg13_00390_MAF30_MDP0000427027_exon1genetic_marker
Pear_Bartlett_RosBREEDSNP_SNP_TC_172_5079092Pear_Bartlett_RosBREEDSNP_SNP_TC_172_5079092genetic_marker
Pear_Bartlett_RosBREEDSNP_SNP_GA_448_5076902Pear_Bartlett_RosBREEDSNP_SNP_GA_448_5076902genetic_marker

Pages

Featuremaps
This publication contains information about 2 maps:
Map Name
Apple-M432-2013
Apple-M432-2013-physical-Malus-domestica