Evaluation of SNP Data from the Malus Infinium Array Identifies Challenges for Genetic Analysis of Complex Genomes of Polyploid Origin

Publication Overview
TitleEvaluation of SNP Data from the Malus Infinium Array Identifies Challenges for Genetic Analysis of Complex Genomes of Polyploid Origin
AuthorsTroggio M, Surbanovski N, Bianco L, Moretto M, Giongo L, Banchi E, Viola R, Fernández FF, Costa F, Velasco R, Cestaro A, Sargent DJ
TypeJournal Article
Journal NamePloS one
Volume8
Issue6
Year2013
Page(s)e67407
CitationTroggio M, Surbanovski N, Bianco L, Moretto M, Giongo L, Banchi E, Viola R, Fernández FF, Costa F, Velasco R, Cestaro A, Sargent DJ. Evaluation of SNP Data from the Malus Infinium Array Identifies Challenges for Genetic Analysis of Complex Genomes of Polyploid Origin. PloS one. 2013; 8(6):e67407.

Abstract

High throughput arrays for the simultaneous genotyping of thousands of single-nucleotide polymorphisms (SNPs) have made the rapid genetic characterisation of plant genomes and the development of saturated linkage maps a realistic prospect for many plant species of agronomic importance. However, the correct calling of SNP genotypes in divergent polyploid genomes using array technology can be problematic due to paralogy, and to divergence in probe sequences causing changes in probe binding efficiencies. An Illumina Infinium II whole-genome genotyping array was recently developed for the cultivated apple and used to develop a molecular linkage map for an apple rootstock progeny (M432), but a large proportion of segregating SNPs were not mapped in the progeny, due to unexpected genotype clustering patterns. To investigate the causes of this unexpected clustering we performed BLAST analysis of all probe sequences against the 'Golden Delicious' genome sequence and discovered evidence for paralogous annealing sites and probe sequence divergence for a high proportion of probes contained on the array. Following visual re-evaluation of the genotyping data generated for 8,788 SNPs for the M432 progeny using the array, we manually re-scored genotypes at 818 loci and mapped a further 797 markers to the M432 linkage map. The newly mapped markers included the majority of those that could not be mapped previously, as well as loci that were previously scored as monomorphic, but which segregated due to divergence leading to heterozygosity in probe annealing sites. An evaluation of the 8,788 probes in a diverse collection of Malus germplasm showed that more than half the probes returned genotype clustering patterns that were difficult or impossible to interpret reliably, highlighting implications for the use of the array in genome-wide association studies.

Features
This publication contains information about 818 features:
Feature NameUniquenameType
RosBREEDSNP_SNP_CT_35615663_Lg12_01855_MAF20_1653033_exon2RosBREEDSNP_SNP_CT_35615663_Lg12_01855_MAF20_1653033_exon2genetic_marker
RosBREEDSNP_SNP_CT_35144800_Lg12_01665_MAF50_1680916_exon3RosBREEDSNP_SNP_CT_35144800_Lg12_01665_MAF50_1680916_exon3genetic_marker
RosBREEDSNP_SNP_TC_35605655_Lg12_01855_MAF10_MDP0000216404_exon5RosBREEDSNP_SNP_TC_35605655_Lg12_01855_MAF10_MDP0000216404_exon5genetic_marker
RosBREEDSNP_SNP_AG_35171763_Lg12_01665_MAF10_1643807_exon3RosBREEDSNP_SNP_AG_35171763_Lg12_01665_MAF10_1643807_exon3genetic_marker
RosBREEDSNP_SNP_AG_35906824_Lg12_325708_MAF30_325708_exon1RosBREEDSNP_SNP_AG_35906824_Lg12_325708_MAF30_325708_exon1genetic_marker
RosBREEDSNP_SNP_CA_35395640_Lg12_01793_MAF40_MDP0000276157_exon2RosBREEDSNP_SNP_CA_35395640_Lg12_01793_MAF40_MDP0000276157_exon2genetic_marker
RosBREEDSNP_SNP_AG_35640118_Lg12_01855_MAF10_390812_exon1RosBREEDSNP_SNP_AG_35640118_Lg12_01855_MAF10_390812_exon1genetic_marker
RosBREEDSNP_SNP_CT_35398250_Lg12_01793_MAF20_1632370_exon1RosBREEDSNP_SNP_CT_35398250_Lg12_01793_MAF20_1632370_exon1genetic_marker
RosBREED_RosBREEDSNP_SNP_CA_36256785_Lg12RosBREED_RosBREEDSNP_SNP_CA_36256785_Lg12genetic_marker
RosBREEDSNP_SNP_GA_35149501_Lg12_01665_MAF40_127871_exon1RosBREEDSNP_SNP_GA_35149501_Lg12_01665_MAF40_127871_exon1genetic_marker
GDsnp00368GDsnp00368genetic_marker
RosBREEDSNP_SNP_TG_564831_Lg13_01917_MAF30_1635090_exon3RosBREEDSNP_SNP_TG_564831_Lg13_01917_MAF30_1635090_exon3genetic_marker
RosBREEDSNP_SNP_GA_2188843_Lg13_00098_MAF30_1687731_exon1RosBREEDSNP_SNP_GA_2188843_Lg13_00098_MAF30_1687731_exon1genetic_marker
RosBREEDSNP_SNP_GT_2161379_Lg13_00098_MAF40_MDP0000635134_exon1RosBREEDSNP_SNP_GT_2161379_Lg13_00098_MAF40_MDP0000635134_exon1genetic_marker
RosBREEDSNP_SNP_GA_1181616_Lg16_01588_MAF20_1632539_exon4RosBREEDSNP_SNP_GA_1181616_Lg16_01588_MAF20_1632539_exon4genetic_marker
RosBREEDSNP_SNP_TC_9191531_Lg13_01742_MAF30_1632702_exon3RosBREEDSNP_SNP_TC_9191531_Lg13_01742_MAF30_1632702_exon3genetic_marker
RosBREEDSNP_SNP_CT_1898890_Lg16_00047_MAF10_1669458_exon1RosBREEDSNP_SNP_CT_1898890_Lg16_00047_MAF10_1669458_exon1genetic_marker
RosBREEDSNP_SNP_TC_3078798_Lg16_01116_MAF40_1625909_exon1RosBREEDSNP_SNP_TC_3078798_Lg16_01116_MAF40_1625909_exon1genetic_marker
RosBREEDSNP_SNP_CA_3081478_Lg16_01116_MAF20_MDP0000151597_exon1RosBREEDSNP_SNP_CA_3081478_Lg16_01116_MAF20_MDP0000151597_exon1genetic_marker
RosBREEDSNP_SNP_CA_2951030_Lg2_RosCOS2955_MAF10_940098_exon1RosBREEDSNP_SNP_CA_2951030_Lg2_RosCOS2955_MAF10_940098_exon1genetic_marker
RosBREEDSNP_SNP_GA_6451264_Lg13_65890_MAF20_65890_exon1RosBREEDSNP_SNP_GA_6451264_Lg13_65890_MAF20_65890_exon1genetic_marker
RosBREEDSNP_SNP_CA_8622383_Lg13_RosCOS2067_MAF30_1632184_exon1RosBREEDSNP_SNP_CA_8622383_Lg13_RosCOS2067_MAF30_1632184_exon1genetic_marker
RosBREEDSNP_SNP_TC_7514383_Lg13_AT4_MAF40_MDP0000279182_exon2RosBREEDSNP_SNP_TC_7514383_Lg13_AT4_MAF40_MDP0000279182_exon2genetic_marker
RosBREEDSNP_SNP_TC_8705446_Lg13_RosCOS2067_MAF10_MDP0000187392_exon5RosBREEDSNP_SNP_TC_8705446_Lg13_RosCOS2067_MAF10_MDP0000187392_exon5genetic_marker
RosBREEDSNP_SNP_GA_10657898_Lg13_00140_MAF30_1627277_exon5RosBREEDSNP_SNP_GA_10657898_Lg13_00140_MAF30_1627277_exon5genetic_marker

Pages

Featuremaps
This publication contains information about 2 maps:
Map Name
Apple-M432-2013
Apple-M432-2013-physical-Malus-domestica