Evaluation of SNP Data from the Malus Infinium Array Identifies Challenges for Genetic Analysis of Complex Genomes of Polyploid Origin

Publication Overview
Title: Evaluation of SNP Data from the Malus Infinium Array Identifies Challenges for Genetic Analysis of Complex Genomes of Polyploid Origin
Authors: Troggio M, Surbanovski N, Bianco L, Moretto M, Giongo L, Banchi E, Viola R, Fernández FF, Costa F, Velasco R, Cestaro A, Sargent DJ
Type: Journal Article
Journal Name: PloS one
Volume: 8
Issue: 6
Year: 2013
Page(s): e67407
Citation: Troggio M, Surbanovski N, Bianco L, Moretto M, Giongo L, Banchi E, Viola R, Fernández FF, Costa F, Velasco R, Cestaro A, Sargent DJ. Evaluation of SNP Data from the Malus Infinium Array Identifies Challenges for Genetic Analysis of Complex Genomes of Polyploid Origin. PloS one. 2013; 8(6):e67407.

Abstract

High throughput arrays for the simultaneous genotyping of thousands of single-nucleotide polymorphisms (SNPs) have made the rapid genetic characterisation of plant genomes and the development of saturated linkage maps a realistic prospect for many plant species of agronomic importance. However, the correct calling of SNP genotypes in divergent polyploid genomes using array technology can be problematic due to paralogy, and to divergence in probe sequences causing changes in probe binding efficiencies. An Illumina Infinium II whole-genome genotyping array was recently developed for the cultivated apple and used to develop a molecular linkage map for an apple rootstock progeny (M432), but a large proportion of segregating SNPs were not mapped in the progeny, due to unexpected genotype clustering patterns. To investigate the causes of this unexpected clustering we performed BLAST analysis of all probe sequences against the 'Golden Delicious' genome sequence and discovered evidence for paralogous annealing sites and probe sequence divergence for a high proportion of probes contained on the array. Following visual re-evaluation of the genotyping data generated for 8,788 SNPs for the M432 progeny using the array, we manually re-scored genotypes at 818 loci and mapped a further 797 markers to the M432 linkage map. The newly mapped markers included the majority of those that could not be mapped previously, as well as loci that were previously scored as monomorphic, but which segregated due to divergence leading to heterozygosity in probe annealing sites. An evaluation of the 8,788 probes in a diverse collection of Malus germplasm showed that more than half the probes returned genotype clustering patterns that were difficult or impossible to interpret reliably, highlighting implications for the use of the array in genome-wide association studies.

Features
This publication contains information about 818 features:
Feature Name | Uniquename | Type
RosBREEDSNP_SNP_CA_1447290_Lg9_01339_MAF40_154165_exon1 | RosBREEDSNP_SNP_CA_1447290_Lg9_01339_MAF40_154165_exon1 | genetic_marker
RosBREEDSNP_SNP_GA_34473516_Lg5_RosCOS2737_MAF40_1636172_exon1 | RosBREEDSNP_SNP_GA_34473516_Lg5_RosCOS2737_MAF40_1636172_exon1 | genetic_marker
RosBREEDSNP_SNP_TC_11852167_Lg7_85261_MAF40_85261_exon1 | RosBREEDSNP_SNP_TC_11852167_Lg7_85261_MAF40_85261_exon1 | genetic_marker
RosBREEDSNP_SNP_TC_37417867_Lg5_02674_MAF40_MDP0000396187_exon1 | RosBREEDSNP_SNP_TC_37417867_Lg5_02674_MAF40_MDP0000396187_exon1 | genetic_marker
RosBREEDSNP_SNP_CT_24445080_Lg5_AT11_MAF30_1661846_exon2 | RosBREEDSNP_SNP_CT_24445080_Lg5_AT11_MAF30_1661846_exon2 | genetic_marker
RosBREEDSNP_SNP_CT_1301573_Lg10_RosCOS2296_MAF20_233424_exon1 | RosBREEDSNP_SNP_CT_1301573_Lg10_RosCOS2296_MAF20_233424_exon1 | genetic_marker
Pear_Bartlett_RosBREEDSNP_SNP_CT_203_5139511 | Pear_Bartlett_RosBREEDSNP_SNP_CT_203_5139511 | genetic_marker
RosBREEDSNP_SNP_CT_2609621_Lg10_00839_MAF20_1656281_exon2 | RosBREEDSNP_SNP_CT_2609621_Lg10_00839_MAF20_1656281_exon2 | genetic_marker
RosBREEDSNP_SNP_CT_8577910_Lg17_00039_MAF20_MDP0000409121_exon1 | RosBREEDSNP_SNP_CT_8577910_Lg17_00039_MAF20_MDP0000409121_exon1 | genetic_marker
RosBREEDSNP_SNP_GA_3758545_Lg7_254973_MAF30_254973_exon1 | RosBREEDSNP_SNP_GA_3758545_Lg7_254973_MAF30_254973_exon1 | genetic_marker
RosBREEDSNP_SNP_GA_45033031_Lg15_ACS_MAF10_MDP0000309577_exon2 | RosBREEDSNP_SNP_GA_45033031_Lg15_ACS_MAF10_MDP0000309577_exon2 | genetic_marker
RosBREEDSNP_SNP_GA_31080088_Lg12_00915_MAF20_MDP0000306076_exon4 | RosBREEDSNP_SNP_GA_31080088_Lg12_00915_MAF20_MDP0000306076_exon4 | genetic_marker
RosBREEDSNP_SNP_AG_7269986_Lg10_120493_MAF20_120493_exon2 | RosBREEDSNP_SNP_AG_7269986_Lg10_120493_MAF20_120493_exon2 | genetic_marker
RosBREEDSNP_SNP_CT_3019034_Lg6_RosCOS408_MAF50_MDP0000412907_exon2 | RosBREEDSNP_SNP_CT_3019034_Lg6_RosCOS408_MAF50_MDP0000412907_exon2 | genetic_marker
RosBREEDSNP_SNP_GT_31094539_Lg12_00915_MAF30_405687_exon1 | RosBREEDSNP_SNP_GT_31094539_Lg12_00915_MAF30_405687_exon1 | genetic_marker
RosBREEDSNP_SNP_GA_31106692_Lg12_00915_MAF30_769120_exon2 | RosBREEDSNP_SNP_GA_31106692_Lg12_00915_MAF30_769120_exon2 | genetic_marker
RosBREEDSNP_SNP_TC_22732031_Lg17_00798_MAF50_MDP0000307493_exon5 | RosBREEDSNP_SNP_TC_22732031_Lg17_00798_MAF50_MDP0000307493_exon5 | genetic_marker
RosBREEDSNP_SNP_GT_2991122_Lg6_RosCOS408_MAF50_1685757_exon2 | RosBREEDSNP_SNP_GT_2991122_Lg6_RosCOS408_MAF50_1685757_exon2 | genetic_marker
RosBREEDSNP_SNP_GA_2988366_Lg6_RosCOS408_MAF20_1664413_exon1 | RosBREEDSNP_SNP_GA_2988366_Lg6_RosCOS408_MAF20_1664413_exon1 | genetic_marker
RosBREEDSNP_SNP_CT_4097483_Lg6_00181_MAF40_343474_exon1 | RosBREEDSNP_SNP_CT_4097483_Lg6_00181_MAF40_343474_exon1 | genetic_marker
RosBREEDSNP_SNP_GA_4870027_Lg6_325347_MAF40_325347_exon1 | RosBREEDSNP_SNP_GA_4870027_Lg6_325347_MAF40_325347_exon1 | genetic_marker
RosBREEDSNP_SNP_CA_5180646_Lg6_00082_MAF40_MDP0000693038_exon1 | RosBREEDSNP_SNP_CA_5180646_Lg6_00082_MAF40_MDP0000693038_exon1 | genetic_marker
RosBREEDSNP_SNP_GT_5208523_Lg6_00082_MAF40_MDP0000677395_exon1 | RosBREEDSNP_SNP_GT_5208523_Lg6_00082_MAF40_MDP0000677395_exon1 | genetic_marker
RosBREEDSNP_SNP_CT_10675958_Lg13_00140_MAF30_MDP0000264431_exon10 | RosBREEDSNP_SNP_CT_10675958_Lg13_00140_MAF30_MDP0000264431_exon10 | genetic_marker
RosBREEDSNP_SNP_TC_9338293_Lg6_02258_MAF40_1675875_exon2 | RosBREEDSNP_SNP_TC_9338293_Lg6_02258_MAF40_1675875_exon2 | genetic_marker

Featuremaps
This publication contains information about 2 maps:
Map Name
Apple-M432-2013
Apple-M432-2013-physical-Malus-domestica