Evaluation of SNP Data from the Malus Infinium Array Identifies Challenges for Genetic Analysis of Complex Genomes of Polyploid Origin

Publication Overview
TitleEvaluation of SNP Data from the Malus Infinium Array Identifies Challenges for Genetic Analysis of Complex Genomes of Polyploid Origin
AuthorsTroggio M, Surbanovski N, Bianco L, Moretto M, Giongo L, Banchi E, Viola R, Fernández FF, Costa F, Velasco R, Cestaro A, Sargent DJ
TypeJournal Article
Journal NamePloS one
Volume8
Issue6
Year2013
Page(s)e67407
CitationTroggio M, Surbanovski N, Bianco L, Moretto M, Giongo L, Banchi E, Viola R, Fernández FF, Costa F, Velasco R, Cestaro A, Sargent DJ. Evaluation of SNP Data from the Malus Infinium Array Identifies Challenges for Genetic Analysis of Complex Genomes of Polyploid Origin. PloS one. 2013; 8(6):e67407.

Abstract

High throughput arrays for the simultaneous genotyping of thousands of single-nucleotide polymorphisms (SNPs) have made the rapid genetic characterisation of plant genomes and the development of saturated linkage maps a realistic prospect for many plant species of agronomic importance. However, the correct calling of SNP genotypes in divergent polyploid genomes using array technology can be problematic due to paralogy, and to divergence in probe sequences causing changes in probe binding efficiencies. An Illumina Infinium II whole-genome genotyping array was recently developed for the cultivated apple and used to develop a molecular linkage map for an apple rootstock progeny (M432), but a large proportion of segregating SNPs were not mapped in the progeny, due to unexpected genotype clustering patterns. To investigate the causes of this unexpected clustering we performed BLAST analysis of all probe sequences against the 'Golden Delicious' genome sequence and discovered evidence for paralogous annealing sites and probe sequence divergence for a high proportion of probes contained on the array. Following visual re-evaluation of the genotyping data generated for 8,788 SNPs for the M432 progeny using the array, we manually re-scored genotypes at 818 loci and mapped a further 797 markers to the M432 linkage map. The newly mapped markers included the majority of those that could not be mapped previously, as well as loci that were previously scored as monomorphic, but which segregated due to divergence leading to heterozygosity in probe annealing sites. An evaluation of the 8,788 probes in a diverse collection of Malus germplasm showed that more than half the probes returned genotype clustering patterns that were difficult or impossible to interpret reliably, highlighting implications for the use of the array in genome-wide association studies.

Features
This publication contains information about 818 features:
Feature NameUniquenameType
RosBREEDSNP_SNP_AG_7131583_Lg7_01872_MAF40_1648950_exon1RosBREEDSNP_SNP_AG_7131583_Lg7_01872_MAF40_1648950_exon1genetic_marker
RosBREEDSNP_SNP_GA_2305855_Lg16_00719_MAF30_767049_exon1RosBREEDSNP_SNP_GA_2305855_Lg16_00719_MAF30_767049_exon1genetic_marker
RosBREEDSNP_SNP_AG_8026567_Lg7_02619_MAF10_MDP0000322530_exon2RosBREEDSNP_SNP_AG_8026567_Lg7_02619_MAF10_MDP0000322530_exon2genetic_marker
RosBREEDSNP_SNP_TG_9956676_Lg7_328139_MAF20_328139_exon1RosBREEDSNP_SNP_TG_9956676_Lg7_328139_MAF20_328139_exon1genetic_marker
RosBREEDSNP_SNP_GT_9956304_Lg7_328139_MAF20_328139_exon1RosBREEDSNP_SNP_GT_9956304_Lg7_328139_MAF20_328139_exon1genetic_marker
RosBREEDSNP_SNP_GA_9254965_Lg7_RosCOS2471_MAF50_MDP0000308927_exon4RosBREEDSNP_SNP_GA_9254965_Lg7_RosCOS2471_MAF50_MDP0000308927_exon4genetic_marker
RosBREEDSNP_SNP_CT_9259306_Lg7_RosCOS2471_MAF50_MDP0000308927_exon2RosBREEDSNP_SNP_CT_9259306_Lg7_RosCOS2471_MAF50_MDP0000308927_exon2genetic_marker
RosBREEDSNP_SNP_GA_25671752_Lg2_01935_MAF10_1619549_exon3RosBREEDSNP_SNP_GA_25671752_Lg2_01935_MAF10_1619549_exon3genetic_marker
snpEB106582snpEB106582genetic_marker
RosBREEDSNP_SNP_GT_11564786_Lg7_266124__266124_exon3RosBREEDSNP_SNP_GT_11564786_Lg7_266124__266124_exon3genetic_marker
RosBREEDSNP_SNP_CA_11564586_Lg7_266124_MAF10_266124_exon2RosBREEDSNP_SNP_CA_11564586_Lg7_266124_MAF10_266124_exon2genetic_marker
RosBREED_RosBREEDSNP_SNP_AG_22298896_Lg5RosBREED_RosBREEDSNP_SNP_AG_22298896_Lg5genetic_marker
RosBREEDSNP_SNP_CT_11043303_Lg7_01882_MAF20_546862_exon1RosBREEDSNP_SNP_CT_11043303_Lg7_01882_MAF20_546862_exon1genetic_marker
RosBREEDSNP_SNP_CT_43012110_Lg15_RosCOS1842_MAF40_9800_exon1RosBREEDSNP_SNP_CT_43012110_Lg15_RosCOS1842_MAF40_9800_exon1genetic_marker
RosBREEDSNP_SNP_CT_19090925_Lg7_02875_MAF20_MDP0000242117_exon3RosBREEDSNP_SNP_CT_19090925_Lg7_02875_MAF20_MDP0000242117_exon3genetic_marker
RosBREEDSNP_SNP_CT_19126204_Lg7_02875_MAF30_1626168_exon5RosBREEDSNP_SNP_CT_19126204_Lg7_02875_MAF30_1626168_exon5genetic_marker
RosBREEDSNP_SNP_TC_14006533_Lg1_01889_MAF20_MDP0000263325_exon3RosBREEDSNP_SNP_TC_14006533_Lg1_01889_MAF20_MDP0000263325_exon3genetic_marker
RosBREEDSNP_SNP_TC_16998097_Lg1_MDP0000595715__MDP0000595715_exon1RosBREEDSNP_SNP_TC_16998097_Lg1_MDP0000595715__MDP0000595715_exon1genetic_marker
RosBREEDSNP_SNP_GT_17264077_Lg1_RosCOS3372_MAF50_762097_exon1RosBREEDSNP_SNP_GT_17264077_Lg1_RosCOS3372_MAF50_762097_exon1genetic_marker
RosBREEDSNP_SNP_GA_24336685_Lg7_01756_MAF20_MDP0000151871_exon5RosBREEDSNP_SNP_GA_24336685_Lg7_01756_MAF20_MDP0000151871_exon5genetic_marker
GDsnp01756GDsnp01756genetic_marker
RosBREEDSNP_SNP_TG_6955011_Lg8_RosCOS3599_MAF20_MDP0000420644_exon1RosBREEDSNP_SNP_TG_6955011_Lg8_RosCOS3599_MAF20_MDP0000420644_exon1genetic_marker
RosBREEDSNP_SNP_TC_6980732_Lg8_RosCOS3599_MAF20_MDP0000622737_exon1RosBREEDSNP_SNP_TC_6980732_Lg8_RosCOS3599_MAF20_MDP0000622737_exon1genetic_marker
GDsnp00699GDsnp00699genetic_marker
RosBREEDSNP_SNP_CT_25206470_Lg7_00699_MAF30_MDP0000300347_exon1RosBREEDSNP_SNP_CT_25206470_Lg7_00699_MAF30_MDP0000300347_exon1genetic_marker

Pages

Featuremaps
This publication contains information about 2 maps:
Map Name
Apple-M432-2013
Apple-M432-2013-physical-Malus-domestica