Evaluation of SNP Data from the Malus Infinium Array Identifies Challenges for Genetic Analysis of Complex Genomes of Polyploid Origin

Publication Overview
TitleEvaluation of SNP Data from the Malus Infinium Array Identifies Challenges for Genetic Analysis of Complex Genomes of Polyploid Origin
AuthorsTroggio M, Surbanovski N, Bianco L, Moretto M, Giongo L, Banchi E, Viola R, Fernández FF, Costa F, Velasco R, Cestaro A, Sargent DJ
TypeJournal Article
Journal NamePloS one
Volume8
Issue6
Year2013
Page(s)e67407
CitationTroggio M, Surbanovski N, Bianco L, Moretto M, Giongo L, Banchi E, Viola R, Fernández FF, Costa F, Velasco R, Cestaro A, Sargent DJ. Evaluation of SNP Data from the Malus Infinium Array Identifies Challenges for Genetic Analysis of Complex Genomes of Polyploid Origin. PloS one. 2013; 8(6):e67407.

Abstract

High throughput arrays for the simultaneous genotyping of thousands of single-nucleotide polymorphisms (SNPs) have made the rapid genetic characterisation of plant genomes and the development of saturated linkage maps a realistic prospect for many plant species of agronomic importance. However, the correct calling of SNP genotypes in divergent polyploid genomes using array technology can be problematic due to paralogy, and to divergence in probe sequences causing changes in probe binding efficiencies. An Illumina Infinium II whole-genome genotyping array was recently developed for the cultivated apple and used to develop a molecular linkage map for an apple rootstock progeny (M432), but a large proportion of segregating SNPs were not mapped in the progeny, due to unexpected genotype clustering patterns. To investigate the causes of this unexpected clustering we performed BLAST analysis of all probe sequences against the 'Golden Delicious' genome sequence and discovered evidence for paralogous annealing sites and probe sequence divergence for a high proportion of probes contained on the array. Following visual re-evaluation of the genotyping data generated for 8,788 SNPs for the M432 progeny using the array, we manually re-scored genotypes at 818 loci and mapped a further 797 markers to the M432 linkage map. The newly mapped markers included the majority of those that could not be mapped previously, as well as loci that were previously scored as monomorphic, but which segregated due to divergence leading to heterozygosity in probe annealing sites. An evaluation of the 8,788 probes in a diverse collection of Malus germplasm showed that more than half the probes returned genotype clustering patterns that were difficult or impossible to interpret reliably, highlighting implications for the use of the array in genome-wide association studies.

Features
This publication contains information about 818 features:
Feature NameUniquenameType
RosBREEDSNP_SNP_GA_49286326_Lg15_01707_MAF40_390435_exon1RosBREEDSNP_SNP_GA_49286326_Lg15_01707_MAF40_390435_exon1genetic_marker
Pear_Bartlett_RosBREEDSNP_SNP_AG_950_5071830Pear_Bartlett_RosBREEDSNP_SNP_AG_950_5071830genetic_marker
GDsnp00246GDsnp00246genetic_marker
RosBREEDSNP_SNP_GT_33137250_Lg14_00023_MAF50_776045_exon1RosBREEDSNP_SNP_GT_33137250_Lg14_00023_MAF50_776045_exon1genetic_marker
GDsnp00299GDsnp00299genetic_marker
RosBREEDSNP_SNP_CA_376810_Lg9_01189_MAF40_747730_exon1RosBREEDSNP_SNP_CA_376810_Lg9_01189_MAF40_747730_exon1genetic_marker
RosBREEDSNP_SNP_CA_998345_Lg9_01328_MAF50_813727_exon1RosBREEDSNP_SNP_CA_998345_Lg9_01328_MAF50_813727_exon1genetic_marker
RosBREEDSNP_SNP_AG_1405140_Lg9_01339_MAF50_MDP0000484463_exon1RosBREEDSNP_SNP_AG_1405140_Lg9_01339_MAF50_MDP0000484463_exon1genetic_marker
RosBREEDSNP_SNP_CT_1451072_Lg9_01339_MAF20_652059_exon2RosBREEDSNP_SNP_CT_1451072_Lg9_01339_MAF20_652059_exon2genetic_marker
RosBREEDSNP_SNP_CA_1790459_Lg9_00169_MAF10_1628909_exon8RosBREEDSNP_SNP_CA_1790459_Lg9_00169_MAF10_1628909_exon8genetic_marker
RosBREEDSNP_SNP_TG_2364606_Lg9_02134_MAF30_MDP0000390865_exon1RosBREEDSNP_SNP_TG_2364606_Lg9_02134_MAF30_MDP0000390865_exon1genetic_marker
RosBREEDSNP_SNP_AG_2249956_Lg9_95072_MAF20_95072_exon1RosBREEDSNP_SNP_AG_2249956_Lg9_95072_MAF20_95072_exon1genetic_marker
RosBREEDSNP_SNP_AG_2311570_Lg9_02134_MAF20_MDP0000218391_exon3RosBREEDSNP_SNP_AG_2311570_Lg9_02134_MAF20_MDP0000218391_exon3genetic_marker
RosBREEDSNP_SNP_AC_2052479_Lg9_01573_MAF10_516777_exon1RosBREEDSNP_SNP_AC_2052479_Lg9_01573_MAF10_516777_exon1genetic_marker
RosBREEDSNP_SNP_CT_3036440_Lg9_00679_MAF30_462560_exon1RosBREEDSNP_SNP_CT_3036440_Lg9_00679_MAF30_462560_exon1genetic_marker
RosBREEDSNP_SNP_TC_3077900_Lg9_00679_MAF50_MDP0000252227_exon11RosBREEDSNP_SNP_TC_3077900_Lg9_00679_MAF50_MDP0000252227_exon11genetic_marker
RosBREEDSNP_SNP_TG_9980877_Lg11_RosCOS3333_MAF20_MDP0000196079_exon2RosBREEDSNP_SNP_TG_9980877_Lg11_RosCOS3333_MAF20_MDP0000196079_exon2genetic_marker
RosBREEDSNP_SNP_GA_15499840_Lg5_01616_MAF10_MDP0000416441_exon2RosBREEDSNP_SNP_GA_15499840_Lg5_01616_MAF10_MDP0000416441_exon2genetic_marker
RosBREEDSNP_SNP_AG_3730213_Lg9_00605_MAF50_MDP0000205248_exon4RosBREEDSNP_SNP_AG_3730213_Lg9_00605_MAF50_MDP0000205248_exon4genetic_marker
GDsnp01603GDsnp01603genetic_marker
GDsnp00605GDsnp00605genetic_marker
RosBREEDSNP_SNP_AG_4461663_Lg9_RosCOS1285_MAF30_1687591_exon3RosBREEDSNP_SNP_AG_4461663_Lg9_RosCOS1285_MAF30_1687591_exon3genetic_marker
RosBREEDSNP_SNP_TC_5391429_Lg9_00514_MAF30_502274_exon1RosBREEDSNP_SNP_TC_5391429_Lg9_00514_MAF30_502274_exon1genetic_marker
RosBREEDSNP_SNP_TG_5372070_Lg9_00514_MAF20_1646977_exon3RosBREEDSNP_SNP_TG_5372070_Lg9_00514_MAF20_1646977_exon3genetic_marker
RosBREEDSNP_SNP_GA_11377339_Lg7_02293_MAF10_829562_exon1RosBREEDSNP_SNP_GA_11377339_Lg7_02293_MAF10_829562_exon1genetic_marker

Pages

Featuremaps
This publication contains information about 2 maps:
Map Name
Apple-M432-2013
Apple-M432-2013-physical-Malus-domestica