Evaluation of SNP Data from the Malus Infinium Array Identifies Challenges for Genetic Analysis of Complex Genomes of Polyploid Origin

Publication Overview
Title: Evaluation of SNP Data from the Malus Infinium Array Identifies Challenges for Genetic Analysis of Complex Genomes of Polyploid Origin
Authors: Troggio M, Surbanovski N, Bianco L, Moretto M, Giongo L, Banchi E, Viola R, Fernández FF, Costa F, Velasco R, Cestaro A, Sargent DJ
Type: Journal Article
Journal Name: PloS one
Volume: 8
Issue: 6
Year: 2013
Page(s): e67407
Citation: Troggio M, Surbanovski N, Bianco L, Moretto M, Giongo L, Banchi E, Viola R, Fernández FF, Costa F, Velasco R, Cestaro A, Sargent DJ. Evaluation of SNP Data from the Malus Infinium Array Identifies Challenges for Genetic Analysis of Complex Genomes of Polyploid Origin. PloS one. 2013; 8(6):e67407.

Abstract

High throughput arrays for the simultaneous genotyping of thousands of single-nucleotide polymorphisms (SNPs) have made the rapid genetic characterisation of plant genomes and the development of saturated linkage maps a realistic prospect for many plant species of agronomic importance. However, the correct calling of SNP genotypes in divergent polyploid genomes using array technology can be problematic due to paralogy, and to divergence in probe sequences causing changes in probe binding efficiencies. An Illumina Infinium II whole-genome genotyping array was recently developed for the cultivated apple and used to develop a molecular linkage map for an apple rootstock progeny (M432), but a large proportion of segregating SNPs were not mapped in the progeny, due to unexpected genotype clustering patterns. To investigate the causes of this unexpected clustering we performed BLAST analysis of all probe sequences against the 'Golden Delicious' genome sequence and discovered evidence for paralogous annealing sites and probe sequence divergence for a high proportion of probes contained on the array. Following visual re-evaluation of the genotyping data generated for 8,788 SNPs for the M432 progeny using the array, we manually re-scored genotypes at 818 loci and mapped a further 797 markers to the M432 linkage map. The newly mapped markers included the majority of those that could not be mapped previously, as well as loci that were previously scored as monomorphic, but which segregated due to divergence leading to heterozygosity in probe annealing sites. An evaluation of the 8,788 probes in a diverse collection of Malus germplasm showed that more than half the probes returned genotype clustering patterns that were difficult or impossible to interpret reliably, highlighting implications for the use of the array in genome-wide association studies.

Features
This publication contains information about 818 features:
Feature Name | Uniquename | Type
RosBREEDSNP_SNP_TC_16128155_Lg2_02686_MAF20_407655_exon1 | RosBREEDSNP_SNP_TC_16128155_Lg2_02686_MAF20_407655_exon1 | genetic_marker
RosBREEDSNP_SNP_AG_17180168_Lg2_327287_MAF30_327287_exon1 | RosBREEDSNP_SNP_AG_17180168_Lg2_327287_MAF30_327287_exon1 | genetic_marker
RosBREEDSNP_SNP_AC_23074147_Lg2_RosCOS292_MAF20_MDP0000590787_exon1 | RosBREEDSNP_SNP_AC_23074147_Lg2_RosCOS292_MAF20_MDP0000590787_exon1 | genetic_marker
RosBREEDSNP_SNP_CT_3825745_Lg13_00565_MAF10_613831_exon2 | RosBREEDSNP_SNP_CT_3825745_Lg13_00565_MAF10_613831_exon2 | genetic_marker
RosBREEDSNP_SNP_TC_4417988_Lg2_RosCOS2816_MAF10_1681929_exon2 | RosBREEDSNP_SNP_TC_4417988_Lg2_RosCOS2816_MAF10_1681929_exon2 | genetic_marker
RosBREEDSNP_SNP_GA_8302300_Lg8_MDP0000174369_MAF10_MDP0000174369_exon1 | RosBREEDSNP_SNP_GA_8302300_Lg8_MDP0000174369_MAF10_MDP0000174369_exon1 | genetic_marker
RosBREEDSNP_SNP_TC_27298641_Lg2_02271_MAF40_952256_exon1 | RosBREEDSNP_SNP_TC_27298641_Lg2_02271_MAF40_952256_exon1 | genetic_marker
RosBREEDSNP_SNP_GA_32422302_Lg2_327917_MAF30_327917_exon1 | RosBREEDSNP_SNP_GA_32422302_Lg2_327917_MAF30_327917_exon1 | genetic_marker
RosBREEDSNP_SNP_AG_32422411_Lg2_327917_MAF30_327917_exon1 | RosBREEDSNP_SNP_AG_32422411_Lg2_327917_MAF30_327917_exon1 | genetic_marker
RosBREEDSNP_SNP_AG_5396031_Lg10_327917_MAF30_327917_exon1 | RosBREEDSNP_SNP_AG_5396031_Lg10_327917_MAF30_327917_exon1 | genetic_marker
RosBREEDSNP_SNP_CT_7170435_Lg13_00128_MAF30_1660285_exon4 | RosBREEDSNP_SNP_CT_7170435_Lg13_00128_MAF30_1660285_exon4 | genetic_marker
RosBREEDSNP_SNP_AG_33487877_Lg2_167070_MAF10_167070_exon1 | RosBREEDSNP_SNP_AG_33487877_Lg2_167070_MAF10_167070_exon1 | genetic_marker
Pear_Bartlett_RosBREEDSNP_SNP_GT_272_5076160 | Pear_Bartlett_RosBREEDSNP_SNP_GT_272_5076160 | genetic_marker
RosBREEDSNP_SNP_TG_32981524_Lg2_02223_MAF20_MDP0000491627_exon1 | RosBREEDSNP_SNP_TG_32981524_Lg2_02223_MAF20_MDP0000491627_exon1 | genetic_marker
RosBREEDSNP_SNP_TC_5844818_Lg7_00658_MAF10_1652555_exon1 | RosBREEDSNP_SNP_TC_5844818_Lg7_00658_MAF10_1652555_exon1 | genetic_marker
RosBREEDSNP_SNP_GA_33959044_Lg2_01349_MAF40_MDP0000402893_exon1 | RosBREEDSNP_SNP_GA_33959044_Lg2_01349_MAF40_MDP0000402893_exon1 | genetic_marker
Pear_Bartlett_RosBREEDSNP_SNP_CT_171_5065103 | Pear_Bartlett_RosBREEDSNP_SNP_CT_171_5065103 | genetic_marker
RosBREEDSNP_SNP_CT_36168224_Lg2_00914_MAF10_614353_exon2 | RosBREEDSNP_SNP_CT_36168224_Lg2_00914_MAF10_614353_exon2 | genetic_marker
Pear_Bartlett_RosBREEDSNP_SNP_GA_344_5112579 | Pear_Bartlett_RosBREEDSNP_SNP_GA_344_5112579 | genetic_marker
RosBREEDSNP_SNP_CT_1844624_Lg7_01433_MAF50_651152_exon3 | RosBREEDSNP_SNP_CT_1844624_Lg7_01433_MAF50_651152_exon3 | genetic_marker
RosBREEDSNP_SNP_CT_38935589_Lg2_00995_MAF30_MDP0000211904_exon5 | RosBREEDSNP_SNP_CT_38935589_Lg2_00995_MAF30_MDP0000211904_exon5 | genetic_marker
RosBREEDSNP_SNP_GA_39231189_Lg2_01938_MAF20_MDP0000277119_exon2 | RosBREEDSNP_SNP_GA_39231189_Lg2_01938_MAF20_MDP0000277119_exon2 | genetic_marker
RosBREEDSNP_SNP_GA_34176489_Lg2_RosCOS2050_MAF20_41201_exon1 | RosBREEDSNP_SNP_GA_34176489_Lg2_RosCOS2050_MAF20_41201_exon1 | genetic_marker
GDsnp00167 | GDsnp00167 | genetic_marker
RosBREEDSNP_SNP_AG_18474457_Lg3_00955_MAF50_MDP0000225878_exon5 | RosBREEDSNP_SNP_AG_18474457_Lg3_00955_MAF50_MDP0000225878_exon5 | genetic_marker
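
Most of the marker names listed above share an underscore-separated pattern. The following is a minimal sketch of how such names might be split into fields for filtering or sorting. The field meanings (allele pair, numeric position, linkage group, MAF label, exon number) are assumptions inferred only from the visible name pattern, not from the publication, and the helper name parse_marker_name and the field names are hypothetical. Names that do not follow the pattern (for example the Pear_Bartlett_ and GDsnp entries) are returned as None rather than guessed at.

```python
import re
from typing import NamedTuple, Optional


class MarkerFields(NamedTuple):
    """Fields split out of a marker name; meanings are assumed, not documented."""
    alleles: str        # two-letter allele pair, e.g. "TC"
    position: int       # numeric field following the alleles
    linkage_group: int  # number following "Lg"
    locus_tag: str      # free-form tag between the Lg and MAF fields
    maf_label: int      # number following "MAF"
    gene_tag: str       # free-form tag between the MAF and exon fields
    exon: int           # number following "exon"


# Pattern inferred from the names in the feature list (an assumption).
_PATTERN = re.compile(
    r"^RosBREEDSNP_SNP_([ACGT]{2})_(\d+)_Lg(\d+)_([^_]+)_MAF(\d+)_([^_]+)_exon(\d+)$"
)


def parse_marker_name(name: str) -> Optional[MarkerFields]:
    """Return the split fields, or None if the name does not fit the pattern."""
    match = _PATTERN.match(name)
    if match is None:
        return None
    alleles, pos, lg, locus, maf, gene, exon = match.groups()
    return MarkerFields(alleles, int(pos), int(lg), locus, int(maf), gene, int(exon))


if __name__ == "__main__":
    for marker in (
        "RosBREEDSNP_SNP_TC_16128155_Lg2_02686_MAF20_407655_exon1",
        "GDsnp00167",
    ):
        print(marker, "->", parse_marker_name(marker))
```

Returning None for non-matching names keeps the sketch from inventing values for markers that follow a different naming scheme.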

Featuremaps
This publication contains information about 2 maps:
Map Name
Apple-M432-2013
Apple-M432-2013-physical-Malus-domestica