Evaluation of SNP Data from the Malus Infinium Array Identifies Challenges for Genetic Analysis of Complex Genomes of Polyploid Origin

Publication Overview
TitleEvaluation of SNP Data from the Malus Infinium Array Identifies Challenges for Genetic Analysis of Complex Genomes of Polyploid Origin
AuthorsTroggio M, Surbanovski N, Bianco L, Moretto M, Giongo L, Banchi E, Viola R, Fernández FF, Costa F, Velasco R, Cestaro A, Sargent DJ
TypeJournal Article
Journal NamePloS one
Volume8
Issue6
Year2013
Page(s)e67407
CitationTroggio M, Surbanovski N, Bianco L, Moretto M, Giongo L, Banchi E, Viola R, Fernández FF, Costa F, Velasco R, Cestaro A, Sargent DJ. Evaluation of SNP Data from the Malus Infinium Array Identifies Challenges for Genetic Analysis of Complex Genomes of Polyploid Origin. PloS one. 2013; 8(6):e67407.

Abstract

High throughput arrays for the simultaneous genotyping of thousands of single-nucleotide polymorphisms (SNPs) have made the rapid genetic characterisation of plant genomes and the development of saturated linkage maps a realistic prospect for many plant species of agronomic importance. However, the correct calling of SNP genotypes in divergent polyploid genomes using array technology can be problematic due to paralogy, and to divergence in probe sequences causing changes in probe binding efficiencies. An Illumina Infinium II whole-genome genotyping array was recently developed for the cultivated apple and used to develop a molecular linkage map for an apple rootstock progeny (M432), but a large proportion of segregating SNPs were not mapped in the progeny, due to unexpected genotype clustering patterns. To investigate the causes of this unexpected clustering we performed BLAST analysis of all probe sequences against the 'Golden Delicious' genome sequence and discovered evidence for paralogous annealing sites and probe sequence divergence for a high proportion of probes contained on the array. Following visual re-evaluation of the genotyping data generated for 8,788 SNPs for the M432 progeny using the array, we manually re-scored genotypes at 818 loci and mapped a further 797 markers to the M432 linkage map. The newly mapped markers included the majority of those that could not be mapped previously, as well as loci that were previously scored as monomorphic, but which segregated due to divergence leading to heterozygosity in probe annealing sites. An evaluation of the 8,788 probes in a diverse collection of Malus germplasm showed that more than half the probes returned genotype clustering patterns that were difficult or impossible to interpret reliably, highlighting implications for the use of the array in genome-wide association studies.

Features
This publication contains information about 818 features:
Feature NameUniquenameType
RosBREEDSNP_SNP_TC_10411650_Lg15_00543_MAF50_MDP0000144056_exon1RosBREEDSNP_SNP_TC_10411650_Lg15_00543_MAF50_MDP0000144056_exon1genetic_marker
RosBREEDSNP_SNP_TC_3832093_Lg4_00966_MAF40_526062_exon1RosBREEDSNP_SNP_TC_3832093_Lg4_00966_MAF40_526062_exon1genetic_marker
RosBREEDSNP_SNP_GA_12030066_Lg6_01366_MAF40_886759_exon1RosBREEDSNP_SNP_GA_12030066_Lg6_01366_MAF40_886759_exon1genetic_marker
RosBREEDSNP_SNP_CT_7056995_Lg2_RosCOS1581_MAF40_1648387_exon1RosBREEDSNP_SNP_CT_7056995_Lg2_RosCOS1581_MAF40_1648387_exon1genetic_marker
RosBREEDSNP_SNP_TC_28606194_Lg9_02059_MAF40_MDP0000644952_exon1RosBREEDSNP_SNP_TC_28606194_Lg9_02059_MAF40_MDP0000644952_exon1genetic_marker
RosBREEDSNP_SNP_TC_6697023_Lg16_01173_MAF50_728139_exon1RosBREEDSNP_SNP_TC_6697023_Lg16_01173_MAF50_728139_exon1genetic_marker
RosBREEDSNP_SNP_AG_13341945_Lg4_RosCOS3641_MAF40_1621447_exon6RosBREEDSNP_SNP_AG_13341945_Lg4_RosCOS3641_MAF40_1621447_exon6genetic_marker
Pear_Bartlett_RosBREEDSNP_SNP_CT_626_5084186Pear_Bartlett_RosBREEDSNP_SNP_CT_626_5084186genetic_marker
RosBREEDSNP_SNP_TC_12704433_Lg3_02020_MAF40_MDP0000193273_exon5RosBREEDSNP_SNP_TC_12704433_Lg3_02020_MAF40_MDP0000193273_exon5genetic_marker
RosBREEDSNP_SNP_AG_12699579_Lg3_02020_MAF20_54478_exon1RosBREEDSNP_SNP_AG_12699579_Lg3_02020_MAF20_54478_exon1genetic_marker
RosBREEDSNP_SNP_CT_30391137_Lg10_01299_MAF30_MDP0000119942_exon2RosBREEDSNP_SNP_CT_30391137_Lg10_01299_MAF30_MDP0000119942_exon2genetic_marker
RosBREEDSNP_SNP_GT_16060029_Lg4_01777_MAF30_MDP0000922741_exon1RosBREEDSNP_SNP_GT_16060029_Lg4_01777_MAF30_MDP0000922741_exon1genetic_marker
RosBREEDSNP_SNP_TC_16123245_Lg4_00321_MAF30_9344_exon1RosBREEDSNP_SNP_TC_16123245_Lg4_00321_MAF30_9344_exon1genetic_marker
Pear_Bartlett_RosBREEDSNP_SNP_TC_351_5052000Pear_Bartlett_RosBREEDSNP_SNP_TC_351_5052000genetic_marker
RosBREEDSNP_SNP_GA_18765317_Lg4_00324_MAF30_490687_exon1RosBREEDSNP_SNP_GA_18765317_Lg4_00324_MAF30_490687_exon1genetic_marker
RosBREEDSNP_SNP_CT_18541730_Lg4_02277_MAF10_MDP0000294856_exon1RosBREEDSNP_SNP_CT_18541730_Lg4_02277_MAF10_MDP0000294856_exon1genetic_marker
RosBREEDSNP_SNP_GA_7613095_Lg9_00893_MAF30_MDP0000225805_exon2RosBREEDSNP_SNP_GA_7613095_Lg9_00893_MAF30_MDP0000225805_exon2genetic_marker
RosBREEDSNP_SNP_TC_593291_Lg13_01917_MAF10_MDP0000697770_exon1RosBREEDSNP_SNP_TC_593291_Lg13_01917_MAF10_MDP0000697770_exon1genetic_marker
RosBREEDSNP_SNP_AG_20940280_Lg4_00686_MAF30_MDP0000702560_exon1RosBREEDSNP_SNP_AG_20940280_Lg4_00686_MAF30_MDP0000702560_exon1genetic_marker
RosBREEDSNP_SNP_CT_21138715_Lg4_01236_MAF20_MDP0000315217_exon1RosBREEDSNP_SNP_CT_21138715_Lg4_01236_MAF20_MDP0000315217_exon1genetic_marker
GDsnp00619GDsnp00619genetic_marker
RosBREEDSNP_SNP_GA_2319323_Lg16_00719_MAF20_MDP0000234754_exon1RosBREEDSNP_SNP_GA_2319323_Lg16_00719_MAF20_MDP0000234754_exon1genetic_marker
RosBREEDSNP_SNP_TG_21916340_Lg4_00523_MAF40_403564_exon3RosBREEDSNP_SNP_TG_21916340_Lg4_00523_MAF40_403564_exon3genetic_marker
Pear_Bartlett_RosBREEDSNP_SNP_TC_181_5112639Pear_Bartlett_RosBREEDSNP_SNP_TC_181_5112639genetic_marker
RosBREEDSNP_SNP_AG_23395001_Lg4_01567_MAF40_488642_exon1RosBREEDSNP_SNP_AG_23395001_Lg4_01567_MAF40_488642_exon1genetic_marker

Pages

Featuremaps
This publication contains information about 2 maps:
Map Name
Apple-M432-2013
Apple-M432-2013-physical-Malus-domestica