Evaluation of SNP Data from the Malus Infinium Array Identifies Challenges for Genetic Analysis of Complex Genomes of Polyploid Origin

Publication Overview
Title: Evaluation of SNP Data from the Malus Infinium Array Identifies Challenges for Genetic Analysis of Complex Genomes of Polyploid Origin
Authors: Troggio M, Surbanovski N, Bianco L, Moretto M, Giongo L, Banchi E, Viola R, Fernández FF, Costa F, Velasco R, Cestaro A, Sargent DJ
Type: Journal Article
Journal Name: PloS one
Volume: 8
Issue: 6
Year: 2013
Page(s): e67407
Citation: Troggio M, Surbanovski N, Bianco L, Moretto M, Giongo L, Banchi E, Viola R, Fernández FF, Costa F, Velasco R, Cestaro A, Sargent DJ. Evaluation of SNP Data from the Malus Infinium Array Identifies Challenges for Genetic Analysis of Complex Genomes of Polyploid Origin. PloS one. 2013; 8(6):e67407.

Abstract

High throughput arrays for the simultaneous genotyping of thousands of single-nucleotide polymorphisms (SNPs) have made the rapid genetic characterisation of plant genomes and the development of saturated linkage maps a realistic prospect for many plant species of agronomic importance. However, the correct calling of SNP genotypes in divergent polyploid genomes using array technology can be problematic due to paralogy, and to divergence in probe sequences causing changes in probe binding efficiencies. An Illumina Infinium II whole-genome genotyping array was recently developed for the cultivated apple and used to develop a molecular linkage map for an apple rootstock progeny (M432), but a large proportion of segregating SNPs were not mapped in the progeny, due to unexpected genotype clustering patterns. To investigate the causes of this unexpected clustering we performed BLAST analysis of all probe sequences against the 'Golden Delicious' genome sequence and discovered evidence for paralogous annealing sites and probe sequence divergence for a high proportion of probes contained on the array. Following visual re-evaluation of the genotyping data generated for 8,788 SNPs for the M432 progeny using the array, we manually re-scored genotypes at 818 loci and mapped a further 797 markers to the M432 linkage map. The newly mapped markers included the majority of those that could not be mapped previously, as well as loci that were previously scored as monomorphic, but which segregated due to divergence leading to heterozygosity in probe annealing sites. An evaluation of the 8,788 probes in a diverse collection of Malus germplasm showed that more than half the probes returned genotype clustering patterns that were difficult or impossible to interpret reliably, highlighting implications for the use of the array in genome-wide association studies.

Features
This publication contains information about 818 features:
Feature Name | Uniquename | Type
RosBREEDSNP_SNP_CT_24565225_Lg3_RosCOS430_MAF20_1659609_exon2 | RosBREEDSNP_SNP_CT_24565225_Lg3_RosCOS430_MAF20_1659609_exon2 | genetic_marker
RosBREEDSNP_SNP_TC_27126226_Lg13_MDP0000716623_MAF50_MDP0000716623_exon1 | RosBREEDSNP_SNP_TC_27126226_Lg13_MDP0000716623_MAF50_MDP0000716623_exon1 | genetic_marker
RosBREEDSNP_SNP_CT_6269014_Lg8_RosCOS1254_MAF40_MDP0000155792_exon1 | RosBREEDSNP_SNP_CT_6269014_Lg8_RosCOS1254_MAF40_MDP0000155792_exon1 | genetic_marker
RosBREEDSNP_SNP_AC_27018761_Lg13_MDP0000163961__MDP0000163961_exon1 | RosBREEDSNP_SNP_AC_27018761_Lg13_MDP0000163961__MDP0000163961_exon1 | genetic_marker
Pear_Bartlett_RosBREEDSNP_SNP_GA_231_5053509 | Pear_Bartlett_RosBREEDSNP_SNP_GA_231_5053509 | genetic_marker
GDsnp02407 | GDsnp02407 | genetic_marker
RosBREEDSNP_SNP_AG_27018760_Lg13_MDP0000163961_MAF10_MDP0000163961_exon1 | RosBREEDSNP_SNP_AG_27018760_Lg13_MDP0000163961_MAF10_MDP0000163961_exon1 | genetic_marker
RosBREEDSNP_SNP_CT_22731798_Lg11_00743_MAF10_MDP0000573656_exon1 | RosBREEDSNP_SNP_CT_22731798_Lg11_00743_MAF10_MDP0000573656_exon1 | genetic_marker
Pear_Bartlett_RosBREEDSNP_SNP_AG_210_5073880 | Pear_Bartlett_RosBREEDSNP_SNP_AG_210_5073880 | genetic_marker
RosBREEDSNP_SNP_AG_26508831_Lg11_138333_MAF20_138333_exon1 | RosBREEDSNP_SNP_AG_26508831_Lg11_138333_MAF20_138333_exon1 | genetic_marker
RosBREEDSNP_SNP_GA_26508753_Lg11_138333_MAF40_138333_exon1 | RosBREEDSNP_SNP_GA_26508753_Lg11_138333_MAF40_138333_exon1 | genetic_marker
GDsnp02800 | GDsnp02800 | genetic_marker
RosBREEDSNP_SNP_TC_28449600_Lg11_02800_MAF30_MDP0000571011_exon1 | RosBREEDSNP_SNP_TC_28449600_Lg11_02800_MAF30_MDP0000571011_exon1 | genetic_marker
GDsnp00662 | GDsnp00662 | genetic_marker
RosBREEDSNP_SNP_GA_11372671_Lg14_02214_MAF20_MDP0000286281_exon6 | RosBREEDSNP_SNP_GA_11372671_Lg14_02214_MAF20_MDP0000286281_exon6 | genetic_marker
RosBREEDSNP_SNP_GA_30592203_Lg11_02838_MAF20_MDP0000171255_exon8 | RosBREEDSNP_SNP_GA_30592203_Lg11_02838_MAF20_MDP0000171255_exon8 | genetic_marker
RosBREEDSNP_SNP_TC_32999468_Lg11_00566_MAF40_MDP0000265225_exon2 | RosBREEDSNP_SNP_TC_32999468_Lg11_00566_MAF40_MDP0000265225_exon2 | genetic_marker
RosBREEDSNP_SNP_TG_33459287_Lg11_02576_MAF50_382426_exon2 | RosBREEDSNP_SNP_TG_33459287_Lg11_02576_MAF50_382426_exon2 | genetic_marker
RosBREEDSNP_SNP_GT_34226486_Lg11_02910_MAF20_1656183_exon1 | RosBREEDSNP_SNP_GT_34226486_Lg11_02910_MAF20_1656183_exon1 | genetic_marker
RosBREEDSNP_SNP_CA_34267142_Lg11_02910_MAF10_1657398_exon1 | RosBREEDSNP_SNP_CA_34267142_Lg11_02910_MAF10_1657398_exon1 | genetic_marker
RosBREEDSNP_SNP_AC_35859098_Lg11_00795_MAF40_MDP0000126702_exon6 | RosBREEDSNP_SNP_AC_35859098_Lg11_00795_MAF40_MDP0000126702_exon6 | genetic_marker
RosBREEDSNP_SNP_TG_35936616_Lg11_RosCOS372_MAF50_826662_exon1 | RosBREEDSNP_SNP_TG_35936616_Lg11_RosCOS372_MAF50_826662_exon1 | genetic_marker
RosBREEDSNP_SNP_CT_36886760_Lg11_141951_MAF20_141951_exon1 | RosBREEDSNP_SNP_CT_36886760_Lg11_141951_MAF20_141951_exon1 | genetic_marker
RosBREEDSNP_SNP_TG_37676795_Lg11_MDP0000258603_MAF20_MDP0000258603_exon18 | RosBREEDSNP_SNP_TG_37676795_Lg11_MDP0000258603_MAF20_MDP0000258603_exon18 | genetic_marker
RosBREEDSNP_SNP_AG_38444510_Lg3_00319_MAF40_530902_exon1 | RosBREEDSNP_SNP_AG_38444510_Lg3_00319_MAF40_530902_exon1 | genetic_marker

Featuremaps
This publication contains information about 2 maps:
Map Name
Apple-M432-2013
Apple-M432-2013-physical-Malus-domestica