Evaluation of SNP Data from the Malus Infinium Array Identifies Challenges for Genetic Analysis of Complex Genomes of Polyploid Origin

Publication Overview
Title: Evaluation of SNP Data from the Malus Infinium Array Identifies Challenges for Genetic Analysis of Complex Genomes of Polyploid Origin
Authors: Troggio M, Surbanovski N, Bianco L, Moretto M, Giongo L, Banchi E, Viola R, Fernández FF, Costa F, Velasco R, Cestaro A, Sargent DJ
Type: Journal Article
Journal Name: PloS one
Volume: 8
Issue: 6
Year: 2013
Page(s): e67407
Citation: Troggio M, Surbanovski N, Bianco L, Moretto M, Giongo L, Banchi E, Viola R, Fernández FF, Costa F, Velasco R, Cestaro A, Sargent DJ. Evaluation of SNP Data from the Malus Infinium Array Identifies Challenges for Genetic Analysis of Complex Genomes of Polyploid Origin. PloS one. 2013; 8(6):e67407.

Abstract

High throughput arrays for the simultaneous genotyping of thousands of single-nucleotide polymorphisms (SNPs) have made the rapid genetic characterisation of plant genomes and the development of saturated linkage maps a realistic prospect for many plant species of agronomic importance. However, the correct calling of SNP genotypes in divergent polyploid genomes using array technology can be problematic due to paralogy, and to divergence in probe sequences causing changes in probe binding efficiencies. An Illumina Infinium II whole-genome genotyping array was recently developed for the cultivated apple and used to develop a molecular linkage map for an apple rootstock progeny (M432), but a large proportion of segregating SNPs were not mapped in the progeny, due to unexpected genotype clustering patterns. To investigate the causes of this unexpected clustering we performed BLAST analysis of all probe sequences against the 'Golden Delicious' genome sequence and discovered evidence for paralogous annealing sites and probe sequence divergence for a high proportion of probes contained on the array. Following visual re-evaluation of the genotyping data generated for 8,788 SNPs for the M432 progeny using the array, we manually re-scored genotypes at 818 loci and mapped a further 797 markers to the M432 linkage map. The newly mapped markers included the majority of those that could not be mapped previously, as well as loci that were previously scored as monomorphic, but which segregated due to divergence leading to heterozygosity in probe annealing sites. An evaluation of the 8,788 probes in a diverse collection of Malus germplasm showed that more than half the probes returned genotype clustering patterns that were difficult or impossible to interpret reliably, highlighting implications for the use of the array in genome-wide association studies.

Features
This publication contains information about 818 features:
Feature Name | Uniquename | Type
RosBREEDSNP_SNP_CT_27064466_Lg1_01693_MAF20_95538_exon1 | RosBREEDSNP_SNP_CT_27064466_Lg1_01693_MAF20_95538_exon1 | genetic_marker
RosBREEDSNP_SNP_CT_25403611_Lg7_02291_MAF40_926073_exon1 | RosBREEDSNP_SNP_CT_25403611_Lg7_02291_MAF40_926073_exon1 | genetic_marker
Pear_Bartlett_RosBREEDSNP_SNP_TC_319_5065462 | Pear_Bartlett_RosBREEDSNP_SNP_TC_319_5065462 | genetic_marker
RosBREEDSNP_SNP_GA_15174679_Lg6_RosCOS1295_MAF40_346404_exon2 | RosBREEDSNP_SNP_GA_15174679_Lg6_RosCOS1295_MAF40_346404_exon2 | genetic_marker
RosBREED_RosBREEDSNP_SNP_TC_15037669_Lg6 | RosBREED_RosBREEDSNP_SNP_TC_15037669_Lg6 | genetic_marker
RosBREEDSNP_SNP_CA_27835935_Lg9_02482_MAF30_166068_exon2 | RosBREEDSNP_SNP_CA_27835935_Lg9_02482_MAF30_166068_exon2 | genetic_marker
RosBREEDSNP_SNP_TG_7627830_Lg8_RosCOS3656_MAF30_391382_exon1 | RosBREEDSNP_SNP_TG_7627830_Lg8_RosCOS3656_MAF30_391382_exon1 | genetic_marker
RosBREEDSNP_SNP_GT_20005208_Lg17_01298_MAF20_1629877_exon1 | RosBREEDSNP_SNP_GT_20005208_Lg17_01298_MAF20_1629877_exon1 | genetic_marker
RosBREEDSNP_SNP_TC_35481496_Lg10_RosCOS663_MAF10_586118_exon1 | RosBREEDSNP_SNP_TC_35481496_Lg10_RosCOS663_MAF10_586118_exon1 | genetic_marker
RosBREEDSNP_SNP_GA_18732709_Lg17_00235_MAF20_1660220_exon3 | RosBREEDSNP_SNP_GA_18732709_Lg17_00235_MAF20_1660220_exon3 | genetic_marker
RosBREEDSNP_SNP_TG_33270439_Lg14_01548_MAF20_95229_exon1 | RosBREEDSNP_SNP_TG_33270439_Lg14_01548_MAF20_95229_exon1 | genetic_marker
Pear_Bartlett_RosBREEDSNP_SNP_TC_179_5118987 | Pear_Bartlett_RosBREEDSNP_SNP_TC_179_5118987 | genetic_marker
RosBREEDSNP_SNP_TC_21060256_Lg17_01525_MAF30_1626827_exon11 | RosBREEDSNP_SNP_TC_21060256_Lg17_01525_MAF30_1626827_exon11 | genetic_marker
Pear_Bartlett_RosBREEDSNP_SNP_CT_519_5116509 | Pear_Bartlett_RosBREEDSNP_SNP_CT_519_5116509 | genetic_marker
RosBREEDSNP_SNP_CT_24804248_Lg17_00341_MAF40_1654737_exon1 | RosBREEDSNP_SNP_CT_24804248_Lg17_00341_MAF40_1654737_exon1 | genetic_marker
RosBREEDSNP_SNP_CT_25499641_Lg17_01467_MAF30_MDP0000281526_exon1 | RosBREEDSNP_SNP_CT_25499641_Lg17_01467_MAF30_MDP0000281526_exon1 | genetic_marker
RosBREEDSNP_SNP_TG_25497259_Lg17_01467_MAF40_204957_exon1 | RosBREEDSNP_SNP_TG_25497259_Lg17_01467_MAF40_204957_exon1 | genetic_marker
Pear_Bartlett_RosBREEDSNP_SNP_TC_240_5092398 | Pear_Bartlett_RosBREEDSNP_SNP_TC_240_5092398 | genetic_marker
RosBREEDSNP_SNP_CT_26169975_Lg17_95201_MAF40_95201_exon1 | RosBREEDSNP_SNP_CT_26169975_Lg17_95201_MAF40_95201_exon1 | genetic_marker
RosBREEDSNP_SNP_GA_14833187_Lg1_01500_MAF20_1625667_exon2 | RosBREEDSNP_SNP_GA_14833187_Lg1_01500_MAF20_1625667_exon2 | genetic_marker
RosBREEDSNP_SNP_GT_3742841_Lg2_226115_MAF20_226115_exon1 | RosBREEDSNP_SNP_GT_3742841_Lg2_226115_MAF20_226115_exon1 | genetic_marker
RosBREEDSNP_SNP_TG_2487461_Lg2_00959_MAF20_35333_exon1 | RosBREEDSNP_SNP_TG_2487461_Lg2_00959_MAF20_35333_exon1 | genetic_marker
GDsnp01735 | GDsnp01735 | genetic_marker
RosBREEDSNP_SNP_TC_8475399_Lg3_RosCOS3572_MAF50_MDP0000721948_exon1 | RosBREEDSNP_SNP_TC_8475399_Lg3_RosCOS3572_MAF50_MDP0000721948_exon1 | genetic_marker
RosBREEDSNP_SNP_TC_10671767_Lg3_00874_MAF20_637991_exon2 | RosBREEDSNP_SNP_TC_10671767_Lg3_00874_MAF20_637991_exon2 | genetic_marker

Featuremaps
This publication contains information about 2 maps:
Map Name
Apple-M432-2013
Apple-M432-2013-physical-Malus-domestica