Evaluation of SNP Data from the Malus Infinium Array Identifies Challenges for Genetic Analysis of Complex Genomes of Polyploid Origin

Publication Overview
Title: Evaluation of SNP Data from the Malus Infinium Array Identifies Challenges for Genetic Analysis of Complex Genomes of Polyploid Origin
Authors: Troggio M, Surbanovski N, Bianco L, Moretto M, Giongo L, Banchi E, Viola R, Fernández FF, Costa F, Velasco R, Cestaro A, Sargent DJ
Type: Journal Article
Journal Name: PloS one
Volume: 8
Issue: 6
Year: 2013
Page(s): e67407
Citation: Troggio M, Surbanovski N, Bianco L, Moretto M, Giongo L, Banchi E, Viola R, Fernández FF, Costa F, Velasco R, Cestaro A, Sargent DJ. Evaluation of SNP Data from the Malus Infinium Array Identifies Challenges for Genetic Analysis of Complex Genomes of Polyploid Origin. PloS one. 2013; 8(6):e67407.

Abstract

High throughput arrays for the simultaneous genotyping of thousands of single-nucleotide polymorphisms (SNPs) have made the rapid genetic characterisation of plant genomes and the development of saturated linkage maps a realistic prospect for many plant species of agronomic importance. However, the correct calling of SNP genotypes in divergent polyploid genomes using array technology can be problematic due to paralogy, and to divergence in probe sequences causing changes in probe binding efficiencies. An Illumina Infinium II whole-genome genotyping array was recently developed for the cultivated apple and used to develop a molecular linkage map for an apple rootstock progeny (M432), but a large proportion of segregating SNPs were not mapped in the progeny, due to unexpected genotype clustering patterns. To investigate the causes of this unexpected clustering we performed BLAST analysis of all probe sequences against the 'Golden Delicious' genome sequence and discovered evidence for paralogous annealing sites and probe sequence divergence for a high proportion of probes contained on the array. Following visual re-evaluation of the genotyping data generated for 8,788 SNPs for the M432 progeny using the array, we manually re-scored genotypes at 818 loci and mapped a further 797 markers to the M432 linkage map. The newly mapped markers included the majority of those that could not be mapped previously, as well as loci that were previously scored as monomorphic, but which segregated due to divergence leading to heterozygosity in probe annealing sites. An evaluation of the 8,788 probes in a diverse collection of Malus germplasm showed that more than half the probes returned genotype clustering patterns that were difficult or impossible to interpret reliably, highlighting implications for the use of the array in genome-wide association studies.
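The probe screening described in the abstract can be illustrated with a minimal sketch. It assumes the probe sequences were aligned to the genome with BLAST and written in the standard 12-column tabular format (query ID, subject ID, percent identity, alignment length, and so on). The function name, the category labels, the thresholds (90% identity over at least 45 bases) and the example rows are illustrative assumptions, not values or data from the publication.

```python
from collections import defaultdict


def classify_probes(blast_lines, min_identity=90.0, min_length=45):
    """Label each probe by the number and quality of its genome hits.

    blast_lines: iterable of tab-separated BLAST tabular rows.
    The thresholds are hypothetical defaults, not the authors' settings.
    """
    hits = defaultdict(list)
    for line in blast_lines:
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comment headers
        fields = line.split("\t")
        probe_id = fields[0]
        identity = float(fields[2])  # column 3: percent identity
        length = int(fields[3])      # column 4: alignment length
        probe_hits = hits[probe_id]  # registers the probe even without a confident hit
        if identity >= min_identity and length >= min_length:
            probe_hits.append(identity)

    labels = {}
    for probe_id, identities in hits.items():
        if not identities:
            labels[probe_id] = "no_confident_hit"
        elif len(identities) > 1:
            labels[probe_id] = "multiple_sites"  # candidate paralogous annealing
        elif identities[0] < 100.0:
            labels[probe_id] = "single_site_divergent"  # probe sequence divergence
        else:
            labels[probe_id] = "single_site_exact"
    return labels


# Made-up rows for illustration only
example = [
    "probe_A\tchr5\t100.00\t50\t0\t0\t1\t50\t1000\t1049\t1e-20\t93.5",
    "probe_B\tchr5\t100.00\t50\t0\t0\t1\t50\t2000\t2049\t1e-20\t93.5",
    "probe_B\tchr10\t96.00\t50\t2\t0\t1\t50\t7000\t7049\t1e-17\t82.4",
    "probe_C\tchr8\t94.00\t50\t3\t0\t1\t50\t300\t349\t1e-15\t76.8",
    "probe_D\tchr2\t85.00\t30\t4\t0\t1\t30\t500\t529\t1e-05\t40.1",
]
labels = classify_probes(example)
```

Probes labelled `multiple_sites` or `single_site_divergent` would be the ones to inspect visually for unexpected genotype clustering; the thresholds would need tuning for any real array.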

Features
This publication contains information about 818 features:
Feature Name | Uniquename | Type
RosBREEDSNP_SNP_CT_11763475_Lg5_RosCOS544_MAF10_1675698_exon1 | RosBREEDSNP_SNP_CT_11763475_Lg5_RosCOS544_MAF10_1675698_exon1 | genetic_marker
RosBREEDSNP_SNP_AC_12035560_Lg5_01319_MAF50_9918_exon2 | RosBREEDSNP_SNP_AC_12035560_Lg5_01319_MAF50_9918_exon2 | genetic_marker
RosBREEDSNP_SNP_CT_24276617_Lg10_RosCOS3104_MAF20_1624917_exon2 | RosBREEDSNP_SNP_CT_24276617_Lg10_RosCOS3104_MAF20_1624917_exon2 | genetic_marker
RosBREEDSNP_SNP_GA_24417621_Lg5_AT11_MAF30_MDP0000277380_exon1 | RosBREEDSNP_SNP_GA_24417621_Lg5_AT11_MAF30_MDP0000277380_exon1 | genetic_marker
RosBREEDSNP_SNP_AC_24420668_Lg5_AT11_MAF50_361047_exon1 | RosBREEDSNP_SNP_AC_24420668_Lg5_AT11_MAF50_361047_exon1 | genetic_marker
RosBREEDSNP_SNP_CT_13600964_Lg7_01305_MAF30_MDP0000269737_exon1 | RosBREEDSNP_SNP_CT_13600964_Lg7_01305_MAF30_MDP0000269737_exon1 | genetic_marker
RosBREEDSNP_SNP_AG_17594367_Lg10_01987_MAF40_1641049_exon6 | RosBREEDSNP_SNP_AG_17594367_Lg10_01987_MAF40_1641049_exon6 | genetic_marker
RosBREEDSNP_SNP_TC_15472298_Lg5_01616_MAF30_1682170_exon2 | RosBREEDSNP_SNP_TC_15472298_Lg5_01616_MAF30_1682170_exon2 | genetic_marker
GDsnp01616 | GDsnp01616 | genetic_marker
RosBREEDSNP_SNP_GT_23278011_Lg10_RosCOS1786_MAF20_1648518_exon3 | RosBREEDSNP_SNP_GT_23278011_Lg10_RosCOS1786_MAF20_1648518_exon3 | genetic_marker
RosBREEDSNP_SNP_CT_19507035_Lg5_01478_MAF30_1642139_exon2 | RosBREEDSNP_SNP_CT_19507035_Lg5_01478_MAF30_1642139_exon2 | genetic_marker
RosBREEDSNP_SNP_AG_9904632_Lg11_RosCOS3333_MAF40_MDP0000278972_exon2 | RosBREEDSNP_SNP_AG_9904632_Lg11_RosCOS3333_MAF40_MDP0000278972_exon2 | genetic_marker
RosBREEDSNP_SNP_TG_13662840_Lg5_00978_MAF40_326182_exon1 | RosBREEDSNP_SNP_TG_13662840_Lg5_00978_MAF40_326182_exon1 | genetic_marker
RosBREEDSNP_SNP_GA_18646194_Lg5_00323_MAF40_1671136_exon1 | RosBREEDSNP_SNP_GA_18646194_Lg5_00323_MAF40_1671136_exon1 | genetic_marker
RosBREEDSNP_SNP_CT_18754379_Lg5_MDP0000309680_MAF30_MDP0000309680_exon1 | RosBREEDSNP_SNP_CT_18754379_Lg5_MDP0000309680_MAF30_MDP0000309680_exon1 | genetic_marker
RosBREEDSNP_SNP_GT_18586613_Lg5_00323_MAF50_55502_exon1 | RosBREEDSNP_SNP_GT_18586613_Lg5_00323_MAF50_55502_exon1 | genetic_marker
RosBREEDSNP_SNP_GA_20021703_Lg8_00311_MAF10_MDP0000248297_exon1 | RosBREEDSNP_SNP_GA_20021703_Lg8_00311_MAF10_MDP0000248297_exon1 | genetic_marker
Pear_Bartlett_RosBREEDSNP_SNP_GT_185_5083299 | Pear_Bartlett_RosBREEDSNP_SNP_GT_185_5083299 | genetic_marker
RosBREEDSNP_SNP_CT_19680406_Lg5_00867_MAF40_1632497_exon1 | RosBREEDSNP_SNP_CT_19680406_Lg5_00867_MAF40_1632497_exon1 | genetic_marker
RosBREEDSNP_SNP_CT_20335489_Lg5_RosCOS452_MAF20_1629981_exon2 | RosBREEDSNP_SNP_CT_20335489_Lg5_RosCOS452_MAF20_1629981_exon2 | genetic_marker
GDsnp01722 | GDsnp01722 | genetic_marker
RosBREEDSNP_SNP_CA_4926338_Lg2_RosCOS3565_MAF40_1675500_exon1 | RosBREEDSNP_SNP_CA_4926338_Lg2_RosCOS3565_MAF40_1675500_exon1 | genetic_marker
RosBREEDSNP_SNP_AC_10896908_Lg8_00584_MAF40_MDP0000614935_exon2 | RosBREEDSNP_SNP_AC_10896908_Lg8_00584_MAF40_MDP0000614935_exon2 | genetic_marker
RosBREEDSNP_SNP_CT_24579162_Lg5_01304_MAF40_432533_exon1 | RosBREEDSNP_SNP_CT_24579162_Lg5_01304_MAF40_432533_exon1 | genetic_marker
RosBREEDSNP_SNP_TC_11303456_Lg8_RosCOS936_MAF50_MDP0000280399_exon1 | RosBREEDSNP_SNP_TC_11303456_Lg8_RosCOS936_MAF50_MDP0000280399_exon1 | genetic_marker

Featuremaps
This publication contains information about 2 maps:
Map Name
Apple-M432-2013
Apple-M432-2013-physical-Malus-domestica