Evaluation of SNP Data from the Malus Infinium Array Identifies Challenges for Genetic Analysis of Complex Genomes of Polyploid Origin

Publication Overview
Title: Evaluation of SNP Data from the Malus Infinium Array Identifies Challenges for Genetic Analysis of Complex Genomes of Polyploid Origin
Authors: Troggio M, Surbanovski N, Bianco L, Moretto M, Giongo L, Banchi E, Viola R, Fernández FF, Costa F, Velasco R, Cestaro A, Sargent DJ
Type: Journal Article
Journal Name: PLoS ONE
Volume: 8
Issue: 6
Year: 2013
Page(s): e67407
Citation: Troggio M, Surbanovski N, Bianco L, Moretto M, Giongo L, Banchi E, Viola R, Fernández FF, Costa F, Velasco R, Cestaro A, Sargent DJ. Evaluation of SNP Data from the Malus Infinium Array Identifies Challenges for Genetic Analysis of Complex Genomes of Polyploid Origin. PLoS ONE. 2013; 8(6):e67407.

Abstract

High throughput arrays for the simultaneous genotyping of thousands of single-nucleotide polymorphisms (SNPs) have made the rapid genetic characterisation of plant genomes and the development of saturated linkage maps a realistic prospect for many plant species of agronomic importance. However, the correct calling of SNP genotypes in divergent polyploid genomes using array technology can be problematic due to paralogy, and to divergence in probe sequences causing changes in probe binding efficiencies. An Illumina Infinium II whole-genome genotyping array was recently developed for the cultivated apple and used to develop a molecular linkage map for an apple rootstock progeny (M432), but a large proportion of segregating SNPs were not mapped in the progeny, due to unexpected genotype clustering patterns. To investigate the causes of this unexpected clustering we performed BLAST analysis of all probe sequences against the 'Golden Delicious' genome sequence and discovered evidence for paralogous annealing sites and probe sequence divergence for a high proportion of probes contained on the array. Following visual re-evaluation of the genotyping data generated for 8,788 SNPs for the M432 progeny using the array, we manually re-scored genotypes at 818 loci and mapped a further 797 markers to the M432 linkage map. The newly mapped markers included the majority of those that could not be mapped previously, as well as loci that were previously scored as monomorphic, but which segregated due to divergence leading to heterozygosity in probe annealing sites. An evaluation of the 8,788 probes in a diverse collection of Malus germplasm showed that more than half the probes returned genotype clustering patterns that were difficult or impossible to interpret reliably, highlighting implications for the use of the array in genome-wide association studies.
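
The abstract describes screening probe sequences by BLAST against the 'Golden Delicious' genome to find probes with paralogous annealing sites. The sketch below illustrates one way such a screen could be summarised. It is not the authors' pipeline. It assumes the BLAST results are in the standard 12-column tabular format (`-outfmt 6`). The function name `flag_multi_hit_probes`, the probe and chromosome identifiers, and the identity and length thresholds are all hypothetical choices made for illustration.

```python
from collections import defaultdict


def flag_multi_hit_probes(blast_tabular, min_identity=90.0, min_length=40):
    """Return {probe_id: hit_count} for probes with more than one qualifying hit.

    blast_tabular: text in BLAST tabular format (-outfmt 6), whose columns are
    qseqid, sseqid, pident, length, mismatch, gapopen, qstart, qend,
    sstart, send, evalue, bitscore.
    min_identity and min_length are illustrative thresholds (assumptions),
    not values taken from the publication.
    """
    hits = defaultdict(int)
    for line in blast_tabular.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comment lines
        fields = line.split("\t")
        probe_id = fields[0]
        identity = float(fields[2])   # percent identity of the alignment
        length = int(fields[3])       # alignment length in bases
        if identity >= min_identity and length >= min_length:
            hits[probe_id] += 1
    # A probe with two or more qualifying hits may anneal at paralogous sites.
    return {probe: n for probe, n in hits.items() if n > 1}


# Hypothetical input: probeA aligns well at two loci, probeB at one.
example = (
    "probeA\tchr1\t100.0\t50\t0\t0\t1\t50\t1000\t1049\t1e-20\t93.5\n"
    "probeA\tchr9\t96.0\t50\t2\t0\t1\t50\t500\t549\t1e-15\t80.0\n"
    "probeB\tchr2\t100.0\t50\t0\t0\t1\t50\t200\t249\t1e-20\t93.5\n"
)
flagged = flag_multi_hit_probes(example)
```

A probe flagged here is only a candidate for paralogous annealing. The abstract notes that the authors combined the BLAST evidence with visual re-evaluation of genotype clusters before re-scoring any locus.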

Features
This publication contains information about 818 features:
Feature Name	Uniquename	Type
Pear_Bartlett_RosBREEDSNP_SNP_TG_708_2465344	Pear_Bartlett_RosBREEDSNP_SNP_TG_708_2465344	genetic_marker
Pear_Bartlett_RosBREEDSNP_SNP_TG_186_5139143	Pear_Bartlett_RosBREEDSNP_SNP_TG_186_5139143	genetic_marker
RosBREEDSNP_SNP_CT_26763910_Lg1_01228_MAF40_855624_exon1	RosBREEDSNP_SNP_CT_26763910_Lg1_01228_MAF40_855624_exon1	genetic_marker
RosBREEDSNP_SNP_AC_26758625_Lg1_01228_MAF10_223072_exon1	RosBREEDSNP_SNP_AC_26758625_Lg1_01228_MAF10_223072_exon1	genetic_marker
RosBREEDSNP_SNP_TG_3937877_Lg16_01179_MAF20_1634068_exon4	RosBREEDSNP_SNP_TG_3937877_Lg16_01179_MAF20_1634068_exon4	genetic_marker
RosBREEDSNP_SNP_GA_26726638_Lg1_01228_MAF50_1651849_exon2	RosBREEDSNP_SNP_GA_26726638_Lg1_01228_MAF50_1651849_exon2	genetic_marker
RosBREEDSNP_SNP_AC_27124099_Lg1_01693_MAF10_MDP0000225224_exon2	RosBREEDSNP_SNP_AC_27124099_Lg1_01693_MAF10_MDP0000225224_exon2	genetic_marker
RosBREEDSNP_SNP_CT_21182412_Lg4_01236_MAF10_392737_exon1	RosBREEDSNP_SNP_CT_21182412_Lg4_01236_MAF10_392737_exon1	genetic_marker
Pear_Bartlett_RosBREEDSNP_SNP_AC_250_5079577	Pear_Bartlett_RosBREEDSNP_SNP_AC_250_5079577	genetic_marker
RosBREEDSNP_SNP_TC_28113428_Lg1_108136_MAF20_108136_exon1	RosBREEDSNP_SNP_TC_28113428_Lg1_108136_MAF20_108136_exon1	genetic_marker
RosBREEDSNP_SNP_CT_31736133_Lg1_325795_MAF50_325795_exon2	RosBREEDSNP_SNP_CT_31736133_Lg1_325795_MAF50_325795_exon2	genetic_marker
GDsnp01371	GDsnp01371	genetic_marker
RosBREEDSNP_SNP_TC_28113401_Lg1_108136_MAF20_108136_exon1	RosBREEDSNP_SNP_TC_28113401_Lg1_108136_MAF20_108136_exon1	genetic_marker
RosBREEDSNP_SNP_TC_34463741_Lg5_RosCOS2737_MAF40_MDP0000858126_exon1	RosBREEDSNP_SNP_TC_34463741_Lg5_RosCOS2737_MAF40_MDP0000858126_exon1	genetic_marker
RosBREEDSNP_SNP_GA_34174698_Lg1_01678_MAF20_251361_exon1	RosBREEDSNP_SNP_GA_34174698_Lg1_01678_MAF20_251361_exon1	genetic_marker
RosBREEDSNP_SNP_TC_34226610_Lg1_01678_MAF10_1646222_exon1	RosBREEDSNP_SNP_TC_34226610_Lg1_01678_MAF10_1646222_exon1	genetic_marker
RosBREEDSNP_SNP_AG_35457943_Lg1_02092_MAF30_192745_exon1	RosBREEDSNP_SNP_AG_35457943_Lg1_02092_MAF30_192745_exon1	genetic_marker
RosBREEDSNP_SNP_GA_35492705_Lg1_02092_MAF50_1666970_exon1	RosBREEDSNP_SNP_GA_35492705_Lg1_02092_MAF50_1666970_exon1	genetic_marker
RosBREEDSNP_SNP_GA_35939914_Lg1_12966_MAF40_12966_exon1	RosBREEDSNP_SNP_GA_35939914_Lg1_12966_MAF40_12966_exon1	genetic_marker
RosBREEDSNP_SNP_CT_30208839_Lg8_01764_MAF10_MDP0000273337_exon1	RosBREEDSNP_SNP_CT_30208839_Lg8_01764_MAF10_MDP0000273337_exon1	genetic_marker
RosBREEDSNP_SNP_GT_5177248_Lg3_01957_MAF50_MDP0000134469_exon2	RosBREEDSNP_SNP_GT_5177248_Lg3_01957_MAF50_MDP0000134469_exon2	genetic_marker
RosBREEDSNP_SNP_TG_1555360_Lg2_snpCO903605_MAF20_1677573_exon2	RosBREEDSNP_SNP_TG_1555360_Lg2_snpCO903605_MAF20_1677573_exon2	genetic_marker
RosBREEDSNP_SNP_AG_1939660_Lg2_AAT1_MAF40_MDP0000145727_exon4	RosBREEDSNP_SNP_AG_1939660_Lg2_AAT1_MAF40_MDP0000145727_exon4	genetic_marker
RosBREEDSNP_SNP_AC_3435372_Lg2_01641_MAF30_1637004_exon3	RosBREEDSNP_SNP_AC_3435372_Lg2_01641_MAF30_1637004_exon3	genetic_marker
RosBREEDSNP_SNP_AG_3425722_Lg2_01641_MAF20_1660502_exon2	RosBREEDSNP_SNP_AG_3425722_Lg2_01641_MAF20_1660502_exon2	genetic_marker

Featuremaps
This publication contains information about 2 maps:
Map Name
Apple-M432-2013
Apple-M432-2013-physical-Malus-domestica