Evaluation of SNP Data from the Malus Infinium Array Identifies Challenges for Genetic Analysis of Complex Genomes of Polyploid Origin

Publication Overview
TitleEvaluation of SNP Data from the Malus Infinium Array Identifies Challenges for Genetic Analysis of Complex Genomes of Polyploid Origin
AuthorsTroggio M, Surbanovski N, Bianco L, Moretto M, Giongo L, Banchi E, Viola R, Fernández FF, Costa F, Velasco R, Cestaro A, Sargent DJ
TypeJournal Article
Journal NamePloS one
Volume8
Issue6
Year2013
Page(s)e67407
CitationTroggio M, Surbanovski N, Bianco L, Moretto M, Giongo L, Banchi E, Viola R, Fernández FF, Costa F, Velasco R, Cestaro A, Sargent DJ. Evaluation of SNP Data from the Malus Infinium Array Identifies Challenges for Genetic Analysis of Complex Genomes of Polyploid Origin. PloS one. 2013; 8(6):e67407.

Abstract

High throughput arrays for the simultaneous genotyping of thousands of single-nucleotide polymorphisms (SNPs) have made the rapid genetic characterisation of plant genomes and the development of saturated linkage maps a realistic prospect for many plant species of agronomic importance. However, the correct calling of SNP genotypes in divergent polyploid genomes using array technology can be problematic due to paralogy, and to divergence in probe sequences causing changes in probe binding efficiencies. An Illumina Infinium II whole-genome genotyping array was recently developed for the cultivated apple and used to develop a molecular linkage map for an apple rootstock progeny (M432), but a large proportion of segregating SNPs were not mapped in the progeny, due to unexpected genotype clustering patterns. To investigate the causes of this unexpected clustering we performed BLAST analysis of all probe sequences against the 'Golden Delicious' genome sequence and discovered evidence for paralogous annealing sites and probe sequence divergence for a high proportion of probes contained on the array. Following visual re-evaluation of the genotyping data generated for 8,788 SNPs for the M432 progeny using the array, we manually re-scored genotypes at 818 loci and mapped a further 797 markers to the M432 linkage map. The newly mapped markers included the majority of those that could not be mapped previously, as well as loci that were previously scored as monomorphic, but which segregated due to divergence leading to heterozygosity in probe annealing sites. An evaluation of the 8,788 probes in a diverse collection of Malus germplasm showed that more than half the probes returned genotype clustering patterns that were difficult or impossible to interpret reliably, highlighting implications for the use of the array in genome-wide association studies.

Features
This publication contains information about 818 features:
Feature NameUniquenameType
RosBREEDSNP_SNP_CT_25466366_Lg17_01467_MAF30_651806_exon1RosBREEDSNP_SNP_CT_25466366_Lg17_01467_MAF30_651806_exon1genetic_marker
RosBREED_RosBREEDSNP_SNP_CT_7022178_Lg8RosBREED_RosBREEDSNP_SNP_CT_7022178_Lg8genetic_marker
RosBREEDSNP_SNP_TC_19023204_Lg15_01838_MAF30_MDP0000526640_exon2RosBREEDSNP_SNP_TC_19023204_Lg15_01838_MAF30_MDP0000526640_exon2genetic_marker
RosBREEDSNP_SNP_TC_32549950_Lg9_01066_MAF30_MDP0000275261_exon10RosBREEDSNP_SNP_TC_32549950_Lg9_01066_MAF30_MDP0000275261_exon10genetic_marker
RosBREEDSNP_SNP_TC_32851678_Lg9_MYB10_MAF20_MDP0000573302_exon2RosBREEDSNP_SNP_TC_32851678_Lg9_MYB10_MAF20_MDP0000573302_exon2genetic_marker
RosBREEDSNP_SNP_TG_33061238_Lg9_01200_MAF50_376336_exon2RosBREEDSNP_SNP_TG_33061238_Lg9_01200_MAF50_376336_exon2genetic_marker
RosBREEDSNP_SNP_CT_35078781_Lg8_00299_MAF30_1686857_exon1RosBREEDSNP_SNP_CT_35078781_Lg8_00299_MAF30_1686857_exon1genetic_marker
RosBREEDSNP_SNP_AG_31149185_Lg12_01759_MAF50_1631167_exon2RosBREEDSNP_SNP_AG_31149185_Lg12_01759_MAF50_1631167_exon2genetic_marker
Pear_Bartlett_RosBREEDSNP_SNP_GA_298_5078744Pear_Bartlett_RosBREEDSNP_SNP_GA_298_5078744genetic_marker
RosBREEDSNP_SNP_AG_579463_Lg15_00349_MAF30_1655727_exon5RosBREEDSNP_SNP_AG_579463_Lg15_00349_MAF30_1655727_exon5genetic_marker
RosBREEDSNP_SNP_CT_829546_Lg10_00106_MAF40_332480_exon1RosBREEDSNP_SNP_CT_829546_Lg10_00106_MAF40_332480_exon1genetic_marker
Pear_Bartlett_RosBREEDSNP_SNP_GA_253_5095476Pear_Bartlett_RosBREEDSNP_SNP_GA_253_5095476genetic_marker
RosBREEDSNP_SNP_TC_2436431_Lg10_01909_MAF40_MDP0000381010_exon1RosBREEDSNP_SNP_TC_2436431_Lg10_01909_MAF40_MDP0000381010_exon1genetic_marker
Pear_Bartlett_RosBREEDSNP_SNP_AG_211_5085563Pear_Bartlett_RosBREEDSNP_SNP_AG_211_5085563genetic_marker
RosBREEDSNP_SNP_CA_31162577_Lg12_01759_MAF40_MDP0000275933_exon3RosBREEDSNP_SNP_CA_31162577_Lg12_01759_MAF40_MDP0000275933_exon3genetic_marker
RosBREEDSNP_SNP_TC_4013035_Lg10_01810_MAF30_MDP0000256461_exon3RosBREEDSNP_SNP_TC_4013035_Lg10_01810_MAF30_MDP0000256461_exon3genetic_marker
RosBREEDSNP_SNP_GA_7199471_Lg10_02051_MAF40_MDP0000169672_exon2RosBREEDSNP_SNP_GA_7199471_Lg10_02051_MAF40_MDP0000169672_exon2genetic_marker
RosBREEDSNP_SNP_CT_7121587_Lg10_02051_MAF30_MDP0000599316_exon1RosBREEDSNP_SNP_CT_7121587_Lg10_02051_MAF30_MDP0000599316_exon1genetic_marker
RosBREEDSNP_SNP_CT_9388261_Lg10_00875_MAF20_MDP0000180840_exon1RosBREEDSNP_SNP_CT_9388261_Lg10_00875_MAF20_MDP0000180840_exon1genetic_marker
RosBREEDSNP_SNP_GT_10702352_Lg10_01867_MAF20_591133_exon1RosBREEDSNP_SNP_GT_10702352_Lg10_01867_MAF20_591133_exon1genetic_marker
Pear_Bartlett_RosBREEDSNP_SNP_AG_218_5091955Pear_Bartlett_RosBREEDSNP_SNP_AG_218_5091955genetic_marker
RosBREEDSNP_SNP_TC_11955594_Lg10_CXE1_MAF20_MDP0000119954_exon3RosBREEDSNP_SNP_TC_11955594_Lg10_CXE1_MAF20_MDP0000119954_exon3genetic_marker
RosBREEDSNP_SNP_TC_16605968_Lg1_02538_MAF20_1649659_exon1RosBREEDSNP_SNP_TC_16605968_Lg1_02538_MAF20_1649659_exon1genetic_marker
Pear_Bartlett_RosBREEDSNP_SNP_AG_222_5060333Pear_Bartlett_RosBREEDSNP_SNP_AG_222_5060333genetic_marker
RosBREEDSNP_SNP_CT_25209921_Lg15_RosCOS588_MAF30_MDP0000689709_exon1RosBREEDSNP_SNP_CT_25209921_Lg15_RosCOS588_MAF30_MDP0000689709_exon1genetic_marker

Pages

Featuremaps
This publication contains information about 2 maps:
Map Name
Apple-M432-2013
Apple-M432-2013-physical-Malus-domestica
Properties
Additional details for this publication include:
Property NameValue
Publication ModelElectronic-Print
ISSN1932-6203
eISSN1932-6203
Publication Date2013
Journal AbbreviationPLoS ONE
LanguageEnglish
Language AbbrENG
Publication TypeJournal Article
Cross References
This publication is also available in the following databases:
DatabaseAccession
PMID: PubMedPMID:23826289