Evaluation of SNP Data from the Malus Infinium Array Identifies Challenges for Genetic Analysis of Complex Genomes of Polyploid Origin

Publication Overview
Title         Evaluation of SNP Data from the Malus Infinium Array Identifies Challenges for Genetic Analysis of Complex Genomes of Polyploid Origin
Authors       Troggio M, Surbanovski N, Bianco L, Moretto M, Giongo L, Banchi E, Viola R, Fernández FF, Costa F, Velasco R, Cestaro A, Sargent DJ
Type          Journal Article
Journal Name  PLoS ONE
Volume        8
Issue         6
Year          2013
Page(s)       e67407
Citation      Troggio M, Surbanovski N, Bianco L, Moretto M, Giongo L, Banchi E, Viola R, Fernández FF, Costa F, Velasco R, Cestaro A, Sargent DJ. Evaluation of SNP Data from the Malus Infinium Array Identifies Challenges for Genetic Analysis of Complex Genomes of Polyploid Origin. PLoS ONE. 2013; 8(6):e67407.

Abstract

High throughput arrays for the simultaneous genotyping of thousands of single-nucleotide polymorphisms (SNPs) have made the rapid genetic characterisation of plant genomes and the development of saturated linkage maps a realistic prospect for many plant species of agronomic importance. However, the correct calling of SNP genotypes in divergent polyploid genomes using array technology can be problematic due to paralogy, and to divergence in probe sequences causing changes in probe binding efficiencies. An Illumina Infinium II whole-genome genotyping array was recently developed for the cultivated apple and used to develop a molecular linkage map for an apple rootstock progeny (M432), but a large proportion of segregating SNPs were not mapped in the progeny, due to unexpected genotype clustering patterns. To investigate the causes of this unexpected clustering we performed BLAST analysis of all probe sequences against the 'Golden Delicious' genome sequence and discovered evidence for paralogous annealing sites and probe sequence divergence for a high proportion of probes contained on the array. Following visual re-evaluation of the genotyping data generated for 8,788 SNPs for the M432 progeny using the array, we manually re-scored genotypes at 818 loci and mapped a further 797 markers to the M432 linkage map. The newly mapped markers included the majority of those that could not be mapped previously, as well as loci that were previously scored as monomorphic, but which segregated due to divergence leading to heterozygosity in probe annealing sites. An evaluation of the 8,788 probes in a diverse collection of Malus germplasm showed that more than half the probes returned genotype clustering patterns that were difficult or impossible to interpret reliably, highlighting implications for the use of the array in genome-wide association studies.
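
The probe-level BLAST check described in the abstract can be illustrated with a minimal sketch. It assumes the hits are available as tabular BLAST output in the standard 12-column layout (query ID, subject ID, percent identity, alignment length, and so on). The function name `classify_probes`, the 50-base probe length, the coverage threshold, the three labels and the sample rows are all illustrative assumptions, not values or methods taken from the paper.

```python
from collections import defaultdict

def classify_probes(blast_lines, probe_len=50, min_cover=0.9):
    """Label each probe from tabular BLAST hits (hypothetical scheme).

    'paralogous' : two or more near-full-length hits
    'divergent'  : one near-full-length hit below 100% identity
    'unique'     : one near-full-length hit at 100% identity
    Probes with no near-full-length hit are absent from the result.
    """
    hits = defaultdict(list)
    for line in blast_lines:
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comment lines
        cols = line.split("\t")
        probe, pident, aln_len = cols[0], float(cols[2]), int(cols[3])
        # Keep only hits that cover most of the probe (assumed threshold).
        if aln_len >= min_cover * probe_len:
            hits[probe].append(pident)

    labels = {}
    for probe, idents in hits.items():
        if len(idents) > 1:
            labels[probe] = "paralogous"
        elif idents[0] < 100.0:
            labels[probe] = "divergent"
        else:
            labels[probe] = "unique"
    return labels

# Made-up rows for illustration only; these are not data from the study.
SAMPLE_HITS = [
    "probeA\tchr3\t100.00\t50\t0\t0\t1\t50\t1000\t1049\t1e-20\t93.5",
    "probeB\tchr7\t96.00\t50\t2\t0\t1\t50\t500\t549\t1e-15\t80.1",
    "probeC\tchr5\t100.00\t50\t0\t0\t1\t50\t200\t249\t1e-20\t93.5",
    "probeC\tchr10\t98.00\t49\t1\t0\t1\t49\t700\t748\t1e-18\t87.0",
    "probeD\tchr1\t100.00\t20\t0\t0\t1\t20\t900\t919\t1e-03\t37.4",
]

labels = classify_probes(SAMPLE_HITS)
```

A count of hits per probe is only a first screen. The re-scoring reported in the abstract relied on visual inspection of genotype clusters, which this sketch does not attempt.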

Features
This publication contains information about 818 features:
Feature Name    Uniquename    Type
RosBREEDSNP_SNP_AG_31102332_Lg3_01779_MAF20_43168_exon1    RosBREEDSNP_SNP_AG_31102332_Lg3_01779_MAF20_43168_exon1    genetic_marker
RosBREEDSNP_SNP_AG_32713288_Lg3_01329_MAF30_220555_exon2    RosBREEDSNP_SNP_AG_32713288_Lg3_01329_MAF30_220555_exon2    genetic_marker
RosBREEDSNP_SNP_GT_33284134_Lg3_01667_MAF20_MDP0000260760_exon1    RosBREEDSNP_SNP_GT_33284134_Lg3_01667_MAF20_MDP0000260760_exon1    genetic_marker
RosBREEDSNP_SNP_GA_29948697_Lg3_01937_MAF20_767709_exon1    RosBREEDSNP_SNP_GA_29948697_Lg3_01937_MAF20_767709_exon1    genetic_marker
RosBREEDSNP_SNP_CT_8281738_Lg7_RosCOS2129_MAF20_510412_exon1    RosBREEDSNP_SNP_CT_8281738_Lg7_RosCOS2129_MAF20_510412_exon1    genetic_marker
RosBREEDSNP_SNP_TC_18207227_Lg8_02689_MAF40_MDP0000311217_exon1    RosBREEDSNP_SNP_TC_18207227_Lg8_02689_MAF40_MDP0000311217_exon1    genetic_marker
RosBREEDSNP_SNP_TC_2791748_Lg10_00056_MAF50_1665906_exon1    RosBREEDSNP_SNP_TC_2791748_Lg10_00056_MAF50_1665906_exon1    genetic_marker
RosBREEDSNP_SNP_GA_10720575_Lg10_01867_MAF40_1682345_exon2    RosBREEDSNP_SNP_GA_10720575_Lg10_01867_MAF40_1682345_exon2    genetic_marker
RosBREEDSNP_SNP_CT_14946443_Lg10_00015_MAF10_MDP0000505985_exon1    RosBREEDSNP_SNP_CT_14946443_Lg10_00015_MAF10_MDP0000505985_exon1    genetic_marker
RosBREEDSNP_SNP_CT_8416086_Lg11_00185_MAF30_MDP0000729342_exon1    RosBREEDSNP_SNP_CT_8416086_Lg11_00185_MAF30_MDP0000729342_exon1    genetic_marker
RosBREEDSNP_SNP_GA_8445216_Lg11_00185_MAF40_531216_exon5    RosBREEDSNP_SNP_GA_8445216_Lg11_00185_MAF40_531216_exon5    genetic_marker
RosBREEDSNP_SNP_AG_21721114_Lg13_02069_MAF30_1664580_exon1    RosBREEDSNP_SNP_AG_21721114_Lg13_02069_MAF30_1664580_exon1    genetic_marker
RosBREEDSNP_SNP_GA_8352805_Lg15_226490_MAF40_226490_exon1    RosBREEDSNP_SNP_GA_8352805_Lg15_226490_MAF40_226490_exon1    genetic_marker
RosBREEDSNP_SNP_GA_24825508_Lg17_00341_MAF50_MDP0000296152_exon3    RosBREEDSNP_SNP_GA_24825508_Lg17_00341_MAF50_MDP0000296152_exon3    genetic_marker
Pear_Bartlett_RosBREEDSNP_SNP_GA_520_5060110    Pear_Bartlett_RosBREEDSNP_SNP_GA_520_5060110    genetic_marker
FFRJ0CP02D06FA_314    FFRJ0CP02D06FA_314    genetic_marker
FFRJ0CP02DN2YW_251    FFRJ0CP02DN2YW_251    genetic_marker
FFRJ0CP02EX84K_196    FFRJ0CP02EX84K_196    genetic_marker
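
Most feature names listed above share a regular underscore-delimited pattern, which can be split into fields programmatically. The sketch below is a minimal illustration: the field names (`alleles`, `position`, `linkage_group`, `tag`, `maf_bin`, `source`) are guesses inferred from the pattern alone, not a documented naming specification, and `parse_marker_name` is a hypothetical helper.

```python
import re

# Pattern inferred from names such as
# RosBREEDSNP_SNP_AG_31102332_Lg3_01779_MAF20_43168_exon1.
# The meaning attached to each group is an assumption.
NAME_RE = re.compile(
    r"^RosBREEDSNP_SNP_(?P<alleles>[ACGT]{2})_(?P<position>\d+)"
    r"_Lg(?P<linkage_group>\d+)_(?P<tag>[^_]+)_MAF(?P<maf_bin>\d+)"
    r"_(?P<source>.+)$"
)

def parse_marker_name(name):
    """Return a dict of fields, or None if the name does not fit the pattern."""
    m = NAME_RE.match(name)
    if m is None:
        return None
    fields = m.groupdict()
    # Convert the purely numeric fields to integers.
    for key in ("position", "linkage_group", "maf_bin"):
        fields[key] = int(fields[key])
    return fields

parsed = parse_marker_name(
    "RosBREEDSNP_SNP_CT_8281738_Lg7_RosCOS2129_MAF20_510412_exon1"
)
unparsed = parse_marker_name("FFRJ0CP02D06FA_314")
```

Names that follow a different scheme, such as the `Pear_Bartlett_...` and `FFRJ0CP02...` entries, return `None` and would need their own handling.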

Featuremaps
This publication contains information about 2 maps:
Map Name
Apple-M432-2013
Apple-M432-2013-physical-Malus-domestica
Properties
Additional details for this publication include:
Property Name         Value
Publication Model     Electronic-Print
ISSN                  1932-6203
eISSN                 1932-6203
Publication Date      2013
Journal Abbreviation  PLoS ONE
Language              English
Language Abbr         ENG
Publication Type      Journal Article
Cross References
This publication is also available in the following databases:
Database      Accession
PMID: PubMed  PMID:23826289