Evaluation of SNP Data from the Malus Infinium Array Identifies Challenges for Genetic Analysis of Complex Genomes of Polyploid Origin

Publication Overview
TitleEvaluation of SNP Data from the Malus Infinium Array Identifies Challenges for Genetic Analysis of Complex Genomes of Polyploid Origin
AuthorsTroggio M, Surbanovski N, Bianco L, Moretto M, Giongo L, Banchi E, Viola R, Fernández FF, Costa F, Velasco R, Cestaro A, Sargent DJ
TypeJournal Article
Journal NamePloS one
Volume8
Issue6
Year2013
Page(s)e67407
CitationTroggio M, Surbanovski N, Bianco L, Moretto M, Giongo L, Banchi E, Viola R, Fernández FF, Costa F, Velasco R, Cestaro A, Sargent DJ. Evaluation of SNP Data from the Malus Infinium Array Identifies Challenges for Genetic Analysis of Complex Genomes of Polyploid Origin. PloS one. 2013; 8(6):e67407.

Abstract

High throughput arrays for the simultaneous genotyping of thousands of single-nucleotide polymorphisms (SNPs) have made the rapid genetic characterisation of plant genomes and the development of saturated linkage maps a realistic prospect for many plant species of agronomic importance. However, the correct calling of SNP genotypes in divergent polyploid genomes using array technology can be problematic due to paralogy, and to divergence in probe sequences causing changes in probe binding efficiencies. An Illumina Infinium II whole-genome genotyping array was recently developed for the cultivated apple and used to develop a molecular linkage map for an apple rootstock progeny (M432), but a large proportion of segregating SNPs were not mapped in the progeny, due to unexpected genotype clustering patterns. To investigate the causes of this unexpected clustering we performed BLAST analysis of all probe sequences against the 'Golden Delicious' genome sequence and discovered evidence for paralogous annealing sites and probe sequence divergence for a high proportion of probes contained on the array. Following visual re-evaluation of the genotyping data generated for 8,788 SNPs for the M432 progeny using the array, we manually re-scored genotypes at 818 loci and mapped a further 797 markers to the M432 linkage map. The newly mapped markers included the majority of those that could not be mapped previously, as well as loci that were previously scored as monomorphic, but which segregated due to divergence leading to heterozygosity in probe annealing sites. An evaluation of the 8,788 probes in a diverse collection of Malus germplasm showed that more than half the probes returned genotype clustering patterns that were difficult or impossible to interpret reliably, highlighting implications for the use of the array in genome-wide association studies.

Features
This publication contains information about 818 features:
Feature NameUniquenameType
RosBREEDSNP_SNP_AG_22846134_Lg1_MdExp7_MAF40_1635125_exon1RosBREEDSNP_SNP_AG_22846134_Lg1_MdExp7_MAF40_1635125_exon1genetic_marker
RosBREEDSNP_SNP_TC_22531181_Lg1_RosCOS1157_MAF30_409923_exon1RosBREEDSNP_SNP_TC_22531181_Lg1_RosCOS1157_MAF30_409923_exon1genetic_marker
RosBREEDSNP_SNP_CT_27490010_Lg14_01213_MAF40_508966_exon2RosBREEDSNP_SNP_CT_27490010_Lg14_01213_MAF40_508966_exon2genetic_marker
RosBREEDSNP_SNP_GA_27487573_Lg14_01213_MAF30_1647799_exon1RosBREEDSNP_SNP_GA_27487573_Lg14_01213_MAF30_1647799_exon1genetic_marker
RosBREEDSNP_SNP_GA_9982432_Lg9_02031_MAF10_MDP0000277628_exon1RosBREEDSNP_SNP_GA_9982432_Lg9_02031_MAF10_MDP0000277628_exon1genetic_marker
RosBREEDSNP_SNP_GA_26714279_Lg1_01228_MAF40_1626564_exon5RosBREEDSNP_SNP_GA_26714279_Lg1_01228_MAF40_1626564_exon5genetic_marker
RosBREEDSNP_SNP_AC_26231142_Lg7_01516_MAF20_901175_exon2RosBREEDSNP_SNP_AC_26231142_Lg7_01516_MAF20_901175_exon2genetic_marker
RosBREEDSNP_SNP_TC_26507937_Lg7_54616_MAF30_54616_exon1RosBREEDSNP_SNP_TC_26507937_Lg7_54616_MAF30_54616_exon1genetic_marker
RosBREEDSNP_SNP_TC_30446117_Lg1_00782_MAF10_679939_exon1RosBREEDSNP_SNP_TC_30446117_Lg1_00782_MAF10_679939_exon1genetic_marker
Pear_Bartlett_RosBREEDSNP_SNP_CT_162_5051543Pear_Bartlett_RosBREEDSNP_SNP_CT_162_5051543genetic_marker
Pear_Bartlett_RosBREEDSNP_SNP_CT_381_5138036Pear_Bartlett_RosBREEDSNP_SNP_CT_381_5138036genetic_marker
FFRJ0CP02D9PSM_143FFRJ0CP02D9PSM_143genetic_marker
Pear_Bartlett_RosBREEDSNP_SNP_GA_382_3657329Pear_Bartlett_RosBREEDSNP_SNP_GA_382_3657329genetic_marker
Pear_Bartlett_RosBREEDSNP_SNP_TC_215_5109742Pear_Bartlett_RosBREEDSNP_SNP_TC_215_5109742genetic_marker
RosBREEDSNP_SNP_CT_5166014_Lg8_01378_MAF30_1645550_exon1RosBREEDSNP_SNP_CT_5166014_Lg8_01378_MAF30_1645550_exon1genetic_marker
RosBREEDSNP_SNP_TC_5120857_Lg8_01378_MAF50_29486_exon1RosBREEDSNP_SNP_TC_5120857_Lg8_01378_MAF50_29486_exon1genetic_marker
Pear_Bartlett_RosBREEDSNP_SNP_TC_1031_5090797Pear_Bartlett_RosBREEDSNP_SNP_TC_1031_5090797genetic_marker
RosBREEDSNP_SNP_AC_5817927_Lg7_00658_MAF50_1685505_exon1RosBREEDSNP_SNP_AC_5817927_Lg7_00658_MAF50_1685505_exon1genetic_marker
RosBREEDSNP_SNP_CT_10562520_Lg8_snpCO066276_MAF50_MDP0000751527_exon3RosBREEDSNP_SNP_CT_10562520_Lg8_snpCO066276_MAF50_MDP0000751527_exon3genetic_marker
RosBREEDSNP_SNP_GA_33541820_Lg10_01683_MAF10_MDP0000744586_exon2RosBREEDSNP_SNP_GA_33541820_Lg10_01683_MAF10_MDP0000744586_exon2genetic_marker
RosBREEDSNP_SNP_CT_12969881_Lg8_01132_MAF30_8636_exon1RosBREEDSNP_SNP_CT_12969881_Lg8_01132_MAF30_8636_exon1genetic_marker
RosBREEDSNP_SNP_GA_13741742_Lg8_RosCOS256_MAF30_70071_exon1RosBREEDSNP_SNP_GA_13741742_Lg8_RosCOS256_MAF30_70071_exon1genetic_marker
RosBREEDSNP_SNP_TC_13739176_Lg8_RosCOS256_MAF30_MDP0000312450_exon2RosBREEDSNP_SNP_TC_13739176_Lg8_RosCOS256_MAF30_MDP0000312450_exon2genetic_marker
RosBREEDSNP_SNP_TG_8522809_Lg10_00260_MAF40_483111_exon1RosBREEDSNP_SNP_TG_8522809_Lg10_00260_MAF40_483111_exon1genetic_marker
RosBREEDSNP_SNP_TC_22820602_Lg4_02025_MAF10_MDP0000185953_exon1RosBREEDSNP_SNP_TC_22820602_Lg4_02025_MAF10_MDP0000185953_exon1genetic_marker

Pages

Featuremaps
This publication contains information about 2 maps:
Map Name
Apple-M432-2013
Apple-M432-2013-physical-Malus-domestica