XVIth QTLMAS: simulated dataset and comparative analysis of submitted results for QTL mapping and genomic evaluation
نویسندگان
چکیده
BACKGROUND A common dataset was simulated and made available to participants of the XVI(th) QTL-MAS workshop. Tasks for the participants were to detect QTLs affecting three traits, to assess their possible pleiotropic effects, and to evaluate the breeding values in a candidate population without phenotypes using genomic information. METHODS Four generations consisting of 20 males and 1000 females were generated by mating each male with 50 females. The genome consisted of 5 chromosomes, each of 100 Mb size and carrying 2,000 equally distributed SNPs. Three traits were simulated in order to mimic milk yield, fat yield and fat content. Genetic (co)variances were generated from 50 QTLs with pleiotropic effects. Phenotypes for all traits were expressed only in females, and were provided for the first 3 generations. Fourteen methods for detecting single-trait QTL and 3 methods for investigating their pleiotropic nature were proposed. QTL mapping results were compared according to the following criteria: number of true QTL detected; number of false positives; and the proportion of the true genetic variance explained by submitted positions. Eleven methods for estimating direct genomic values of the candidate population were proposed. Accuracies and bias of predictions were assessed by comparing estimated direct genomic values with true breeding values. RESULTS The number of true detections ranged from 0 to 8 across methods and traits, false positives from 0 to 15, and the proportion of genetic variance captured from 0 to 0.82, respectively. The accuracy and bias of genomic predictions varied from 0.74 to 0.85 and from 0.86 to 1.34 across traits and methods, respectively. CONCLUSIONS The best results in terms of detection power were obtained by ridge regression that, however, led to a large number of false positives. Good results both in terms of true detections and false positives were obtained by the approaches that fit polygenic effects in the model. The investigation of the pleiotropic nature of the QTL permitted the identification of few additional markers compared to the single-trait analyses. Bayesian and grouped regularized regression methods performed similarly for genomic prediction while GBLUP produced the poorest results.
منابع مشابه
Comparison of analyses of the QTLMAS XII common dataset. II: genome-wide association and fine mapping
As part of the QTLMAS XII workshop, a simulated dataset was distributed and participants were invited to submit analyses of the data based on genome-wide association, fine mapping and genomic selection. We have evaluated the findings from the groups that reported fine mapping and genome-wide association (GWA) efforts to map quantitative trait loci (QTL). Generally the power to detect QTL was hi...
متن کاملA Bayesian QTL linkage analysis of the common dataset from the 12th QTLMAS workshop
BACKGROUND To compare the power of various QTL mapping methodologies, a dataset was simulated within the framework of 12th QTLMAS workshop. A total of 5865 diploid individuals was simulated, spanning seven generations, with known pedigree. Individuals were genotyped for 6000 SNPs across six chromosomes. We present an illustration of a Bayesian QTL linkage analysis, as implemented in the special...
متن کاملComparison of analyses of the QTLMAS XIV common dataset. II: QTL analysis
BACKGROUND A quantitative and a binary trait for the 14th QTLMAS 2010 workshop were simulated under a model which combined additive inheritance, epistasis and imprinting. This paper aimed to compare results submitted by the participants of the workshop. METHODS The results were compared according to three criteria: the success rate (ratio of mapped QTL to the total number of simulated QTL), a...
متن کاملGenomic Selection GENOMIC SELECTION USING A FAST EM ALGORITHM 2. ANALYSIS OF SIMULATED DATA
The paper reports on a fast EM algorithm for genomic selection by mapping QTL in genomewide dense SNP marker data. The algorithm called emBayesB was used to analyse a 6000 SNP dataset simulated for the QTLMAS XII workshop. True breeding value was accurately predicted by GEBV with a correlation of 0.85 in the validation data, while the regression coefficient of 0.97 indicated unbiased prediction...
متن کاملQTLMAS 2010: simulated dataset
BACKGROUND Objective was to simulate the data for the QTLMAS 2010 Workshop under a model that includes major additive, epistatic and parent-of-origin effects. RESULTS Data were simulated for 3226 individuals in 5 generations. Genomic data for 5 chromosomes were simulated using coalescent model. In total, the data included 10,031 SNPs, 30 additive QTLs, 2 interacting QTL pairs, and 3 imprinted...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره 8 شماره
صفحات -
تاریخ انتشار 2014