A Lasso multi-marker mixed model for association mapping with population structure correction

Authors

  • Barbara Rakitsch
  • Christoph Lippert
  • Oliver Stegle
  • Karsten M. Borgwardt
Abstract

MOTIVATION: Exploring the genetic basis of heritable traits remains one of the central challenges in biomedical research. In traits with simple Mendelian architectures, single polymorphic loci explain a significant fraction of the phenotypic variability. However, many traits of interest seem to be subject to multifactorial control by groups of genetic loci. Accurate detection of such multivariate associations is non-trivial and often compromised by limited statistical power. At the same time, confounding influences, such as population structure, cause spurious association signals that result in false-positive findings.

RESULTS: We propose LMM-Lasso, a mixed model that allows for both multi-locus mapping and correction for confounding effects. Our approach is simple and free of tuning parameters; it effectively controls for population structure and scales to genome-wide datasets. LMM-Lasso simultaneously discovers likely causal variants and allows for multi-marker-based phenotype prediction from genotype. We demonstrate the practical use of LMM-Lasso in genome-wide association studies in Arabidopsis thaliana and linkage mapping in mouse, where our method achieves significantly more accurate phenotype prediction for 91% of the considered phenotypes. At the same time, our model dissects the phenotypic variability into components that result from individual single nucleotide polymorphism effects and population structure. Enrichment of known candidate genes suggests that the individual associations retrieved by LMM-Lasso are likely to be genuine.

AVAILABILITY: Code available under http://webdav.tuebingen.mpg.de/u/karsten/Forschung/research.html.

CONTACT: [email protected], [email protected] or [email protected]

SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
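The abstract describes a mixed model that combines multi-locus mapping with correction for population structure. One common way to write a model of this kind is sketched below. The notation is an assumption made for illustration and is not taken from this page: y is the phenotype vector, X the genotype matrix, β the sparse marker effects, u a random effect with kinship-like covariance K, and λ the strength of the sparsity penalty. How these quantities are estimated or chosen is not stated here.

```latex
% Illustrative sketch only; all symbols are assumed notation, not quoted from the source.
% y: phenotypes, X: genotypes, \beta: marker effects (sparse),
% u: confounding random effect with covariance \sigma_g^2 K, \varepsilon: residual noise.
\begin{align}
  y &= X\beta + u + \varepsilon, \qquad
  u \sim \mathcal{N}(0, \sigma_g^2 K), \quad
  \varepsilon \sim \mathcal{N}(0, \sigma_e^2 I), \\
  % Marginalizing u gives a Gaussian likelihood with covariance \sigma_g^2 K + \sigma_e^2 I;
  % an L1 (Lasso) penalty on \beta encourages a small set of selected markers.
  \hat{\beta} &= \arg\min_{\beta}\;
  \tfrac{1}{2}\,(y - X\beta)^{\top}\bigl(\sigma_g^2 K + \sigma_e^2 I\bigr)^{-1}(y - X\beta)
  + \lambda \lVert \beta \rVert_1 .
\end{align}
```

Read this way, the term involving K absorbs phenotypic variance attributable to population structure, while the nonzero entries of β correspond to the individual marker effects that the abstract refers to.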


Related resources

LMM-Lasso: A Lasso Multi-Marker Mixed Model for Association Mapping with Population Structure Correction

Motivation: Exploring the genetic basis of heritable traits remains one of the central challenges in biomedical research. In traits with simple Mendelian architectures, single polymorphic loci explain a significant fraction of the phenotypic variability. However, many traits of interest appear to be subject to multifactorial control by groups of genetic loci. Accurate detection of such multivar...

Association mapping for resistance to powdery mildew in oriental tobacco (Nicotiana tabacum L.) germplasm

Powdery mildew, caused by Erysiphe cichoracearum, is an important fungal disease that threatens tobacco (Nicotiana tabacum L.) production. The objective of this study was to identify DNA markers linked to genomic regions associated with resistance to powdery mildew in tobacco through the association mapping approach. Seventy tobacco genotypes were fingerprinted using 26 simple sequence repeat...

A multi-marker association method for genome-wide association studies without the need for population structure correction

All common genome-wide association (GWA) methods rely on population structure correction to avoid false genotype-to-phenotype associations. However, population structure correction is a stringent penalization that also impedes identification of real associations. Using recent statistical advances, we developed a new GWA method, called Quantitative Trait Cluster Association Test (QTCAT), enab...

Association mapping of some phenological traits in barley under salinity stress

This study was performed to identify molecular markers associated with phenological traits, including days to tillering, days to stem elongation, days to heading, days from stem elongation to heading, grain filling period and days to physiological maturity, based on 407 AFLP and SSR markers in 148 barley cultivars by association mapping. This experiment was conducted in two alpha lattice desi...

Association mapping of agronomic traits in oriental tobacco (Nicotiana tabacum L.)

Tobacco (Nicotiana tabacum L.) is a valuable agricultural and industrial crop. Studying the most important traits of tobacco is difficult because they are quantitative, controlled by multiple genes and affected by environmental factors. Among the various methods for studying quantitative traits, association mapping, which utilizes phenotypic and DNA marker information, is one of the ef...

Journal title:
  • Bioinformatics

Volume 29, Issue 2

Pages: -

Publication date: 2013