An efficient design strategy for logistic regression using outcome- and covariate-dependent pooling of biospecimens prior to assay.

Authors

  • Robert H Lyles
  • Emily M Mitchell
  • Clarice R Weinberg
  • David M Umbach
  • Enrique F Schisterman
Abstract

Potential reductions in laboratory assay costs afforded by pooling equal aliquots of biospecimens have long been recognized in disease surveillance and epidemiological research and, more recently, have motivated design and analytic developments in regression settings. For example, Weinberg and Umbach (1999, Biometrics 55, 718-726) provided methods for fitting set-based logistic regression models to case-control data when a continuous exposure variable (e.g., a biomarker) is assayed on pooled specimens. We focus on improving estimation efficiency by utilizing available subject-specific information at the pool allocation stage. We find that a strategy that we call "(y,c)-pooling," which forms pooling sets of individuals within strata defined jointly by the outcome and other covariates, provides more precise estimation of the risk parameters associated with those covariates than does pooling within strata defined only by the outcome. We review the approach to set-based analysis through offsets developed by Weinberg and Umbach in a recent correction to their original paper. We propose a method for variance estimation under this design and use simulations and a real-data example to illustrate the precision benefits of (y,c)-pooling relative to y-pooling. We also note and illustrate that set-based models permit estimation of covariate interactions with exposure.
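The pool allocation step described in the abstract can be illustrated with a minimal sketch. It assumes subjects are grouped at random into pools within strata, and that a pooled assay on equal aliquots returns the average of the individual exposure values. The function names (`form_pools`, `summarize_pool`), the record fields (`id`, `y`, `c`, `x`), and the handling of leftover subjects are hypothetical choices made for illustration; they are not taken from the paper, and the sketch does not cover the offset-based model fitting or the variance estimation.

```python
import random
from collections import defaultdict


def form_pools(records, pool_size, strata_keys, seed=0):
    """Randomly group records into pools within strata.

    records     : list of dicts, e.g. {"id": 0, "y": 1, "c": 0, "x": 2.3}
    strata_keys : ("y",) for y-pooling, ("y", "c") for (y,c)-pooling
    Leftover subjects in a stratum form one smaller final pool
    (an illustrative choice, not a recommendation from the paper).
    """
    rng = random.Random(seed)
    strata = defaultdict(list)
    for rec in records:
        strata[tuple(rec[k] for k in strata_keys)].append(rec)

    pools = []
    for key in sorted(strata):
        members = list(strata[key])
        rng.shuffle(members)  # random allocation within the stratum
        for start in range(0, len(members), pool_size):
            pools.append(members[start:start + pool_size])
    return pools


def summarize_pool(pool):
    """Collapse one pool into set-level quantities.

    x_mean stands in for the pooled assay result (assumed to equal the
    average of the individual values when equal aliquots are combined).
    """
    size = len(pool)
    return {
        "y": pool[0]["y"],                      # shared outcome within the pool
        "size": size,
        "c_sum": sum(r["c"] for r in pool),     # set-level covariate total
        "x_mean": sum(r["x"] for r in pool) / size,
    }
```

Calling `form_pools(records, 2, ("y", "c"))` gives pools that are homogeneous in both outcome and covariate, whereas `form_pools(records, 2, ("y",))` gives pools that share only the outcome, which is the comparison the abstract draws between (y,c)-pooling and y-pooling.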

Related articles

Specimen Pooling for Efficient Use of Biospecimens in Studies of Time to a Common Event

For case-control studies that rely on expensive assays for biomarkers, specimen pooling offers a cost-effective and efficient way to estimate individual-level odds ratios. Pooling helps to conserve irreplaceable biospecimens for the future, mitigates limit-of-detection problems, and enables inclusion of individuals who have limited available volumes of biospecimen. Pooling can also allow the st...

Addressing data privacy in matched studies via virtual pooling

BACKGROUND Data confidentiality and shared use of research data are two desirable but sometimes conflicting goals in research with multi-center studies and distributed data. While ideal for straightforward analysis, confidentiality restrictions forbid creation of a single dataset that includes covariate information of all participants. Current approaches such as aggregate data sharing, distribu...

Efficient design and analysis of biospecimens with measurements subject to detection limit.

Pooling biospecimens is a well-accepted sampling strategy in biomedical research to reduce the study cost of measuring biomarkers, and has been shown in the case of normally distributed data to yield more efficient estimation. In this paper we examine the efficiency of pooling, in the context of the information matrix related to estimators of unknown parameters, when the biospecimens being pooled yield...

Maximum likelihood analysis of the logistic regression model when predictor variable data are incomplete but auxiliary variables are available

Background and Objectives: Missing data exist in many studies, e.g. in regression models, and they decrease the model's efficacy. Many methods have been suggested for handling incomplete data; they have generally focused on missing outcome values. But covariate values can also be missing. Materials and Methods: In this paper we study the imputation of missing values by the EM algorithm and auxiliary varia...

Journal:
  • Biometrics

Volume 72, Issue 3

Publication date: 2016