Comparison of generalized estimating equations and quadratic inference functions using data from the National Longitudinal Survey of Children and Youth (NLSCY) database
نویسندگان
چکیده
BACKGROUND The generalized estimating equations (GEE) technique is often used in longitudinal data modeling, where investigators are interested in population-averaged effects of covariates on responses of interest. GEE involves specifying a model relating covariates to outcomes and a plausible correlation structure between responses at different time periods. While GEE parameter estimates are consistent irrespective of the true underlying correlation structure, the method has some limitations that include challenges with model selection due to lack of absolute goodness-of-fit tests to aid comparisons among several plausible models. The quadratic inference functions (QIF) method extends the capabilities of GEE, while also addressing some GEE limitations. METHODS We conducted a comparative study between GEE and QIF via an illustrative example, using data from the "National Longitudinal Survey of Children and Youth (NLSCY)" database. The NLSCY dataset consists of long-term, population based survey data collected since 1994, and is designed to evaluate the determinants of developmental outcomes in Canadian children. We modeled the relationship between hyperactivity-inattention and gender, age, family functioning, maternal depression symptoms, household income adequacy, maternal immigration status and maternal educational level using GEE and QIF. Basis for comparison include: (1) ease of model selection; (2) sensitivity of results to different working correlation matrices; and (3) efficiency of parameter estimates. RESULTS The sample included 795, 858 respondents (50.3% male; 12% immigrant; 6% from dysfunctional families). QIF analysis reveals that gender (male) (odds ratio [OR] = 1.73; 95% confidence interval [CI] = 1.10 to 2.71), family dysfunctional (OR = 2.84, 95% CI of 1.58 to 5.11), and maternal depression (OR = 2.49, 95% CI of 1.60 to 2.60) are significantly associated with higher odds of hyperactivity-inattention. The results remained robust under GEE modeling. Model selection was facilitated in QIF using a goodness-of-fit statistic. Overall, estimates from QIF were more efficient than those from GEE using AR (1) and Exchangeable working correlation matrices (Relative efficiency = 1.1117; 1.3082 respectively). CONCLUSION QIF is useful for model selection and provides more efficient parameter estimates than GEE. QIF can help investigators obtain more reliable results when used in conjunction with GEE.
منابع مشابه
معادلات برآورد تعمیم یافته و کاربرد آن در تحلیل داده های کولیک شیرخوارگی
Introdiuction and objective: Many Studies about Epidemiology and medical sciences that measured desired outcome at frequently individuals over times are done based on a longitudinal design. the features of a data set is repeated observes . For this reason , requirement of independence observation in longitudinal data is violated that causes this kind of data needs the specific statistical metho...
متن کاملMarginal Analysis of A Population-Based Genetic Association Study of Quantitative Traits with Incomplete Longitudinal Data
A common study to investigate gene-environment interaction is designed to be longitudinal and population-based. Data arising from longitudinal association studies often contain missing responses. Naive analysis without taking missingness into account may produce invalid inference, especially when the missing data mechanism depends on the response process. To address this issue in the ana...
متن کاملDoes Type of Pain Predict Pain Severity Changes in Individuals With Multiple Sclerosis? A Longitudinal Analysis Using Generalized Estimating Equations
Background & Objective: Pain is a common symptom among people with MS. In the majority of MS patients, pain is chronic in nature, but it can change over time. The objective of this study was to determine if pain type can predict pain severity changes in individuals with MS over time. Materials & Methods: The research method was a longitudinal design that evaluated pain type and severity at...
متن کاملEpilepsy, comorbid conditions in Canadian children: Analysis of cross-sectional data from Cycle 3 of the National Longitudinal Study of Children and Youth
PURPOSE The purpose of this study was to analyze national survey data to provide estimates of prevalence of epilepsy and associated developmental disabilities and comorbid conditions. METHODS We analyzed data from Cycle 3 of Canada's National Longitudinal Survey of Children and Youth. The NLSCY captured, socio-demographic information, as well as age, sex, education, ethnicity, household incom...
متن کاملUsing marginal mean models with data from a longitudinal survey having a complex design: some advances in methods
In recent years, longitudinal surveys, where sample subjects are observed over two or more time points, are being undertaken by government agencies in order to provide longitudinal data for analytic studies and to aid in the development of public policy. At Statistics Canada, for example, the National Population Health Survey (NPHS), the National Longitudinal Survey of Children and Youth (NLSCY...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- BMC Medical Research Methodology
دوره 8 شماره
صفحات -
تاریخ انتشار 2008