Beyond crosswalks: reliability of exposure assessment following automated coding of free-text job descriptions for occupational epidemiology.
Abstract
Epidemiologists typically collect narrative descriptions of occupational histories because these are less prone to recall bias than self-reported exposures to a specific hazard. However, coding these narratives can be daunting and prohibitively time-consuming in some settings. The aim of this manuscript is to evaluate the performance of a computer algorithm that translates narrative descriptions of occupations into a standard classification of jobs (2010 Standard Occupational Classification) in an epidemiological context. The fundamental question we address is whether exposure assignment resulting from manual (presumed gold standard) coding of the narratives is materially different from that arising from automated coding. We pursued our work through three motivating examples: assessment of physical demands in the Women's Health Initiative observational study, evaluation of predictors of exposure to coal tar pitch volatiles in the US Occupational Safety and Health Administration's (OSHA) Integrated Management Information System, and assessment of exposure to agents known to cause occupational asthma in a pregnancy cohort. In these diverse settings, we demonstrate that automated coding of occupations results in assignment of exposures that are in reasonable agreement with results that can be obtained through manual coding. The correlation between physical demand scores based on manual and automated job classification schemes was reasonable (r = 0.5). The agreement between the predictive probability of exceeding OSHA's permissible exposure level for polycyclic aromatic hydrocarbons, using coal tar pitch volatiles as a surrogate, based on manual and automated coding of jobs was modest (Kendall rank correlation = 0.29).
In the case of binary assignment of exposure to asthmagens, we observed that fair to excellent agreement in classifications can be reached, depending on the presence of ambiguity in the assigned job classification (κ = 0.5-0.8). Thus, the success of automated coding appears to depend on the setting and the type of exposure being assessed. Our overall recommendation is that automated translation of short narrative descriptions of jobs for exposure assessment is feasible in some settings and essential for large cohorts, especially if combined with manual coding both to assess the reliability of coding and to further refine the coding algorithm.
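The agreement statistics named in the abstract (Cohen's κ for binary exposure assignment, Kendall rank correlation for ordered scores) can be illustrated with a minimal sketch. It assumes each job has already been assigned an exposure by both the manual and the automated coding route. The function names and the toy data are hypothetical and are not taken from the study, and the abstract does not state which software or which variants of these statistics the authors used.

```python
from collections import Counter
from itertools import combinations
from math import sqrt


def cohen_kappa(manual, automated):
    """Cohen's kappa for two sets of categorical labels on the same items."""
    if len(manual) != len(automated) or not manual:
        raise ValueError("need two non-empty label lists of equal length")
    n = len(manual)
    # Observed proportion of items on which the two codings agree.
    observed = sum(m == a for m, a in zip(manual, automated)) / n
    manual_counts = Counter(manual)
    auto_counts = Counter(automated)
    # Chance agreement from the product of each coding's marginal proportions.
    expected = sum(
        manual_counts[label] * auto_counts[label]
        for label in set(manual) | set(automated)
    ) / (n * n)
    if expected == 1.0:
        raise ValueError("kappa is undefined when chance agreement equals 1")
    return (observed - expected) / (1.0 - expected)


def kendall_tau_b(x, y):
    """Kendall's tau-b rank correlation (O(n^2) pair count, tie-corrected)."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two score lists of equal length >= 2")
    concordant = discordant = tied_x_only = tied_y_only = 0
    for (x1, y1), (x2, y2) in combinations(zip(x, y), 2):
        dx, dy = x1 - x2, y1 - y2
        if dx == 0 and dy == 0:
            continue  # tied on both scores: excluded from every count
        if dx == 0:
            tied_x_only += 1
        elif dy == 0:
            tied_y_only += 1
        elif dx * dy > 0:
            concordant += 1
        else:
            discordant += 1
    untied = concordant + discordant
    denominator = sqrt((untied + tied_x_only) * (untied + tied_y_only))
    if denominator == 0:
        raise ValueError("tau-b is undefined when one score is constant")
    return (concordant - discordant) / denominator


if __name__ == "__main__":
    # Toy data only: 1 = exposed, 0 = unexposed, one entry per job.
    manual_exposed = [1, 1, 0, 0, 1, 0, 1, 0, 0, 0]
    automated_exposed = [1, 0, 0, 0, 1, 0, 1, 1, 0, 0]
    print("kappa:", round(cohen_kappa(manual_exposed, automated_exposed), 3))

    # Toy ordered scores (e.g. a demand score) under each coding route.
    manual_score = [1, 2, 3, 4]
    automated_score = [1, 3, 2, 4]
    print("tau-b:", round(kendall_tau_b(manual_score, automated_score), 3))
```

κ corrects raw percent agreement for the agreement expected by chance, so two codings can agree on most jobs and still yield a moderate κ when exposure is rare.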
Related articles
Commentary: standardized coding of occupational data in epidemiological studies.
The evaluation of occupational exposures in epidemiological studies is complex because of the multiple potential exposures in the workplace, the varying determinants of exposure between people, the many jobs people hold in a lifetime, and the different reasons for taking or leaving a job. Mannetje and Kromhout 1 show that beyond these well-recognized difficulties there are several more basic is...
Retrospective assessment of occupational exposure to chemicals in community-based studies: validity and repeatability of industrial hygiene panel ratings.
BACKGROUND Occupational hygiene panels are increasingly being used to rate retrospective occupational exposures to chemicals in community-based studies. This study aimed to assess the validity, reliability and feasibility of using such an expert panel in a brain tumour case-control study. METHODS A panel of five experts was recruited to rate exposure to 21 chemicals for 298 job descriptions t...
What do measures of agreement (κ) tell us about quality of exposure assessment? Theoretical analysis and numerical simulation
BACKGROUND The reliability of binary exposure classification methods is routinely reported in occupational health literature because it is viewed as an important component of evaluating the trustworthiness of the exposure assessment by experts. The Kappa statistics (κ) are typically employed to assess how well raters or classification systems agree in a variety of contexts, such as identifying ...
Reliability and Validity Assessment of the Persian Version of the Noise Exposure Questionnaire (NEQ): An NIHL Predictor Tool
Background: Noise and noise-induced hearing loss (NIHL) are the most prevalent workplace problems. The best way to prevent NIHL is to monitor people's annual noise exposure (ANE) using tools, such as questionnaires. The present study aims to assess reliability of the Persian version of the Noise Exposure Questionnaire (NEQ) and NIHL scores among workers. Materials & Methods: This descriptive s...
Evaluation of the quality of coding of job episodes collected by self questionnaires among French retired men for use in a job-exposure matrix.
BACKGROUND and AIMS The ESPACES study was intended to identify retirees who may have been, according to their job descriptions, exposed to asbestos during their working lives. As part of this study, we analysed the quality of the occupation and activity sector coding as well as its effect on the subjects' exposure status. METHODS The occupation and activity sector for a sample of 450 retire...
Journal title:
- The Annals of Occupational Hygiene
Volume 58, Issue 4
Pages: -
Publication date: 2014