Temporal group linkage and evolution analysis for census data
Authors
Abstract
The temporal linkage of census data allows the detailed analysis of population-related changes in an area of interest. It should not only link records about the same person but also support the linkage of groups of related persons, such as households. In this paper, we therefore propose a new approach to both temporal record linkage and group (household) linkage for census data and study its application to change analysis. The approach is graph-based: it uses the relationships between individuals to determine the similarity of groups and of their members. It is also iterative: it first identifies high-quality matches and then extends them with matches found under less restrictive similarity criteria. A comprehensive evaluation using historical census data from the UK indicates that the proposed approach is highly effective. Furthermore, the linkage enables an insightful analysis of household changes, expressed as so-called evolution patterns.
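The iterative idea in the abstract (accept strict matches first, then relax the criteria) can be illustrated with a minimal sketch. This is not the paper's graph-based method: the record fields, the similarity weights, the thresholds, and the household bonus below are all hypothetical choices made for illustration, and the household bonus is only a rough stand-in for using relationships between group members.

```python
from difflib import SequenceMatcher


def name_sim(a, b):
    """String similarity in [0, 1] between two names."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def record_sim(old, new, gap_years):
    """Hypothetical score: 70% name similarity, 30% age consistency."""
    name = (name_sim(old["first"], new["first"])
            + name_sim(old["last"], new["last"])) / 2
    age_ok = 1.0 if abs(old["age"] + gap_years - new["age"]) <= 2 else 0.0
    return 0.7 * name + 0.3 * age_ok


def iterative_link(old_records, new_records, gap_years=10,
                   thresholds=(0.9, 0.7), bonus=0.1):
    """Link records in rounds, from the strictest threshold to the loosest.

    Pairs whose households were already connected by a match from an
    earlier round receive a small bonus (an assumption of this sketch).
    """
    links = {}                 # old record id -> new record id
    linked_households = set()  # (old household id, new household id)
    for threshold in thresholds:
        matched_new = set(links.values())
        candidates = []
        for o in old_records:
            if o["id"] in links:
                continue
            for n in new_records:
                if n["id"] in matched_new:
                    continue
                score = record_sim(o, n, gap_years)
                if (o["hh"], n["hh"]) in linked_households:
                    score = min(1.0, score + bonus)
                if score >= threshold:
                    candidates.append((score, o["id"], n["id"], o["hh"], n["hh"]))
        # Greedy one-to-one assignment, best-scoring pairs first.
        for score, oid, nid, ohh, nhh in sorted(candidates, reverse=True):
            if oid in links or nid in matched_new:
                continue
            links[oid] = nid
            matched_new.add(nid)
            linked_households.add((ohh, nhh))
    return links


# Made-up example: one household observed in two censuses ten years apart.
census_old = [
    {"id": "o1", "hh": "A", "first": "John", "last": "Smith", "age": 30},
    {"id": "o2", "hh": "A", "first": "Mary", "last": "Smith", "age": 28},
    {"id": "o3", "hh": "A", "first": "Tom", "last": "Smith", "age": 2},
]
census_new = [
    {"id": "n1", "hh": "X", "first": "John", "last": "Smith", "age": 40},
    {"id": "n2", "hh": "X", "first": "Marie", "last": "Smyth", "age": 38},
    {"id": "n3", "hh": "X", "first": "Thomas", "last": "Smith", "age": 12},
]

strict_only = iterative_link(census_old, census_new, thresholds=(0.9,))
links = iterative_link(census_old, census_new)
```

With these sample records, the strict round alone links only the exact match (John Smith), while the relaxed second round also links the two records with spelling variations.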
Similar resources
Linking 2006 Census and hospital data in Canada.
BACKGROUND Record linkage is commonly used in health research to fill data gaps. This study summarizes the linkage of the 2006 Census of Population (excluding Quebec) to hospital data from the Discharge Abstract Database (DAD). DATA AND METHODS Hierarchical deterministic exact matching was employed to link 2006 Census and DAD (2006/2007, 2007/2008 and 2008/2009) data, based on linkage keys de...
A Supervised Learning and Group Linking Method for Historical Census Household Linkage
Historical census data provide a snapshot of the era when our ancestors lived. Such data contain valuable information that allows the reconstruction of households and the tracking of family changes across time, allows the analysis of family diseases, and facilitates a variety of social science research. One particular topic of interest in historical census data analysis are households and linki...
Multiple Instance Learning for Group Record Linkage
Record linkage is the process of identifying records that refer to the same entities from different data sources. While most research efforts are concerned with linking individual records, new approaches have recently been proposed to link groups of records across databases. Group record linkage aims to determine if two groups of records in two databases refer to the same entity or not. One app...
Improving Temporal Record Linkage Using Regression Classification
Temporal record linkage is the process of identifying groups of records that are collected over a period of time, such as in census or voter registration databases, where records in the same group represent the same real-world entity. Such databases often contain temporal information, such as the time when a record was created or when it was modified. Unlike traditional record linkage, which co...
Application of Advanced Record Linkage Techniques for Complex Population Reconstruction
Record linkage is the process of identifying records that refer to the same entities from several databases. This process is challenging because commonly no unique entity identifiers are available. Linkage therefore has to rely on partially identifying attributes, such as names and addresses of people. Recent years have seen the development of novel techniques for linking data from diverse appl...