Omic Data Modelling for Information Retrieval
نویسندگان
چکیده
This study aims at designing a data model managing the vast majority of omic data types, dedicated to information retrieval on two dimensional levels, single patient and multi-patients. The exploitation of scientific health research data, in the search for new biomedical applications, is a promising challenge. Data integration generated by scientific research, particularly omics, with clinical data stored in the Electronic Health Record (EHR) can lead to significant progress in the development of new diagnostic tests and therapies as well as improve our understanding of complex genetic diseases and cancers. Currently, a few solutions already exist such as i2b2 or Transmart. However, they do not handle all main omic data types. Moreover, integrating omic analysis results in EHRs is today mandatory to help clinicians in decision making. This study proposes a generic omic data model dedicated to managing main omic data types (expression data, DNA-methylation, variants, etc.), omic data representation, and information retrieval. Integrating omics data within clinical data involves: (i) identify the different types of omic data, (ii) select relevant information in the context of integration with clinical data then (iii) design an effective omic data model. Four levels of data have been defined according to their level of interpretation. The second level representing interpreted data has been selected to build the data model. Various omic data types have been integrated into a database coupled to clinical data.
منابع مشابه
Improved Skips for Faster Postings List Intersection
Information retrieval can be achieved through computerized processes by generating a list of relevant responses to a query. The document processor, matching function and query analyzer are the main components of an information retrieval system. Document retrieval system is fundamentally based on: Boolean, vector-space, probabilistic, and language models. In this paper, a new methodology for mat...
متن کاملImproved Skips for Faster Postings List Intersection
Information retrieval can be achieved through computerized processes by generating a list of relevant responses to a query. The document processor, matching function and query analyzer are the main components of an information retrieval system. Document retrieval system is fundamentally based on: Boolean, vector-space, probabilistic, and language models. In this paper, a new methodology for mat...
متن کاملIssues of Data Modelling in Information Retrieval
This paper addresses the problem of data modelling in information retrieval. The study introduces various aspects and issues that are necessarily taken into account when designing and developing an information retrieval system. Particular attention is paid to the representation of the different types of data managed by an information retrieval application: structured and unstructured data. A re...
متن کاملRetrieval–travel-time model for free-fall-flow-rack automated storage and retrieval system
Automated storage and retrieval systems (AS/RSs) are material handling systems that are frequently used in manufacturing and distribution centers. The modelling of the retrieval–travel time of an AS/RS (expected product delivery time) is practically important, because it allows us to evaluate and improve the system throughput. The free-fall-flow-rack AS/RS has emerged as a new technology for dr...
متن کاملدیداری کردن نتایج جستوجو در فرایند بازیابی اطلاعات
Purpose: One of the most effective ways to achieve optimum information retrieval is through visualization of Information. Search strategies, probing skills, querying of information needs and analysis of information play a significant role in the accessing of necessary and useful information. Besides the factors mentioned above, information visualization can increase the availability level of in...
متن کامل