Combining applications and remote databases view in a common SQL distributed genomic database

Authors

  • Pierre-Emmanuel Gros
  • Joan Hérisson
  • Nicolas Férey
  • Rachid Gherbi
Abstract

Huge volumes of bioinformatics data are emerging from sequencing efforts, gene expression assays, X-ray crystallography of proteins, and many other activities. High-throughput experimental methods produce masses of data, so that the whole of biology has changed from a data-light science into a data-driven science. Many databases and software tools now deal with these genomic data. In general, each tool and database uses a different type of data in its exchange protocols, and usually offers its own specific services. These databases are designed with different languages and run on different operating systems. Biologists are therefore in a difficult situation: they have to use, process, and store heterogeneous data while working with heterogeneous software tools and databases. Our framework, GenoMEDIA, provides two main middleware components to help with this integration, Lydia and Antje. On the one hand, the Lydia middleware offers facilities for working simultaneously with a variety of services and databases. On the other hand, the Antje middleware, built on the concept of a remote view, is designed to let users manage multiple heterogeneous remote databases through a uniform view. The aim of this paper is to present GenoMEDIA and how heterogeneous databases and remote services are integrated, in particular how Antje was designed, implemented, and tested with various genomic databases.

Similar resources

Distributed Query Processing for Structured and Bibliographic Databases

To support future digital library systems which draw information from different sources on the internet, we have to provide integrated queries to pre-existing database servers which contain structured, semistructured and unstructured data. In this paper, we specifically examine the problem of querying both existing structured relational databases and bibliographic databases. By adopting the well...

Zatara, the Plug-in-able Eventually Consistent Distributed Database

With the proliferation of the computer Cloud, new software delivery methods were created. In order to build software to fit into one of these models, a scalable, easy to deploy storage tier is required. Distributed, non-SQL databases use multiple techniques to distribute information, guarantee data consistency and grow, but unfortunately most developments were designed with a single class of ap...

Remote Batch Invocation for SQL Databases

Batch services are a new approach to distributed computation in which clients send batches of operations for execution on a server and receive hierarchical results sets in response. In this paper we show how batch services provide a simple and powerful interface to relational databases, with support for arbitrary nested queries and bulk updates. One important property of the system is that a si...

Object SQL - A Language for the Design and Implementation of Object Databases

Keywords: object databases, query language, information services, distributed environment, relational databases. Object SQL (OSQL) is a language for the design and implementation of object databases. The OSQL language is computationally complete and provides a rich set of constructs that allow definition, implementation and integration of information services in a distributed environment. It also provides...

Supporting Information Fusion with Federated Database Technologies (Position Paper)

A common problem facing many users today is to extract and combine information from multiple, heterogeneous sources and to derive information of a new quality or abstraction level. Though essential parts of this information fusion process can be supported by techniques developed in the field of federated databases, new approaches for managing consistency, uncertainty or quality of data and enabli...

Journal title:
  • Data Science Journal

Volume 4

Publication date: 2005