PAQS Data Sources, Exchange Formats, and Database Schemas for a Proteo-Chemometric Analysis and Query System
Abstract
The research described in this thesis is part of a project to develop a new database system for proteo-chemometric research. The new system uses a mediator/wrapper approach to integrate heterogeneous and autonomous data sources. Special-purpose modules for data representation and data analysis can be incorporated into the system through the extensibility of the object-relational mediator. Life science data sources and data exchange formats for the new Proteo-chemometric Analysis and Query System (PAQS) have been surveyed. Although important data sources exist in many different formats, the trend towards XML is evident. For proteo-chemometric research it is important to be able to access data sources with binding affinity data. Most such data sources are accessible only via web forms, which limits the query capabilities. Database schemas for parts of the proteo-chemometric information domain have been developed within a functional data model with object-oriented extensions. These schemas have also been implemented in the Amos II system as a first-stage prototype of PAQS. Special emphasis has been put on modelling binding experiments and experiment evaluations, and the corresponding data types have been used to show how data analysis could be performed by means of foreign functions of the mediator. The thesis describes the mediator/wrapper approach and gives examples of other systems that use this architecture for integrating life science data, both research prototypes and commercial systems. It also gives introductions to the proteo-chemometric approach to drug design, to some general database concepts, and to information integration by means of database systems.
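As a rough illustration of the mediator/wrapper idea mentioned in the abstract, the following minimal Python sketch puts two differently formatted sources behind a common record type and queries them through one mediator. It is not the Amos II API or the PAQS schema. Every class name, field name, and the nM unit is a hypothetical chosen for the example, and the sample values are made-up placeholders.

```python
import csv
import io
import xml.etree.ElementTree as ET
from dataclasses import dataclass
from typing import Callable, Iterable, Iterator, List


@dataclass(frozen=True)
class BindingRecord:
    """Hypothetical common (mediated) view of one binding measurement."""
    ligand: str
    receptor: str
    ki_nm: float  # affinity value; the nM unit is an assumption


class CsvWrapper:
    """Wrapper for a source that delivers comma-separated text."""

    def __init__(self, text: str) -> None:
        self._text = text

    def records(self) -> Iterator[BindingRecord]:
        for row in csv.DictReader(io.StringIO(self._text)):
            yield BindingRecord(row["ligand"], row["receptor"], float(row["ki_nm"]))


class XmlWrapper:
    """Wrapper for a source that delivers XML."""

    def __init__(self, text: str) -> None:
        self._text = text

    def records(self) -> Iterator[BindingRecord]:
        root = ET.fromstring(self._text)
        for node in root.iter("binding"):
            yield BindingRecord(
                node.get("ligand"), node.get("receptor"), float(node.findtext("ki"))
            )


class Mediator:
    """Answers one query over all wrapped sources using the common view."""

    def __init__(self, wrappers: Iterable) -> None:
        self._wrappers = list(wrappers)

    def query(self, predicate: Callable[[BindingRecord], bool]) -> List[BindingRecord]:
        return [r for w in self._wrappers for r in w.records() if predicate(r)]


# Placeholder data, invented purely for illustration.
CSV_SOURCE = "ligand,receptor,ki_nm\nligand_A,receptor_X,12.5\nligand_B,receptor_Y,300.0\n"
XML_SOURCE = (
    "<bindings><binding ligand='ligand_C' receptor='receptor_X'>"
    "<ki>4.0</ki></binding></bindings>"
)

mediator = Mediator([CsvWrapper(CSV_SOURCE), XmlWrapper(XML_SOURCE)])
hits = mediator.query(lambda r: r.receptor == "receptor_X" and r.ki_nm < 50)
for hit in hits:
    print(hit)
```

The caller writes the query once against `BindingRecord`, and each wrapper hides its own source format. A web-form source would presumably need a wrapper that can only answer the restricted queries the form allows, which is the limitation the abstract points out.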
Similar resources
Completing CAD Data Queries for Visualization
A system has been developed that permits database queries over data extracted from a CAD system, where the query result is returned to the CAD system for visualization and analysis. This poses several challenges. First, CAD data representations use complex object-oriented schemas, so the query language must be object-oriented too. Second, the query system resides outside the CAD system and must theref...
Choosing the most suitable query language for using outer joins to extract data in Datalog mode in the deductive database system DES
Deductive database systems are designed around a logical data model. Unlike a relational database management system (RDBMS), in which data are stored in tables, a deductive database system stores data as facts. Datalog Educational System (DES) is a deductive database system whose default mode is Datalog. It can extract data to use outer joins with three query la...
COMPASS: A Concept-based Web Search Engine for HTML, XML, and Deep Web Data
Today’s web search engines are still following the paradigm of keyword-based search. Although this is the best choice for large scale search engines in terms of throughput and scalability, it inherently limits the ability to accomplish more meaningful query tasks. XML query engines (e.g., based on XQuery or XPath), on the other hand, have powerful query capabilities; but at the same time their ...
Analysis of User query refinement behavior based on semantic features: user log analysis of Ganj database (IranDoc)
Background and Aim: Information systems cannot be well designed or developed without a clear understanding of users’ needs and of how they seek and evaluate information. This research was designed to analyze the query refinement behaviors of users of Ganj (the Iranian Research Institute of Science and Technology database) via log analysis. Methods: The method of this research is log anal...
Developing and Accessing Scientific Databases with the Object-Protocol (OPM) Data Management Tools
The Object-Protocol Model (OPM) data management tools provide facilities for rapid development, documentation, and flexible exploration of scientific databases. The tools are based on OPM, an object-oriented data model which is similar to the ODMG standard, but also supports extensions for modeling scientific data [4]. Databases designed using OPM can be implemented using a variety of commerci...