A Schema Integration Approach for Big Data Analysis

نویسندگان

چکیده

A huge volume of data is analyzed by organizations to understand their clients and improve services. In many cases, these are stored separately in different database systems need be integrated before being used analysis tools or prediction applications. One the main tasks integration process definition global schema. Defining a schema context NoSQL demanding task since it necessitates dealing with variety issues, including lack local schemas, model heterogeneity, semantic heterogeneity. To address challenges, this work aims automatically define set databases heterogeneous systems. The contributions presented three phases: (1) Schema extraction where we schemas using unified representation. (2) matching which propose hybrid approach find attributes between schemas. (3) results. Covid-19 use case as well other benchmarks paper evaluate results proposed illustrate its effectiveness.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Fuzzy TOPSIS Approach for Big Data Analytics Platform Selection

Big data sizes are constantly increasing. Big data analytics is where advanced analytic techniques are applied on big data sets. Analytics based on large data samples reveals and leverages business change. The popularity of big data analytics platforms, which are often available as open-source, has not remained unnoticed by big companies. Google uses MapReduce for PageRank and inverted indexes....

متن کامل

a new approach to credibility premium for zero-inflated poisson models for panel data

هدف اصلی از این تحقیق به دست آوردن و مقایسه حق بیمه باورمندی در مدل های شمارشی گزارش نشده برای داده های طولی می باشد. در این تحقیق حق بیمه های پبش گویی بر اساس توابع ضرر مربع خطا و نمایی محاسبه شده و با هم مقایسه می شود. تمایل به گرفتن پاداش و جایزه یکی از دلایل مهم برای گزارش ندادن تصادفات می باشد و افراد برای استفاده از تخفیف اغلب از گزارش تصادفات با هزینه پائین خودداری می کنند، در این تحقیق ...

15 صفحه اول

Data Integration Schema Analysis: An Approach With Information Quality

Integrated access to distributed data is an important problem faced in many scientific and commercial applications. A data integration system provides a unified view for users to submit queries over multiple autonomous data sources. The queries are processed over a global schema that offers an integrated view of the data sources. Much work has been done on query processing and choosing plans un...

متن کامل

Semantics for Big Data Integration and Analysis

Much of the focus on big data has been on the problem of processing very large sources. There is an equally hard problem of how to normalize, integrate, and transform the data from many sources into the format required to run large-scale analysis and visualization tools. We have previously developed an approach to semi-automatically mapping diverse sources into a shared domain ontology so that ...

متن کامل

XML Data Transformation and Integration — A Schema Transformation Approach

The process of transforming and integrating XML data involves resolving the syntactic, semantic and schematic heterogeneities that the data sources present. Moreover, there are a number of different application settings in which such a process could take place, such as centralised or peer-to-peer settings, each of which needs to be considered separately. In this thesis, we investigate the probl...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Ingénierie Des Systèmes D'information

سال: 2023

ISSN: ['1633-1311', '2116-7125']

DOI: https://doi.org/10.18280/isi.280207