Enhancing EpiCenter Data Quality Analytics with R
Similar resources
A Fuzzy TOPSIS Approach for Big Data Analytics Platform Selection
Big data sizes are constantly increasing. Big data analytics is where advanced analytic techniques are applied to big data sets. Analytics based on large data samples reveals and leverages business change. The popularity of big data analytics platforms, which are often available as open-source, has not remained unnoticed by big companies. Google uses MapReduce for PageRank and inverted indexes....
Feature Selection in Enterprise Analytics: A Demonstration using an R-based Data Analytics System
Enterprise applications are analyzing ever larger amounts of data using advanced analytics techniques. Recent systems from Oracle, IBM, and SAP integrate R with a data processing system to support richer advanced analytics on large data. A key step in advanced analytics applications is feature selection, which is often an iterative process that involves statistical algorithms and data manipulat...
Enhancing Data Warehouse Quality with the NFR Framework
In recent years, Data Warehouse has emerged as a powerful technology for integrating heterogeneous data into a multidimensional repository on behalf of decision-support analysis. The complex extraction, transformation and loading process involved, as well as the aggregational-intensive queries are governed by a multitude of quality factors such as integrity, accessibility, performance, and othe...
Enhancing Education Quality Assurance Using Data Mining
In this paper we introduce a comprehensive educational quality assurance system for a university. The system takes into consideration the three main pillars of the educational process: content, delivery, and assessment. We will demonstrate a comprehensive system that enables quality control and quality assurance using data mining combining data from Quality Assurance Automated System QAAS, the ...
Collaborative Data Analytics with DataHub
While there have been many solutions proposed for storing and analyzing large volumes of data, all of these solutions have limited support for collaborative data analytics, especially given that many individuals and teams are simultaneously analyzing, modifying and exchanging datasets, employing a number of heterogeneous tools or languages for data analysis, and writing scripts to clean, preproc...
Journal
Journal title: Online Journal of Public Health Informatics
Year: 2016
ISSN: 1947-2579
DOI: 10.5210/ojphi.v8i1.6591