Big Data Mining using Map Reduce: A Survey Paper
Author
Abstract
Big data is large-volume, heterogeneous, distributed data. In big data applications where data collection grows continuously, such as weather forecasting, electricity demand and supply, and social media, it is expensive to capture, manage, extract, and process data using existing software tools. As the size of the data in a data warehouse increases, data analysis also becomes expensive. The data cube is commonly used for abstracting and summarizing databases: it is a way of structuring data along n dimensions for analysis over some measure of interest. Big data processing frameworks rely on clusters of computers and the parallel execution framework provided by MapReduce. This paper surveys the extension of cube computation techniques to this paradigm. MR-Cube is a MapReduce-based framework for cube materialization and mining over massive datasets using holistic measures. MR-Cube efficiently computes cubes with holistic measures over billion-tuple datasets.
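As a rough illustration of cube materialization in the map/reduce style, the following minimal sketch runs in a single process and uses the median as an example of a holistic measure (one that cannot be assembled from partial aggregates). It is a naive formulation, not the MR-Cube algorithm itself, and it does not reproduce any of MR-Cube's optimizations. The dimension names `city` and `month`, the measure `temp`, and all function names are assumptions made for this example.

```python
from collections import defaultdict
from itertools import combinations
from statistics import median

# Hypothetical dimension names, chosen only for this illustration.
DIMENSIONS = ("city", "month")


def map_record(record):
    """Map step: emit one (cube group, measure value) pair per cuboid."""
    for size in range(len(DIMENSIONS) + 1):
        for dims in combinations(DIMENSIONS, size):
            # Dimensions left out of the cuboid are rolled up to "*".
            key = tuple(record[d] if d in dims else "*" for d in DIMENSIONS)
            yield key, record["temp"]


def shuffle(pairs):
    """Shuffle step: collect all values that share the same cube group."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_group(values):
    """Reduce step: the median needs every value of the group at once."""
    return median(values)


def compute_cube(records):
    """Materialize the full cube as a dict from cube group to median."""
    pairs = (pair for record in records for pair in map_record(record))
    return {key: reduce_group(vals) for key, vals in shuffle(pairs).items()}


records = [
    {"city": "Pune", "month": "Jan", "temp": 20},
    {"city": "Pune", "month": "Feb", "temp": 24},
    {"city": "Delhi", "month": "Jan", "temp": 14},
]
cube = compute_cube(records)
for group, value in sorted(cube.items()):
    print(group, value)
```

In this naive form every record is emitted once per cuboid (2^n pairs for n dimensions), and a reducer must hold all values of a group in memory to compute a holistic measure. Those two costs are what make cube computation over very large datasets difficult.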
Similar resources
A Survey on Accessing Data over Cloud Environment using Data mining Algorithms
Accessing large data sets is increasingly complex because the data may be structured or unstructured (text, images, videos, etc.) and cannot be controlled by internet users; such data is known as Big Data. Useful information can be extracted from big data with the help of data mining algorithms. Data mining is a technique for determining patterns;...
Algorithms Using Map Reduce-a Survey
Data volumes are increasing much faster than compute power, and this growth demands new strategies for processing and analyzing information. Organizations are finding that significant forecasting can be achieved by sorting and analyzing Big Data. Since a large amount of data is "amorphous", it should be structured in a manner appropriate for mining and subsequent analysis. Hadoop he...
A Survey on Parallel Rough Set Based Knowledge Acquisition Using MapReduce from Big Data
Nowadays, the volume of data is growing at an unprecedented rate, and big data mining and knowledge discovery have become a new challenge in the era of data mining and machine learning. Rough set theory for knowledge acquisition has been successfully applied in data mining. The MapReduce technique has received more attention from the scientific community as well as industry for its applicability in big d...
High Performance clustering for Big Data Mining using Hadoop
Nowadays, organizations across the public and private sectors have made a deliberate decision to turn big data into competitive advantage. The motivation and challenge of extracting value from big data is similar in many ways to the age-old problem of distilling business intelligence from transactional data. Hadoop is a rapidly growing ecosystem of components based on big data Map Reduce algorithm a...
Market Basket Analysis Algorithm on Map/Reduce in AWS EC2
As the web, social networking, and smartphone applications have become popular, data has grown drastically every day. Such data is called Big Data. Google encountered Big Data earlier than others and recognized the importance of the storage and computation of Big Data. Thus, Google implemented its parallel computing platform with the Map/Reduce approach on the Google File System (GFS) in ord...