Statistical Data Analysis of Continuous Streams Using Stream Dsms
نویسنده
چکیده
Several applications involve a transient stream of data which has to be modeled and analyzed continuously. Their continuous arrival in multiple, rapid, time-varying and possibly unpredictable and unbounded way make the analysis difficult and opens fundamentally new research problems. Examples of such data intensive applications include stock market, road traffic analysis, whether forecasting systems etc. In this study, we have used a Data Stream Management System toolStanford STREAM to model and analyze data from two different application domainsRoad Traffic analysis and Habitat Monitoring analysis. Based on the results we discuss advantages and disadvantages of STREAM.
منابع مشابه
Statistical Mining in Data Streams
Statistical Mining in Data StreamsAnkur Jain Recent years have seen a steady rise of a new class of data management systemscalled Data Stream Management Systems (DSMS). These systems manage rapid, high-volume data-streams with transient relations instead of static data with persistent rela-tions. Data streams are common to applications such as network traffic and transac-<lb...
متن کاملData Stream Management Systems
In many application fields, such as production lines or stock analysis, it is substantial to create and process high amounts of data at high rates. Such continuous data flows with unknown size and end are also called data streams. The processing and analysis of data streams are a challenge for common data management systems as they have to operate and deliver results in real time. Data Stream M...
متن کاملSTREAM: The Stanford Data Stream Management System
Traditional database management systems are best equipped to run onetime queries over finite stored data sets. However, many modern applications such as network monitoring, financial analysis, manufacturing, and sensor networks require long-running, or continuous, queries over continuous unbounded streams of data. In the STREAM project at Stanford, we are investigating data management and query...
متن کاملNetwork-aware optimization in distributed data stream management systems
The management of streaming data in distributed environments is gaining importance in many application areas such as sensor networks and e-science. This is mainly due to both, the need for immediate reactions to important events in input streams as well as the requirement to efficiently handle enormous data volumes that are generated, for example, by modern scientific experiments and observatio...
متن کاملLogical Foundations of Continuous Query Languages for Data Streams
Data Stream Management Systems (DSMS) have attracted much interest from the database community, and extensions of relational database languages were proposed for expressing continuous queries on data streams. However, while relational databases were built on the solid bedrock of logic, the same cannot be said for DSMS. Thus, a logic-based reconstruction of DSMS languages and their unique comput...
متن کامل