Efficient temporal pattern mining in big time series using mutual information
نویسندگان
چکیده
Very large time series are increasingly available from an ever wider range of IoT-enabled sensors deployed in different environments. Significant insights can be gained by mining temporal patterns these series. Unlike traditional pattern mining, (TPM) adds event intervals into extracted patterns, making them more expressive at the expense increased and space complexities. Existing TPM methods either cannot scale to datasets, or work only on pre-processed events rather than This paper presents our Frequent Temporal Pattern Mining Time Series (FTPMfTS) approach providing: (1) The end-to-end FTPMfTS process taking as input producing frequent output. (2) efficient Hierarchical Graph (HTPGM) algorithm that uses data structures for fast support confidence computation, employs effective pruning techniques significantly faster mining. (3) An approximate version HTPGM mutual information, a measure correlation, prune unpromising search space. (4) extensive experimental evaluation showing outperforms baselines runtime memory consumption, big datasets. is up two orders magnitude less consuming baselines, while retaining high accuracy.
منابع مشابه
Temporal Pattern Mining Using a Time Ontology
The analysis of temporal data has deserved a considerable attention, in particular in the analysis of time series. However, the general research on data mining seldom has focused its attention on dealing with the specific attribute – time. The discovery of temporal patterns, that reveal interesting behaviors over time, is one of such cases. In this paper, we propose a new approach to effectivel...
متن کاملRPM: Representative Pattern Mining for Efficient Time Series Classification
Time series classification is an important problem that has received a great amount of attention by researchers and practitioners in the past two decades. In this work, we propose a novel algorithm for time series classification based on the discovery of class-specific representative patterns. We define representative patterns of a class as a set of subsequences that has the greatest discrimina...
متن کاملAn Efficient Time Series Data Mining Technique
Data Mining is the process of discovering potentially valuable patterns, associations, trends, sequences and dependencies in data. Data mining techniques can discover information that many traditional business analysis and statistical techniques fail to deliver. In our study, we emphasis on the use of data mining techniques on time series, where mining techniques and tools are used in an attemp...
متن کاملMining Big Time-series Data on the Web
Online news, blogs, SNS and many other Web-based services has been attracting considerable interest for business and marketing purposes. Given a large collection of time series, such as web-click logs, online search queries, blog and review entries, how can we efficiently and effectively find typical time-series patterns? What are the major tools for mining, forecasting and outlier detection? T...
متن کاملBig Data Frequent Pattern Mining
Frequent pattern mining is an essential data mining task, with a goal of discovering knowledge in the form of repeated patterns. Many efficient pattern mining algorithms have been discovered in the last two decades, yet most do not scale to the type of data we are presented with today, the so-called “Big Data”. Scalable parallel algorithms hold the key to solving the problem in this context. In...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Proceedings of the VLDB Endowment
سال: 2021
ISSN: ['2150-8097']
DOI: https://doi.org/10.14778/3494124.3494147