TKUS: Mining top-k high utility sequential patterns

نویسندگان

چکیده

High-utility sequential pattern mining (HUSPM) has recently emerged as a focus of intense research interest. The main task HUSPM is to find all subsequences, within quantitative database, that have high utility with respect user-defined minimum threshold. However, it difficult specify the threshold, especially when database features, which are invisible in most cases, not understood. To handle this problem, top-k was proposed. Up now, only very preliminary work been conducted capture HUSPs, and existing strategies require improvement terms running time, memory consumption, unpromising candidate filtering, scalability. Moreover, no systematic problem statement defined. In paper, we formulate propose novel algorithm called TKUS. improve efficiency, TKUS adopts projection local search mechanism employs several schemes, including Sequence Utility Raising, Terminate Descendants Early, Eliminate Unpromising Items strategies, allow greatly reduce space. Finally, experimental results demonstrate can achieve sufficiently good performance compared state-of-the-art TKHUS-Span.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Mining top-k high utility patterns over data streams

Online high utility itemset mining over data streams has been studied recently. However, the existing methods are not designed for producing topk patterns. Since there could be a large number of high utility patterns, finding only top-k patterns is more attractive than producing all the patterns whose utility is above a threshold. A challenge with finding top-k high utility itemsets over data s...

متن کامل

Mining Top-K Sequential Rules

Mining sequential rules requires specifying parameters that are often difficult to set (the minimal confidence and minimal support). Depending on the choice of these parameters, current algorithms can become very slow and generate an extremely large amount of results or generate too few results, omitting valuable information. This is a serious problem because in practice users have limited reso...

متن کامل

TKS: Efficient Mining of Top-K Sequential Patterns

Sequential pattern mining is a well-studied data mining task with wide applications. However, fine-tuning the minsup parameter of sequential pattern mining algorithms to generate enough patterns is difficult and timeconsuming. To address this issue, the task of top-k sequential pattern mining has been defined, where k is the number of sequential patterns to be found, and is set by the user. In ...

متن کامل

Mining High Utility Mobile Sequential Patterns in Mobile Commerce Environments

Mining user behaviors in mobile environments is an emerging and important topic in data mining fields. Previous researches have combined moving paths and purchase transactions to find mobile sequential patterns. However, these patterns cannot reflect actual profits of items in transaction databases. In this work, we explore a new problem of mining high utility mobile sequential patterns by inte...

متن کامل

Efficient Mining of High Utility Sequential Patterns Over Data Streams

High utility sequential pattern mining has emerged as an important topic in data mining. Although several preliminary works have been conducted on this topic, the existing studies mainly focus on mining high utility sequential patterns (HUSPs) in static databases and do not consider the streaming data. Mining HUSPs over data streams is very desirable for many applications. However, addressing t...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Information Sciences

سال: 2021

ISSN: ['0020-0255', '1872-6291']

DOI: https://doi.org/10.1016/j.ins.2021.04.035