Positive Borders or Negative Borders: How to Make Lossless Generator Based Representations Concise

Authors

  • Guimei Liu
  • Jinyan Li
  • Limsoon Wong
  • Wynne Hsu
Abstract

A complete set of frequent itemsets can get undesirably large due to redundancy. Several representations have been proposed to eliminate the redundancy. Existing generator based representations rely on a negative border to make the representation lossless. However, negative borders of generators are often very large. The number of itemsets on a negative border sometimes even exceeds the total number of frequent itemsets. In this paper, we propose to use a positive border together with frequent generators to form a lossless representation. A set of frequent generators plus its positive border is always no larger than the corresponding complete set of frequent itemsets, thus it is a true concise representation. The generalized form of this representation is also proposed. We develop an efficient algorithm, called GrGrowth, to mine generators and positive borders as well as their generalizations.
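The relationship between frequent generators and the two kinds of border can be illustrated on a toy database. The sketch below is a brute-force enumeration, not the GrGrowth algorithm, and it rests on definitions that the abstract does not spell out and that are assumed here: a generator is an itemset with no proper subset of equal support; the positive border holds the frequent non-generators whose proper subsets are all generators; the negative border holds the infrequent itemsets whose proper subsets are all frequent generators. The function name `mine_borders`, the sample `TRANSACTIONS`, and the choice to count the empty itemset are hypothetical conventions of this example.

```python
from itertools import combinations

# Hypothetical toy database: four transactions over the items a, b, c.
TRANSACTIONS = [frozenset("abc"), frozenset("abc"), frozenset("ab"), frozenset("a")]


def mine_borders(transactions, min_sup):
    """Brute-force sketch: enumerate every itemset, then classify it.

    Returns (frequent, generators, positive_border, negative_border).
    Exponential in the number of items, so only suitable for toy data.
    """
    items = sorted(set().union(*transactions))

    # Support of every possible itemset, including the empty one.
    support = {}
    for k in range(len(items) + 1):
        for combo in combinations(items, k):
            itemset = frozenset(combo)
            support[itemset] = sum(1 for t in transactions if itemset <= t)

    frequent = {s for s, count in support.items() if count >= min_sup}

    # Assumed definition: no proper subset has the same support. Support never
    # increases when items are added, so checking the subsets that drop exactly
    # one item is sufficient.
    def is_generator(s):
        return all(support[s - {i}] > support[s] for i in s)

    generators = {s for s in frequent if is_generator(s)}

    # Frequent generators are downward closed, so again only the subsets that
    # drop one item need to be checked.
    def subsets_are_frequent_generators(s):
        return all((s - {i}) in generators for i in s)

    # Assumed definition: frequent, not a generator, all proper subsets generators.
    positive_border = {
        s for s in frequent - generators if subsets_are_frequent_generators(s)
    }
    # Assumed definition: infrequent, all proper subsets frequent generators.
    negative_border = {
        s for s in set(support) - frequent if subsets_are_frequent_generators(s)
    }
    return frequent, generators, positive_border, negative_border
```

Calling `mine_borders(TRANSACTIONS, 2)` on this data gives eight frequent itemsets, three generators (the empty set, {b}, {c}) and a positive border of two itemsets ({a} and {b, c}). Because the generators and the positive border are disjoint subsets of the frequent itemsets, their combined size cannot exceed the size of the complete set, which is the size bound stated in the abstract.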

Similar resources

A New Concise and Lossless Representation of Frequent Itemsets Using Generators and A Positive Border

A complete set of frequent itemsets can get undesirably large due to redundancy when the minimum support threshold is low or when the database is dense. Several concise representations have been proposed to eliminate the redundancy. Existing generator based representations rely on a negative border to make the representation lossless. However, negative borders of generators are often very large...

Emerging Cubes: Borders, size estimations and lossless reductions

Discovering trend reversals between two data cubes provides users with a novel and interesting knowledge when the real world context fluctuates: What is new? Which trends appear or emerge? Which tendencies are immersing or disappear? With the concept of Emerging Cube, we capture such trend reversals by enforcing an emergence constraint. We resume the classical borders for the Emerging Cube and ...

Negative Generator Border for Effective Pattern Maintenance

In this paper, we study the maintenance of frequent patterns in the context of the generator representation. The generator representation is a concise and lossless representation of frequent patterns. We effectively maintain the generator representation by systematically expanding its Negative Generator Border. According to our literature review, no prior work has studied the maintenance of the...

Exploring the Disjunctive Search Space towards Discovering New Exact Concise Representations for Frequent Patterns

Extracting concise representations seems to be a milestone towards the emerging knowledge extraction field. In fact, it is a quite survival reflex towards providing a manageably-sized and reliable knowledge. Thus, we bashfully witness the emergence of a trend towards extracting concise representations, e.g., closed patterns, non-derivable patterns and essential patterns. The essential pattern-b...

Non-Derivable Item Set and Non-Derivable Literal Set Representations of Patterns Admitting Negation

The discovery of frequent patterns has attracted a lot of attention of the data mining community. While an extensive research has been carried out for discovering positive patterns, little has been offered for discovering patterns with negation. The main hindrance to the progress of such research is huge amount of frequent patterns with negation, which exceeds the number of frequent positive pa...

Publication date: 2006