Behaviour Recovery and Complicated Pattern Definition in Web Usage Mining

Authors

  • Long Wang
  • Christoph Meinel
Abstract

Data mining includes four steps: data preparation, pattern mining, pattern analysis, and pattern application. In a web environment, however, user activities are much more complex because of the complex structure of the web, so user behaviour recovery and pattern definition play a more important role in web mining than in other applications. In this paper, we give a new view on behaviour recovery and complicated pattern definition. We use several methods to recover different user behaviours, such as simple behaviour, sequence visiting, tree structure behaviour, acyclic routing behaviour, and cyclic routing behaviour. Based on the recovered behaviours, we discuss how to define complicated usage patterns. These usage patterns include constraint association rules, constraint sequential patterns, deepest access paths, shortest access paths, tree structure accessing patterns, parallel visiting patterns, and circle visiting patterns, among others. We also present experimental results on these complicated access patterns, which reveal some interesting usage behaviours.

Similar resources

Recovering Individual Accessing Behaviour from Web Logs

In this paper, we present a new view on data preparation in web usage mining. We concentrate on recovering individual usage behaviour from access records on a web site. We define five categories of individual behaviour: granular accessing behaviour, linear sequential behaviour, tree structure behaviour, acyclic routing behaviour, and cyclic routing behaviour. The algorithms for rec...

Minimizing the Repeated Database Scan Using an Efficient Frequent Pattern Mining Algorithm in Web Usage Mining

Data mining is the process of discovering new patterns and knowledge from large datasets. Web mining is the application of data mining techniques to extract useful knowledge and interesting patterns from the World Wide Web. Web data includes web documents, hyperlinks between documents, and usage logs of web sites. The web usage data captures the identity and origin of the web user along thei...

Use of Semantic Similarity and Web Usage Mining to Alleviate the Drawbacks of User-Based Collaborative Filtering Recommender Systems

One of the most famous methods for recommendation is user-based Collaborative Filtering (CF). This system compares the active user's item ratings with the historical rating records of other users to find similar users, and recommends items that seem interesting to these similar users and have not been rated by the active user. As a way of computing recommendations, the ultimate goal of the user-ba...

Mining of Users’ Access Behaviour for Frequent Sequential Pattern from Web Logs

Sequential Pattern mining is the process of applying data mining techniques to a sequential database for the purposes of discovering the correlation relationships that exist among an ordered list of events. The task of discovering frequent sequences is challenging, because the algorithm needs to process a combinatorially explosive number of possible sequences. Discovering hidden information fro...

Fuzzy Equivalent Matrix for Discovering Patterns of Web Users Navigation

The World Wide Web provides an abundance of information for Internet users and is a huge repository of web pages and links. The growth of the web is tremendous, as approximately one million pages are added daily. Web logs record users' accesses. Because of the tremendous usage of the web, web log files are growing at a faster rate and their size is becoming huge. Web data mining is the application of d...

Publication date: 2004