Chapter.i, " Combining Data Warehousing and Data Mining Techniques for Web Log Analysis "
نویسندگان
چکیده
In enterprises, a large volume of data has been collected and stored in data warehouses. Advances in data gathering, storage, and distribution have created a need for integrating data warehousing and data mining techniques. Mining data warehouses raises unique issues and requires special attention. Data warehousing and data mining are interrelated , and require holistic techniques from the two disciplines. The " Advanced Topics in Data Warehousing and Mining " series comes into place to address some issues related to mining data warehouses. To start this series, this volume 1, includes 12 chapters in four sections, contributed by authors and editorial board members from the International Journal of Data Warehousing and Mining. Section I, on Data Warehousing and Mining, consists of three chapters covering data mining techniques applied to data warehouse Web logs, data cubes, and high-dimensional datasets. brings together data warehousing and data mining by focusing on data that has been collected in Web server logs. This data will only be useful if high-level knowledge about user navigation patterns can be analyzed and extracted. There are several approaches to analyze Web logs. They propose a hybrid method that combines data warehouse Web log schemas and a data mining technique called Hyper Probabilistic Grammars, resulting in a fast and flexible Web log analysis. Further enhancement to this hybrid method is also outlined.
منابع مشابه
I Combining . Data . Warehousing . and . Data . Mining . Techniques . for . Web . Log . Analysis
Enormous amounts of information about Web site user behavior are collected in Web server logs. However, this information is only useful if it can be queried and analyzed to provide high-level knowledge about user navigation patterns, a task that requires powerful techniques.This chapter presents a number of approaches that combine data warehousing and data mining techniques in order to analyze ...
متن کاملWeb Usage Mining with Web Logs
With the rapid growth of the World Wide Web, the use of automated Web-mining techniques to discover useful and relevant information has become increasingly important. One challenging direction is Web usage mining, wherein one attempts to discover user navigation patterns of Web usage from Web access logs. Properly exploited, the information obtained from Web usage log can assist us to improve t...
متن کاملData Mining for Intelligent Web Caching
The paper presents a vertical application of data warehousing and data mining technology: intelligent web caching. We introduce several ways to construct intelligent web caching algorithms that employ predictive models of web requests; the general idea is to extend the LRU policy of web and proxy servers by making it sensible to web access models extracted from web log data using data mining te...
متن کاملEnhancing Web Search through Query Log Mining
INTRODUCTION Web query log is a type of file keeping track of the activities of the users who are utilizing a search engine. Compared to traditional information retrieval setting in which documents are the only information source available, query logs are an additional information source in the Web search setting. Based on query logs, a set of Web mining techniques, such as log-based query clus...
متن کاملWarehousing complex data from the web
The data warehousing and OLAP technologies are now moving onto handling complex data that mostly originate from the Web. However, intagrating such data into a decision-support process requires their representation under a form processable by OLAP and/or data mining techniques. We present in this paper a complex data warehousing methodology that exploits XML as a pivot language. Our approach inc...
متن کامل