Olap aggregation function for textual data warehouse
نویسندگان
چکیده
For more than a decade, OLAP and multidimensional analysis have generated methodologies, tools and resource management systems for the analysis of numeric data. With the growing availability of semistructured data there is a need for incorporating text-rich document data in a data warehouse and providing adapted multidimensional analysis. This paper presents a new aggregation function for keywords allowing the aggregation of textual data in OLAP environments as traditional arithmetic functions would do on numeric data. The AVG_KW function uses an ontology to join keywords into a more common keyword.
منابع مشابه
OLAP textual aggregation approach using the Google similarity distance
Data warehousing and On-Line Analytical Processing (OLAP) are essential elements to decision support. In the case of textual data, decision support requires new tools, mainly textual aggregation functions, for better and faster high level analysis and decision making. Such tools will provide textual measures to users who wish to analyse documents online. In this paper, we propose a new aggregat...
متن کاملTop_Keyword: An Aggregation Function for Textual Document OLAP
For more than a decade, researches on OLAP and multidimensional databases have generated methodologies, tools and resource management systems for the analysis of numeric data. With the growing availability of digital documents, there is a need for incorporating text-rich documents within multidimensional databases as well as an adapted framework for their analysis. This paper presents a new agg...
متن کاملContent aggregation in natural language hypertext summarization of OLAP and Data Mining Discoveries
We present a new approach to paratactic content aggregation in the context of generating hypertext summaries of OLAP and data mining discoveries. Two key properties make this approach innovative and interesting: (1) it encapsulates aggregation inside the sentence planning component, and (2) it relies on a domain independent algorithm working on a data structure that abstracts from lexical and s...
متن کاملData Warehouse Performance Optimization Implementing DHE Algorithm in Mortgage Backed Security using Mondrian
OLAP (Online Analytical Processing) means analyzing large quantities of data in real-time. It requires massive amount of processing time to extract information from data warehouse cubes. Business requires online reporting\information these days even for historical data that spans years if not decades. Data warehousing helps in making the retrieval of that data easier by aggregating large datase...
متن کاملActive Data Warehouses: Complementing OLAP with Active Rules
Conventional data warehouses are passive. All tasks related to analysing data and making decisions must be carried out manually by analysts. Today's data warehouse and OLAP systems o er little support to automatize decision tasks that occur frequently and for which well established decision procedures are available. Such a functionality can be provided by extending the conventional data warehou...
متن کامل