OLAP aggregation function for textual data warehouse

Authors

  • Franck Ravat
  • Olivier Teste
  • Ronan Tournier
Abstract

For more than a decade, OLAP and multidimensional analysis have generated methodologies, tools and resource management systems for the analysis of numeric data. With the growing availability of semi-structured data, there is a need to incorporate text-rich document data into a data warehouse and to provide multidimensional analysis adapted to it. This paper presents a new aggregation function for keywords, which allows textual data to be aggregated in OLAP environments just as traditional arithmetic functions aggregate numeric data. The AVG_KW function uses an ontology to join several keywords into a single, more general keyword.
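The abstract does not spell out how AVG_KW selects the more general keyword, so the following is only a minimal sketch of one plausible reading: the ontology is assumed to be a tree, and the aggregate is taken to be the most specific ancestor shared by all input keywords. The `PARENT` map, its terms, and the names `path_to_root` and `avg_kw_sketch` are hypothetical and do not come from the paper.

```python
from typing import Dict, Iterable, List, Optional

# Hypothetical toy ontology (an assumption, not from the paper):
# each keyword maps to its more general parent; the root maps to None.
PARENT: Dict[str, Optional[str]] = {
    "topic": None,
    "database": "topic",
    "olap": "database",
    "data warehouse": "database",
    "text mining": "topic",
    "xml": "text mining",
}


def path_to_root(keyword: str, parent: Dict[str, Optional[str]]) -> List[str]:
    """Return the keyword followed by its ancestors, ending at the root."""
    path: List[str] = []
    node: Optional[str] = keyword
    while node is not None:
        path.append(node)
        node = parent[node]  # raises KeyError for unknown keywords
    return path


def avg_kw_sketch(
    keywords: Iterable[str],
    parent: Dict[str, Optional[str]] = PARENT,
) -> str:
    """Return the most specific keyword that generalizes every input.

    Assumption: the aggregate is the lowest common ancestor in a
    tree-shaped ontology. The paper's actual rule may differ.
    """
    kw_list = list(keywords)
    if not kw_list:
        raise ValueError("need at least one keyword")
    first_path = path_to_root(kw_list[0], parent)
    other_ancestors = [set(path_to_root(k, parent)) for k in kw_list[1:]]
    # Walk upward from the first keyword; the first node that is also an
    # ancestor of all other keywords is the lowest common ancestor.
    for candidate in first_path:
        if all(candidate in ancestors for ancestors in other_ancestors):
            return candidate
    raise ValueError("keywords share no common ancestor")
```

Under these assumptions, `avg_kw_sketch(["olap", "data warehouse"])` returns `"database"`, while `avg_kw_sketch(["olap", "xml"])` falls back to the root `"topic"`. This is analogous to how an arithmetic average collapses several numeric values into one representative value.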


Similar resources

OLAP textual aggregation approach using the Google similarity distance

Data warehousing and On-Line Analytical Processing (OLAP) are essential elements to decision support. In the case of textual data, decision support requires new tools, mainly textual aggregation functions, for better and faster high level analysis and decision making. Such tools will provide textual measures to users who wish to analyse documents online. In this paper, we propose a new aggregat...


Top_Keyword: An Aggregation Function for Textual Document OLAP

For more than a decade, research on OLAP and multidimensional databases has generated methodologies, tools and resource management systems for the analysis of numeric data. With the growing availability of digital documents, there is a need for incorporating text-rich documents within multidimensional databases as well as an adapted framework for their analysis. This paper presents a new agg...


Content aggregation in natural language hypertext summarization of OLAP and Data Mining Discoveries

We present a new approach to paratactic content aggregation in the context of generating hypertext summaries of OLAP and data mining discoveries. Two key properties make this approach innovative and interesting: (1) it encapsulates aggregation inside the sentence planning component, and (2) it relies on a domain independent algorithm working on a data structure that abstracts from lexical and s...


Data Warehouse Performance Optimization Implementing DHE Algorithm in Mortgage Backed Security using Mondrian

OLAP (Online Analytical Processing) means analyzing large quantities of data in real time. It requires a massive amount of processing time to extract information from data warehouse cubes. Businesses today require online reporting and information even for historical data that spans years, if not decades. Data warehousing helps in making the retrieval of that data easier by aggregating large datase...


Active Data Warehouses: Complementing OLAP with Active Rules

Conventional data warehouses are passive. All tasks related to analysing data and making decisions must be carried out manually by analysts. Today's data warehouse and OLAP systems offer little support to automatize decision tasks that occur frequently and for which well established decision procedures are available. Such a functionality can be provided by extending the conventional data warehou...




Publication date: 2007