From Personal Desktops to Personal Dataspaces: A Report on Building the iMeMex Personal Dataspace Management System

Authors

  • Jens Dittrich
  • Lukas Blunschi
  • Markus Färber
  • Olivier René Girard
  • Shant Kirakos Karakashian
  • Marcos Antonio Vaz Salles
Abstract

We propose a new system that is able to handle the entire Personal Dataspace of a user. A Personal Dataspace includes all data pertaining to a user on all his disks and on remote servers such as network drives, email and web servers. This data is represented by a heterogeneous mix of files, emails, bookmarks, music, pictures, calendar data, personal information streams and so on. State-of-the-art tools such as desktop search engines and desktop operating systems (including the upcoming Vista) are not enough as they neither solve the problem of physical personal information independence (where is my data) nor format and data model independence (how is it stored and which application do I have to use in order to access that data). Our work builds on the visions presented in [DSKB05], which calls for a single system to manage the personal information jungle, and [FHM05], which advocates dataspaces as a new abstraction for information management. In contrast to [FHM05], this paper presents a concrete implementation of a Personal DataSpace Management System (PDSMS) termed iMeMex: integrated memex. We discuss the core architecture of iMeMex and the services offered by our system. As we will show, a PDSMS can be seen as a system that occupies the middle ground between a search engine, a database management system, and a traditional information integration system. A PDSMS has to bridge these separate worlds and requires: (1) no full control over data, i.e., data may be accessed bypassing the interfaces of a PDSMS, (2) simple keyword search on all data available in a dataspace without performing any semantic data integration, (3) rich querying able to mix structural, attribute, and content predicates, (4) pay-as-you-go integration capabilities, (5) the ability to define arbitrary logical views on all data, (6) durability and consistency guarantees to avoid loss of data assigned to a dataspace, and (7) update capabilities.
iMeMex is the first implementation of a PDSMS we are aware of. This paper presents the architecture of iMeMex and reports on the current state of the iMeMex research project at ETH Zurich.

∗ This work is partially supported by the Swiss National Science Foundation (SNF) under contract 200021112115.
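
Requirements (2) and (3) in the abstract can be made concrete with a small sketch. The Python below is purely illustrative and is not the iMeMex API or query language: the `Item` class, the predicate helpers `under`, `attr_equals`, and `contains`, and the sample data are all hypothetical names chosen for this example. It assumes each dataspace item has a path (structure), a dictionary of attributes, and text content, and shows how a plain keyword search and a query mixing all three predicate types might run over the same heterogeneous collection.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List

# Hypothetical item model: NOT the iMeMex data model, just an assumption
# that every item exposes structure (path), attributes, and content.
@dataclass
class Item:
    path: str
    attributes: Dict[str, str] = field(default_factory=dict)
    content: str = ""

Predicate = Callable[[Item], bool]

def under(prefix: str) -> Predicate:
    """Structural predicate: item lies below the given path prefix."""
    normalized = prefix.rstrip("/") + "/"
    return lambda item: item.path.startswith(normalized)

def attr_equals(name: str, value: str) -> Predicate:
    """Attribute predicate: item has the attribute with exactly this value."""
    return lambda item: item.attributes.get(name) == value

def contains(keyword: str) -> Predicate:
    """Content predicate: case-insensitive keyword match on the content."""
    lowered = keyword.lower()
    return lambda item: lowered in item.content.lower()

def query(items: List[Item], *predicates: Predicate) -> List[Item]:
    """Return the items that satisfy every predicate (logical AND)."""
    return [item for item in items if all(p(item) for p in predicates)]

# Made-up sample data mixing emails and files.
ITEMS = [
    Item("/mail/inbox/msg1", {"type": "email", "from": "alice"},
         "Draft of the dataspace paper attached"),
    Item("/files/papers/draft.tex", {"type": "file"},
         "A dataspace management system ..."),
    Item("/mail/inbox/msg2", {"type": "email", "from": "bob"},
         "Lunch tomorrow?"),
]

# Requirement (2): keyword search across all items, no integration needed.
keyword_hits = query(ITEMS, contains("dataspace"))

# Requirement (3): structural + attribute + content predicates combined.
mixed_hits = query(
    ITEMS,
    under("/mail"),
    attr_equals("type", "email"),
    contains("dataspace"),
)
```

Treating predicates as plain functions keeps the keyword-only case and the mixed case on one code path; a real system would instead evaluate such queries against indexes rather than scanning every item.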

Related resources

iMeMex: From Search to Information Integration and Back

In this paper, we report on lessons learned while building iMeMex, the first incarnation of a Personal Dataspace Management System (PDSMS). In contrast to traditional keyword search engines, users may not only search their collections with iMeMex but also semantically integrate them over time. As a consequence, the system may improve on precision and recall of queries in a pay-as-you-go fashion...

A Dataspace Odyssey: The iMeMex Personal Dataspace Management System (Demo)

A Personal Dataspace includes all data pertaining to a user on all his local disks and on remote servers such as network drives, email and web servers. This data is represented by a heterogeneous mix of files, emails, bookmarks, music, pictures, calendar, personal information streams and so on. We demonstrate a new breed of system that is able to handle the entire Personal Dataspace of a user. ...

Understanding Personal Data as a Space - Learning from Dataspaces to Create Linked Personal Data

In this paper we argue that the space of personal data is a dataspace as defined by Franklin et al. We define a personal dataspace as the space of all personal data belonging to a user, and we describe the logical components of the dataspace. We describe a Personal Dataspace Support Platform (PDSP) as a set of services to provide a unified view over the user’s data, and to enable new and more ...

Principles of Dataspaces

This seminar paper introduces a concept in the area of data management called dataspaces. The goal of this concept is to offer a way to handle multiple data sources with different models as an answer to the rapidly increasing demand for working with such data. Management challenges like providing search/query capability, integrity constraints, naming conventions, recovery, and access control aris...

Research on Operation-based Correlation in Personal Dataspace

Operation of user was defined. The weight of operation was expressed. The variable quantity of user behavior was computed by weight. 3-ary vector data definition was expanded. Data item was defined by 4-ary vector in personal dataspace. Correlation of data for user was defined by weight. Current weight of data was defined by initial weight and variable quantity of user operation. A library data...

Publication date: 2007