TEL-AVIV UNIVERSITY RAYMOND AND BEVERLY SACKLER FACULTY OF EXACT SCIENCES BLAVATNIK SCHOOL OF COMPUTER SCIENCE Coresets and Their Applications

نویسندگان

  • Dan Feldman
  • Micha Sharir
چکیده

In this thesis we investigate the construction and applications of coresets (small sets which approximately represent much larger input sets, in term of various objective measures) to several problems in geometric optimization. Bi-criteria approximation algorithms We consider the problem of approximating a set P of n points inR by a collection of k j-dimensional flats, and extensions thereof, under the median / mean / center measures, in which we wish to minimize, respectively, the sum of the Euclidean distances from each point of P to its nearest flat, the sum of the squares of these distances, and the maximum such distance. Problems of this kind belong to the area of projective clustering. Such problems cannot be approximated in polynomial time, for every approximation factor, unless P=NP but do allow bi-criteria approximations, where one allows some leeway in both the number of flats and the quality of the objective function. We give a very simple randomized bi-criteria approximation algorithm, which produces, with high probability, at most α(k, j, n) = log n · (jk log log n) flats, which exceeds the optimal objective value for any k jdimensional flats by a factor of no more than β(j) = 2. We use this bi-criteria approximation in the construction of coresets for projective clustering; see Chapter 4. Our bi-criteria algorithm has many advantages over previous work, in that it is much more widely applicable (wider set of objective functions and classes of clusters) and much more efficient — reducing the running time bound from O(n) to O(dn) · (jk). We give full details of this work in Chapter 3. A preliminary version has appeared in [FFSS07]; Since the publication of [FFSS07] in 2007 it has been cited and used in subsequent work [FL08, FFKN09, FMSW10].

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Determining the configuration of macromolecular assembly components based on cryoEM density fitting and pairwise geometric complementarity

School of Computer Science, Raymond and Beverly Sackler Faculty of Exact Sciences, Tel Aviv University, Tel Aviv 69978, Israel Department of Biopharmaceutical Sciences and Pharmaceutical Chemistry, and California Institute for Quantitative Biomedical Research, University of California at San Francisco, San Francisco, CA 94158, USA School of Crystallography, Birkbeck College University of London...

متن کامل

Practically Stabilizing Atomic Memory

A self-stabilizing simulation of a single-writer multi-reader atomic register is presented. The simulation works in asynchronous message-passing systems, and allows processes to crash, as long as at least a majority of them remain working. A key element in the simulation is a new combinatorial construction of a bounded labeling scheme that can accommodate arbitrary labels, i.e., including those...

متن کامل

TEL-AVIV UNIVERSITY RAYMOND AND BEVERLY SACKLER FACULTY OF EXACT SCIENCES BLAVATNIK SCHOOL OF COMPUTER SCIENCE The 2-Center Problem in Three Dimensions

Let P be a set of n points in three dimensions. The 2-center problem for P is to find two closed congruent balls with smallest radius which cover P . We present a randomized algorithm for solving the problem in O ( 1 (1−r∗/r0)3n 2 log n ) expected time, where r∗ is the common radius of the optimal solution balls and r0 is the radius of the smallest enclosing ball of P . This improves the naive ...

متن کامل

TEL-AVIV UNIVERSITY RAYMOND AND BEVERLY SACKLER FACULTY OF EXACT SCIENCES BLAVATNIK SCHOOL OF COMPUTER SCIENCE Counting Triangulations of Planar Point Sets

We study the maximal number of triangulations that a planar set of n points can have, and show that it is at most 30n. This new bound is achieved by a careful optimization of the charging scheme of Sharir and Welzl (2006), which has led to previous best upper bound of 43n for the problem. Moreover, this new bound is useful for bounding the number of other types of planar (i.e., crossing free) s...

متن کامل

HEMOSTASIS, THROMBOSIS, AND VASCULAR BIOLOGY Reduced incidence of ischemic stroke in patients with severe factor XI deficiency

1The Amalia Biron Research Institute of Thrombosis and Hemostasis, Sheba Medical Center, Tel Hashomer and Sackler Faculty of Medicine, Tel Aviv University, Tel Aviv; 2Department of Statistics and Operations Research, Raymond and Beverly Sackler Faculty of Exact Sciences, Tel Aviv University, Tel Aviv; 3Division of Epidemiology and Preventive Medicine, Sackler Faculty of Medicine, Tel Aviv Unive...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012