Optimal Repair of MDS Codes in Distributed Storage via Subspace Interference Alignment

نویسندگان

  • Viveck R. Cadambe
  • Cheng Huang
  • Syed Ali Jafar
  • Jin Li
چکیده

It is well known that an (n, k) code can be used to store information in a distributed storage system with n nodes/disks. If the storage capacity of each node/disk is normalized to one unit, the code can be used to store k units of information, where n > k. If the code used is maximum distance separable (MDS), then the storage system can tolerate up to (n−k) disk failures (erasures), since the original information can be reconstructed from any k surviving disks. The focus of this paper is the design of a systematic MDS code with the additional property that a single disk failure can be repaired with minimum repair bandwidth, i.e., with the minimum possible amount of data to be downloaded for recovery of the failed disk. Previously, a lower bound of n−1 n−k units has been established by Dimakis et. al, on the repair bandwidth for a single disk failure in an (n, k) MDS code based storage system, where each of the n disks store 1 unit of data. Recently, the existence of asymptotic codes achieving this lower bound for arbitrary (n, k) has been established by drawing connections to an asymptotic interference alignment scheme developed by Cadambe and Jafar for the interference channel. While the recent asymptotic constructions show the existence of codes achieving this lower bound in the limit of large code sizes, finite code constructions achieving this lower bound existed in previous literature only for the special (high-redundancy) scenario where k ≤ max(n/2, 3). The question of existence of finite codes for arbitrary values of (n, k) achieving the lower bound on the repair bandwidth remained open. As a main contribution of this paper, we provide the first known construction of a finite code for arbitrary (n, k), which can repair a single failed systematic disk by downloading exactly n−1 n−k units of data. The codes, which are optimally efficient in terms repair bandwidth are based on permutation matrices. We also show that our code has a simple repair property which enables efficiency, not only in terms of the amount of repair bandwidth, but also in terms of the amount of data accessed on the disk. We also generalize our permutation matrix based constructions by developing a novel framework for repair-bandwidth-optimal MDS codes based on the idea of subspace interference alignment a concept previously introduced by Suh and Tse the context of wireless cellular networks. This paper will be published, in part, in the Proceedings of IEEE Symposium on Information Theory (ISIT) 2011 [1]. 1The permutation marix based constructions of this paper have been discovered in parallel by Tamo et. al in [2]

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

On the Existence of Optimal Exact-Repair MDS Codes for Distributed Storage

The high repair cost of (n, k) Maximum Distance Separable (MDS) erasure codes has recently motivated a new class of codes, called Regenerating Codes, that optimally trade off storage cost for repair bandwidth. In this paper, we address bandwidth-optimal (n, k, d) Exact-Repair MDS codes, which allow for any failed node to be repaired exactly with access to arbitrary d survivor nodes, where k ≤ d...

متن کامل

A Non-MDS Erasure Code Scheme for Storage Applications

This paper investigates the use of redundancy and self repairing against node failures indistributed storage systems using a novel non-MDS erasure code. In replication method, accessto one replication node is adequate to reconstruct a lost node, while in MDS erasure codedsystems which are optimal in terms of redundancy-reliability tradeoff, a single node failure isrepaired after recovering the ...

متن کامل

A Novel Construction of Low-Complexity MDS Codes with Optimal Repair Capability for Distributed Storage Systems

Maximum-distance-separable (MDS) codes are a class of erasure codes that are widely adopted to enhance the reliability of distributed storage systems (DSS). In (n, k) MDS coded DSS, the original data are stored into n distributed nodes in an efficient manner such that each storage node only contains a small amount (i.e., 1/k) of the data and a data collector connected to any k nodes can retriev...

متن کامل

Exact Regeneration Codes for Distributed Storage Repair Using Interference Alignment

The high repair cost of (n, k) Maximum Distance Separable (MDS) erasure codes has recently motivated a new class of codes, called Regenerating Codes, that optimally trade off storage cost for repair bandwidth. On one end of this spectrum of Regenerating Codes are Minimum Storage Regenerating (MSR) codes that can match the minimum storage cost of MDS codes while also significantly reducing repai...

متن کامل

Distributed Data Storage with Minimum Storage Regenerating Codes - Exact and Functional Repair are Asymptotically Equally Efficient

We consider a set up where a file of size M is stored in n distributed storage nodes, using an (n, k) minimum storage regenerating (MSR) code, i.e., a maximum distance separable (MDS) code that also allows efficient exactrepair of any failed node. The MDS property ensures that the original file can be reconstructed even if any n− k storage nodes fail. When a node fails, a new node collects data...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/1106.1250  شماره 

صفحات  -

تاریخ انتشار 2011