Provenance Based Rebuild: Using Data Provenance to Improve Reliability

Published as Storage Systems Research Center Technical Report UCSC-SSRC-11-04.

Abstract

Traditionally, data preservation and reliability have used error correcting codes (ECCs) to ensure data safety. The development of general data provenance tracking sys- tems provides a new opportunity for data reliability. We present a method that utilizes provenance to determine a datum’s generating process and inputs, and then uses this information to recompute lost data. This method, called Provenance Based Rebuild (PBR) provides a new, com- plimentary reliability mechanism that integrates with tra- ditional systems to offer a variety of benefits including fine grained prioritized rebuild and parallel rebuild. While PBR offers benefits that address weaknesses in current techniques, it also faces a number of challenges such as data placement, and infrastructure provisioning.

Publication date:
May 2011

Authors:
Brian Madden
Ian Adams
Mark W. Storer
Ethan L. Miller
Darrell D. E. Long
Thomas Kroeger

Projects:
Reliable Storage

Available for download:

Full text:
Download as PDF

Bibtex entry

@techreport{madden11-tr,
  author       = {Brian Madden and Ian Adams and Mark W. Storer and Ethan L. Miller
and Darrell D. E. Long and Thomas Kroeger},
  title        = {Provenance Based Rebuild: Using Data Provenance to Improve
Reliability},
  institution  = {University of California, Santa Cruz},
  number       = {UCSC-SSRC-11-04},
  month        = may,
  year         = {2011},
}
Last modified 26 May 2011