Sun 22 - Fri 27 October 2017 Vancouver, Canada
Fri 27 Oct 2017 11:15 - 11:37 at Regency A - Language Design Chair(s): Gregor Richards

Today’s cloud services rely extensively on replication to ensure availability and reliability. In complex datacenter network architectures, however, seemingly independent replica servers may inadvertently share deep dependencies (e.g., aggregation switches). Such unexpected common dependencies can cause correlated failures across an entire replication deployment, defeating the purpose of replication. Existing cloud management and diagnosis tools offer post-failure forensics, but they typically lead to prolonged failure recovery times. In this paper, we propose a novel language framework, RepAudit, that prevents correlated failure risks before service outages occur by allowing cloud administrators to proactively audit the replication deployments of interest. RepAudit consists of three new components: 1) a declarative domain-specific language, RAL, in which cloud administrators write auditing programs expressing diverse auditing tasks; 2) a high-performance RAL auditing engine that generates auditing results by accurately and efficiently analyzing the underlying structures of the target replication deployments; and 3) an RAL code generator that automatically produces complex RAL programs from easily written specifications. Our evaluation shows that RepAudit can determine the top-20 critical correlated failure root causes in a replication system containing 30,528 devices within 1 minute, which is 400x more efficient in auditing time than state-of-the-art efforts. To the best of our knowledge, RepAudit is the first effort capable of simultaneously offering expressive, accurate, and efficient correlated failure auditing for cloud-scale replication systems.
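To make the idea of a shared dependency concrete, here is a minimal Python sketch. It is not RAL and does not reproduce RepAudit's auditing engine; the device names, the `deployment` mapping, and both helper functions are hypothetical, and each replica's dependencies are assumed to be already flattened into a set.

```python
from collections import Counter

# Hypothetical deployment: each replica maps to the set of devices it depends on.
deployment = {
    "replica-1": {"tor-1", "agg-1", "core-1"},
    "replica-2": {"tor-2", "agg-1", "core-1"},
    "replica-3": {"tor-3", "agg-2", "core-1"},
}

def shared_dependencies(deps):
    """Return (device, replica_count) pairs for devices used by more than one
    replica, most widely shared first (ties broken by device name)."""
    counts = Counter(device for devices in deps.values() for device in devices)
    shared = [(device, n) for device, n in counts.items() if n > 1]
    return sorted(shared, key=lambda item: (-item[1], item[0]))

def single_points_of_failure(deps):
    """Return devices that every replica depends on; losing any one of them
    would take down all replicas at once."""
    if not deps:
        return set()
    return set.intersection(*deps.values())

if __name__ == "__main__":
    print(shared_dependencies(deployment))
    print(single_points_of_failure(deployment))
```

In this toy deployment the three replicas sit behind different top-of-rack switches, yet all of them depend on `core-1`, and two of them share `agg-1`. This is the kind of hidden common dependency the abstract describes.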

Fri 27 Oct (GMT-07:00) Tijuana, Baja California

10:30 - 12:00: OOPSLA - Language Design at Regency A
Chair(s): Gregor Richards (University of Waterloo)
10:30 - 10:52
Matthew Parkinson (Microsoft Research, UK), Dimitrios Vytiniotis (Microsoft Research, Cambridge), Kapil Vaswani (Microsoft Research), Manuel Costa (Microsoft Research), Pantazis Deligiannis (Microsoft Research), Dylan McDermott (University of Cambridge), Jonathan Balkind (Princeton, USA), Aaron Blankstein (Princeton, USA)
10:52 - 11:15
Kiwan Maeng (Carnegie Mellon University, USA), Alexei Colin (Carnegie Mellon University), Brandon Lucia (Carnegie Mellon University)
11:15 - 11:37
Ennan Zhai (Yale University, USA), Ruzica Piskac (Yale University), Ronghui Gu (Columbia University, USA), Xun Lao (Yale University, USA), Xi Wang (Yale University, USA)
11:37 - 12:00
Ted Kaminski (University of Minnesota), Lucas Kramer (University of Minnesota), Travis Carlson (University of Minnesota, USA), Eric Van Wyk (University of Minnesota, USA)