Massively Parallel, Adaptive and Fault-Tolerant Execution of Data Flow Programs on Dynamic Clouds

Applicant Professor Dr. Odej Kao
Subject Area Security and Dependability, Operating-, Communication- and Distributed Systems
Term from 2010 to 2014
Project identifier Deutsche Forschungsgemeinschaft (DFG) - Project number 132320961
 

Project Description

The goal of Project B within the Stratosphere Research Unit is to research and to implement a massivelyparallel, adaptive and fault-tolerant execution framework for re-configurable Cloud environments. Forming the bottom layer of the entire research group, the project will address the challenge of how to bring together the demands of modern information management applications with the scalability and computational power offered by today’s Cloud systems. In particular, the project will conduct research on topology awareness for data locality in virtualized environments, the efficient and adaptive execution of data flow programs in such environments as well as methods to ensure fault tolerance in presence of permanent or transient node failures. All research results will be integrated with preliminary work of the research group, evaluated experimentally and will facilitate higher level research efforts of the Stratosphere project.
DFG Programme Research Units
Subproject of FOR 1306:  Stratosphere - Information Management above the Clouds