| Paper: “MapReduce: Simplified Data Processing on Large Clusters,” by Jeffrey Dean and Sanjay Ghemawat. Discusses the programming model, types, an execution overview, implementation, master data structures, fault tolerance, locality, task granularity, backup tasks, refinements, partitioning function, ordering guarantees, combiner function, input and output types, side effects, skipping bad records, local execution, status information, counters, performance, cluster configuration, grep, sort, effects of backup tasks, machine failures, experience, and large-scale indexing. |