Apache ZooKeeper

In any distributed cluster, it is important that all nodes be able to share configuration and state data in a reliable way. Hadoop relies on ZooKeeper to keep each of its distributed processes, including MapReduce and HBase, consistent across the cluster.

ZooKeeper nodes keep a shared hierarchical namespace of data registers (znodes) in RAM, which lets clients read it with high throughput and low latency. Hadoop clusters should be provisioned with an odd number of ZooKeeper nodes, typically 3 or 5, so that a majority (quorum) of nodes survives failures: an ensemble of 3 tolerates one failed node, and an ensemble of 5 tolerates two.
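The preference for an odd ensemble size follows from simple majority arithmetic. The sketch below is illustrative only: the function names quorum_size and tolerated_failures are hypothetical helpers, not part of any ZooKeeper API, and the code assumes that a quorum is a strict majority of the configured nodes.

```python
def quorum_size(ensemble_size: int) -> int:
    """Smallest strict majority of an ensemble (assumed quorum rule)."""
    if ensemble_size < 1:
        raise ValueError("ensemble_size must be at least 1")
    return ensemble_size // 2 + 1


def tolerated_failures(ensemble_size: int) -> int:
    """Number of nodes that can fail while a majority is still running."""
    return ensemble_size - quorum_size(ensemble_size)


if __name__ == "__main__":
    # Compare odd and even ensemble sizes side by side.
    for n in range(1, 7):
        print(f"{n} nodes: quorum={quorum_size(n)}, "
              f"tolerated failures={tolerated_failures(n)}")
```

Under this assumption, going from 3 to 4 nodes raises the quorum from 2 to 3 without raising the number of tolerated failures, which is why even sizes are usually avoided.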

