Hao Zhu is a Manager, Hadoop Escalation Group at MapR. Prior to MapR, Hao was Principal Technical Support Engineer at Pivotal; before Pivotal, he was an Oracle DBA at eBay. Openkb.info is his personal technical blog.
Blog Posts by Hao Zhu
September 11, 2015 | By Hao Zhu
Resource Allocation Configuration for Spark on YARN
In this blog post, I will explain the resource allocation configurations for Spark on YARN, describe the yarn-client and yarn-cluster modes, and include examples. Spark can request two resources in YARN: CPU and memory. Note that Spark configurations...
July 24, 2015 | By Hao Zhu
Best Practices for YARN Resource Management
In this blog post, I will discuss best practices for YARN resource management. The fundamental idea of MRv2 (YARN) is to split up the two major functionalities, resource management and job scheduling/monitoring, into separate daemons. The idea is to have...
July 20, 2015 | By Hao Zhu
Hive Transaction Feature in Hive 1.0
The New Hive Transaction Feature Adds ACID Semantics at the Row Level
This article describes the new Hive transaction feature introduced in Hive 1.0. This new feature adds initial support for the four traits of database transactions: atomicity, consistency...