MapR Technologies and EMC Announce Technology Licensing Agreement for Next Generation Hadoop Distribution

May 24, 2011

Game-changing innovations bring unmatched performance, reliability, and manageability to Apache Hadoop

MapR Technologies, Inc. today announced a software licensing agreement with EMC Corporation (NYSE:EMC) in which MapR Technologies will be part of the recently announced EMC® Greenplum® HD Enterprise Edition, a 100 percent interface-compatible implementation of the Apache Hadoop software stack. The new EMC system will incorporate MapR Technologies' pre-integrated, tested and hardened distribution for Apache Hadoop.

"EMC is focused on delivering the best-in-class solutions for Big Data. We evaluated the various Hadoop software offerings and believe that MapR is the clear enterprise- class innovation leader. With MapR, we are able to provide an unmatched solution for high availability, fault tolerance, and enterprise-class support and service. Combined with the EMC Greenplum Database we will enable the co-processing of both structured and unstructured data within a single, seamless solution," said Scott Yara, Co-Founder of Greenplum and Vice President of Products, Data Computing Division, EMC.

Although a number of Hadoop distributions are available, they fail to address underlying customer concerns such as single points of failure, lack of snapshots and mirroring, and poor performance.

"This is a major advancement for Hadoop users everywhere. MapR's innovations coupled with EMC's big data analytics capabilities and service will allow more people to use the power of big data analytics and enable substantial market growth," said John Webster, Senior Analyst, Evaluator Group. "MapR has managed to innovate on performance, cost reduction, dependability and ease-of-use all at once. This marks a major shift for the Hadoop market."

MapR's innovations transform Hadoop into a dependable compute platform while also increasing performance. Specific MapR advances make Hadoop, easy, dependable and fast and include:


  • NFS direct access allows users to use the NFS protocol to simply load and access data directly in a Hadoop cluster and enables standard tools and utilities to work directly on data contained in Hadoop.
  • Heatmap user interface provides full cluster visibility and control.


  • All single points of failure are eliminated in the Hadoop stack
  • JobTracker High Availability ensures continuous job execution.
  • Distributed NameNode with High Availability addresses major reliability issue while also improving performance and scale.
  • Snapshots allow point-in-time data protection and recovery.
  • Mirroring for business continuity includes wide area replication support.


  • Significant speed and efficiency improvements result in faster execution with half the hardware required by other distributions.

"Today marks an exciting milestone as we announce our partnership with EMC and unveil the industry's best distribution for Apache Hadoop that will advance and grow the entire market," said John Schroeder, CEO and Co-Founder, MapR Technologies. "We listened to customers, partners and the community about where Hadoop needed major investment and addressed those areas by delivering breakthrough innovations."

About the Data Computing Division of EMC
EMC's Data Computing Division is driving the future of data warehousing and analytics with breakthrough products including the EMC Greenplum Data Computing Appliance, EMC Greenplum Database, EMC Greenplum Community Edition, EMC Greenplum HD – Enterprise ready Apache Hadoop, and EMC Greenplum Chorus™-the industry's first Enterprise Data Cloud platform. The division's products embody the power of open systems, cloud computing, virtualization and social collaboration-enabling global organizations to gain greater insight and value from their data than ever before possible.

MapR is a trademark of MapR Technologies, Inc. EMC and Greenplum are trademarks or registered trademarks of EMC Corporation in the U.S. and other countries. Apache Hadoop and Hadoop is a trademark of the Apache Software Foundation. All other trademarks are the property of their respective owners.


About MapR Technologies

MapR Technologies is a visionary Silicon Valley software company and creator of the next-generation data platform for AI and analytics, with the scale and reliability required by enterprise-grade, mission-critical deployments. The MapR Data Platform delivers the power of dataware to accelerate data-driven innovation. Forward leaning companies such as Cisco, Philips, and Société Générale, are able to create new data-driven solutions to outperform the competition. Learn more:

MapR is a registered trademark of MapR Technologies, Inc. in the United States and other countries. Other names and brands may be the property of others.

Media Contacts

Beth Winkowski
MapR Technologies, Inc.
(978) 649-7189

Kim Pegnato
MapR Technologies, Inc.
(781) 620-0016