Google Compute Engine and MapR Technologies Crush MinuteSort Record
February 26, 2013
New TB MinuteSort Benchmark demonstrates MapR's scalability and speed
MapR Technologies, Inc., the Hadoop technology leader, announced at the O'Reilly Strata Conference a new world record for MinuteSort, sorting 15 billion 100-byte records (a total of 1.5 trillion bytes) in 60 seconds using Google Compute Engine and the MapR Distribution for Apache Hadoop. The benchmark, often referred to as the World Cup of data sorting, demonstrates how quickly data can be sorted starting and ending on disks.
MapR's record-setting benchmark was completed on 2,103 virtual instances in the Google Compute Engine. Each virtual instance consisted of four virtual cores and one virtual disk, for a total of 8,412 virtual cores and 2,103 virtual disks. The new record surpassed the previous record of 1.4TB. The new record is also almost three times the amount of data processed by the previous Hadoop MinuteSort record which was 578 GB.
"The record is significant because it represents a total efficiency improvement executed in a cloud environment," said Jack Norris, VP of marketing, MapR Technologies. "In an era where information is increasing by tremendous leaps, being able to quickly scale to meet data growth with high performance makes business analytics a reality in situations previously impossible."
While the previous MinuteSort record was achieved with custom hardware, MapR set the record using commercially available Google Compute Engine, Hadoop MapReduce and the MapR Distribution. Google Compute Engine is currently in Limited Preview but will soon be available so any business or developer can benefit from the scale, performance and value of Google's infrastructure.
For more detail on MapR's use of Google Compute Engine and Hadoop MapReduce to set the MinuteSort record, visit https://mapr.com/blog/hadoop-minutesort-record.
About MapR Technologies
MapR Technologies is a visionary Silicon Valley software company and creator of the next-generation data platform for AI and analytics, with the scale and reliability required by enterprise-grade, mission-critical deployments. The MapR Data Platform delivers the power of dataware to accelerate data-driven innovation. Forward leaning companies such as Cisco, Philips, and Société Générale, are able to create new data-driven solutions to outperform the competition. Learn more: mapr.com.
MapR is a registered trademark of MapR Technologies, Inc. in the United States and other countries. Other names and brands may be the property of others.