5 min read
We are proud to announce a validated reference architecture for the MapR Data Platform on Oracle Cloud Infrastructure (OCI). You can now deploy the MapR Data Platform on Oracle's high-performance cloud with full MapR support.
The MapR and Oracle partnership enables customers to benefit from a highly integrated data platform for big data and machine learning applications. Oracle and MapR share a common vision for delivering data insights across the enterprise, and both are committed to developing and delivering a best-in-class platform.
MapR offers a unified data platform that simultaneously runs analytics and applications with speed, scale, and reliability. It converges all data into a data fabric that can store, manage, process, apply, and analyze as the data happens.
The MapR Data Platform supports Hadoop, Spark, and Apache Drill with real-time database capabilities, global event streaming, and scalable enterprise storage to power a new generation of big data applications. It enables writing against open APIs across MapR and Oracle Cloud Infrastructure through JSON (OJAI), HBase, S3, HDFS, NFS, REST, and Kafka.
Oracle offers the most powerful bare metal compute instances with local flash storage in the industry. Only Oracle offers this local storage, based on advanced NVMe SSD technology and backed by a storage performance SLA.
Unlike other cloud infrastructure providers that oversubscribe networking, Oracle delivers low latency and high throughput via a non-oversubscribed 25-gigabit network infrastructure, which is a key requirement for high-performance, distributed, streaming workloads. Oracle Cloud Infrastructure is the only cloud with a network throughput performance SLA.
MapR clusters that are spun up in the cloud can sit right next to Exadata or Oracle Database environments over private networks, allowing easy data sharing for analytics. Gartner regards Oracle as one of the top three vendors in the Data Management Storage Analytics space, making MapR on Oracle Cloud Infrastructure a great choice for running analytics workloads.
Cloud infrastructure enables you to deploy the optimal amount of infrastructure to meet your demands. No more underutilization of too much infrastructure or long queues caused by under-forecasting. In addition, Oracle offers:
You can easily deploy the MapR Data Platform on Oracle Cloud Infrastructure by using Terraform automation.
The recommended network architecture for MapR deployment on Oracle Cloud Infrastructure consists of a virtual cloud network (VCN), containing three separate subnets that are duplicated across all the availability domains in a target region. This configuration gives you the ability to deploy a MapR cluster in any availability domain in the region and have the same topology and security lists associated with each network.
This network model is illustrated in the following diagram, with host associations at the subnet level, showing a single cluster running in a single availability domain.
The Terraform module for deploying MapR on Oracle Cloud Infrastructure is available on the Oracle Cloud Infrastructure Cloud Partners GitHub. Provisioning a fully ready cluster typically takes about 45 minutes, requiring minimal user interaction after setting a few configuration values in the Terraform template. Detailed steps for deploying MapR on Oracle Cloud Infrastructure are located in the readme file available in the GitHub repository.
If you don’t have an Oracle Cloud Infrastructure account yet, you can sign up for a 30-day free trial account.
Stay ahead of the bleeding edge...get the best of Big Data in your inbox.