Jump-Start Your Data Exploration Project

Market Landscape

New analytical insights are the fuel for future business growth and the ideation of new business opportunities. Whether you're a line-of-business manager, IT manager, or analytics leader, self-service analytics is a key business initiative. The data driving these new analytics is increasingly semi-structured in nature, such as JSON and other file types, and is typically stored in Hadoop and NoSQL systems.

The MapR Data Exploration Quick Start Solution enables business analysts and data analysts to analyze larger and more diverse sets of data faster, and to formulate new hypotheses more quickly than before. IT organizations can fulfill their promise of delivering analyses to business users faster and more efficiently.

Solution Highlights

The Data Exploration Quick Start Solution provides the following critical capabilities that organizations require:

  • Ingestion of various data sources into the cluster using specific workflows
  • Identification and implementation of business logic to process and aggregate data that has been ingested
  • Querying of data via ad-hoc queries and visualizations with a BI tool of choice

The Data Exploration Quick Start Solution includes a solution template built on the MapR Distribution including Apache™ Hadoop® and leverages Apache Drill to enable organizations to realize faster time-to-value with their data exploration projects.

Data Exploration Quick Start Solution: What's Included

Software, Professional Services and Certification are all included.

The Data Exploration Quick Start Solution includes a combination of software, professional services and training.

Software. Six nodes of any edition of the MapR Distribution including Apache Hadoop. One year of support is included, covering Apache Drill and Apache Spark.

Quick Start Services. The primary goal of the Quick Start services engagement is to jump-start a data exploration solution through the use of pre-built templates. The services component of the Data Exploration Quick Start Solution is a five-week engagement that comes with the following deliverables:

  1. Installation and configuration of the MapR cluster
  2. Access and use of the solution template
  3. Knowledge transfer on customizing the solution template
  4. Deployment architecture document that enables a production rollout plan

Hadoop Certification. The Data Exploration Quick Start Solution includes Hadoop certification exams for three professionals. After completing the requisite Hadoop On-Demand Training, you can take a certification exam, put your new skills into action right away, and establish yourself as an accredited big data specialist within your organization.

The following certification exams are offered:

  1. MapR Certified Hadoop Administrator (MCHA)
  2. MapR Certified Hadoop Developer (MCHD)
  3. MapR Certified HBase Developer (MCHBD)

Business Benefits

Rapid time-to-value
Business and data analysts can query any dataset without waiting for data modeling and schema development.

Efficiency and governance
IT can avoid unnecessary ETL cycles and schema maintenance, and still ensure governance.

Leverage existing investments
Organizations can use their existing SQL talent base and profit from their current investments in BI and visualization tools.

Solution Highlights

Software, services and training delivered in a five-week engagement enable self-service data exploration on semi-structured data.

Covers the complete data pipeline from data ingestion to processing, aggregation, and querying.

Apache Drill Benefits

Schema Discovery on the Fly
Apache Drill is unique in its ability to discover schemas on the fly. It can draw schema information from a centralized repository such as the Hive metastore or from self-describing data.
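
As a rough illustration of what schema discovery on self-describing data means, the Python sketch below infers field names and value types directly from JSON records rather than requiring a schema to be defined up front. It is a conceptual sketch only and does not represent how Drill is implemented; the sample records and the discover_schema helper are hypothetical.

```python
import json


def discover_schema(records):
    """Merge the field names and value types seen across JSON records.

    Hypothetical helper for illustration; not part of Apache Drill.
    """
    schema = {}
    for record in records:
        for field, value in record.items():
            # Track every type observed for a field, since records may differ.
            schema.setdefault(field, set()).add(type(value).__name__)
    return schema


# Hypothetical sample data: newline-delimited JSON with no declared schema.
raw = """
{"user": "ana", "clicks": 3}
{"user": "raj", "clicks": 7, "country": "IN"}
"""

records = [json.loads(line) for line in raw.strip().splitlines()]
schema = discover_schema(records)

for field, types in sorted(schema.items()):
    print(field, sorted(types))
```

Note that the second record introduces a field ("country") that the first one lacks. The structure is taken from the data itself, so no model has to be revised before the new field can be queried.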

ANSI-SQL Support
Drill offers out-of-the-box connectivity with all ANSI SQL-compliant query builder and visualization tools such as Tableau, Qlik, Tibco Spotfire and others using standard ODBC and JDBC drivers.

Apache Drill can scale to terabytes or petabytes of data, thousands of users, and thousands of nodes.

Apache Drill supports various authentication mechanisms and row- and column-level controls, and has a decentralized security model.

About MapR

MapR provides the industry’s only big data platform that combines the processing power of the top-ranked Hadoop with web-scale enterprise storage and real-time database capabilities, enabling customers to harness the enormous power of their data. Organizations with the most demanding production needs, including sub-second response for fraud prevention, secure and highly available data-driven insights for better healthcare, petabyte analysis for threat detection, and integrated operational and analytic processing for improved customer experiences, run on MapR. A majority of customers achieve payback in fewer than 12 months and realize greater than 5X ROI. MapR ensures customer success through world-class professional services and with free on-demand training that 40,000 developers, data analysts and administrators have used to close the big data skills gap. Amazon, Cisco, Google, HP, SAP, and Teradata are part of the worldwide MapR partner ecosystem. Investors include Google Capital, Lightspeed Venture Partners, Mayfield Fund, NEA, Qualcomm Ventures and Redpoint Ventures. Connect with MapR on Facebook, LinkedIn, and Twitter.