Apache Drill

Apache Drill Enables Self-Service Data Exploration on All Data with a Schema-Free SQL Query Engine

Apache Drill is an open-source distributed SQL query engine integrated into the MapR Converged Data Platform for delivering fast and secure self-service BI SQL analytics at scale. Drill’s distributed shared-nothing architecture enables incremental scale-out on low-cost hardware to meet growing demands for query response time and user concurrency.

ANSI SQL compliance, SQL user interface tools, and MapR supported integrations with popular BI tools, such as Tableau, MicroStrategy, and Qlik, allow architecture modernization without disrupting current BI analyst workflows. With the ability to discover schemas on the fly, Drill is a pioneer in in-place analytics on historical data stored in popular file formats such as Parquet, JSON, CSV, and TSV in MapR-XD, alongside operational data stored in MapR-DB.

Top 10 Reasons to Choose Apache Drill on MapR

High Performance SQL Queries with Scale-Out Architecture

Apache Drill can support thousands of users across thousands of nodes running queries on data that is in the terabyte and petabyte range.

Schemaless Query Execution for Data Exploration

Apache Drill can discover schemas on the fly and enable immediate exploration of data stored in MapR across a variety of data formats and sources.
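As a rough illustration of querying a file in place, the sketch below builds a SQL statement against a JSON file and wraps it in a request for Drill's REST endpoint (`POST /query.json`). The file path `/data/events.json`, the `dfs` storage plugin name, and the `localhost:8047` address are assumptions made for this example, and the helper names are hypothetical. No table definition is created first; Drill works out the columns when it reads the file.

```python
import json
import urllib.request


def build_drill_query(path, limit=10, plugin="dfs"):
    """Return SQL that reads a file in place through a storage plugin.

    `path` and `plugin` are placeholders; adjust them to your cluster.
    """
    if "`" in path:
        raise ValueError("path must not contain backticks")
    # Drill quotes file paths with backticks after the plugin name.
    return f"SELECT * FROM {plugin}.`{path}` LIMIT {int(limit)}"


def build_rest_request(sql, base_url="http://localhost:8047"):
    """Wrap a SQL string in a POST request for Drill's /query.json endpoint."""
    payload = json.dumps({"queryType": "SQL", "query": sql}).encode("utf-8")
    return urllib.request.Request(
        f"{base_url}/query.json",
        data=payload,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def run_query(sql, base_url="http://localhost:8047"):
    """Send the query to a running Drillbit and return the result rows."""
    with urllib.request.urlopen(build_rest_request(sql, base_url)) as resp:
        return json.load(resp)["rows"]
```

Calling `run_query(build_drill_query("/data/events.json"))` needs a reachable Drillbit with that file present; the two builder functions can be tried without a cluster.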

In-Place Analytics across Historical and Operational Data

No need to move data between operational and analytics clusters.

Connectivity to Popular BI Tools through ODBC and JDBC Interfaces

Connect with popular BI tools, such as Tableau, MicroStrategy, Qlik, and many more.

ANSI SQL Compliance

All the SQL analytics functionality you would expect is available out of the box, including aggregates, filters, sorting, sub-queries (scalar and correlated), and CREATE TABLE AS / CREATE VIEW AS.
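To make the list above concrete, here is a small sketch that combines a filter, aggregates, and sorting in one statement, then wraps it in a CREATE TABLE AS statement. The file `/data/orders.json`, its columns (`region`, `amount`, `status`), and the helper `as_ctas` are hypothetical names chosen for illustration; `dfs.tmp` is assumed to be a writable workspace on the target cluster.

```python
import textwrap

# Hypothetical file and column names, used only for illustration.
REGION_TOTALS = textwrap.dedent("""\
    SELECT o.region, COUNT(*) AS order_count, SUM(o.amount) AS total_amount
    FROM dfs.`/data/orders.json` o
    WHERE o.status = 'shipped'
    GROUP BY o.region
    ORDER BY total_amount DESC""")


def as_ctas(table, select_sql, workspace="dfs.tmp"):
    """Wrap a SELECT statement in CREATE TABLE ... AS for a writable workspace."""
    if "`" in table:
        raise ValueError("table name must not contain backticks")
    return f"CREATE TABLE {workspace}.`{table}` AS\n{select_sql}"


if __name__ == "__main__":
    # Print the statement so it can be pasted into a SQL client connected to Drill.
    print(as_ctas("region_totals", REGION_TOTALS))
```

Keeping the SELECT separate from the CTAS wrapper lets the same query be run interactively first and materialized only once the result looks right.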

Integration with MapR-DB Secondary Indexes for Operational Analytics

Up to 10X query performance improvement due to native integration with MapR-DB, including secondary indexes.

Integration with Hive for Interactive Queries on Existing Hive Tables

Continue querying existing Hive tables. No disruptions to existing BI workflows.

Cluster Health and Resource Monitoring through MapR Control System

MCS gives you a single pane of glass for cluster metrics, alarms, and service logs as well as a curated user experience with streamlined workflows for common user actions.

Integration into MapR Data Science Refinery for Augmenting Data Science Workflows

Direct integration into the MapR Data Science Refinery enables self-service data exploration for data scientists, leading to better models.

End-To-End Security for Data Accessed, Processed, and Analyzed

Versatile authentication mechanisms: PAM, Kerberos, and MapR Security. State-of-the-art encryption protects sensitive data, with SSL and AES-256-GCM support.

How Your Business Benefits from Apache Drill on MapR

Self-Service Analytics for All Users Across Multiple Data Sources

Empower your employees to make decisions with access to business and market insights.

In-Place SQL BI Analytics and Data Exploration

Analyzing data in place instead of moving it across clusters saves valuable time and enables faster decisions and actions.

Commodity Hardware for Scale-Out across On-Prem and Cloud

Scale-out architecture on commodity hardware and less manual administration lead to lower TCO.

Data Exploration and SQL Access for Data Scientists

Direct integration into the MapR Data Science Refinery enables self-service data exploration and discovery, making data scientists more productive.

High Availability Ensures Business Intelligence Continuity

The MapR Platform provides high availability and disaster recovery out of the box, making sure your business continues to benefit from timely insights.

Raw Data Exploration:

As a business analyst or SQL specialist, you can instantly query data in any format or source the moment it lands in MapR and be more agile, without waiting for weeks or months for your IT team to develop the required schema. Goodgame Studios utilizes Apache Drill for data exploration on new data that is being continuously generated with rapidly evolving schemas.

Self-Service Interactive BI for Reporting and Ad-Hoc Querying:

As an architect, you can enable business analysts to develop ad-hoc queries and reports on data being moved from a data warehouse as well as new data being stored in a data lake or data hub. This self-service interactive BI can be accomplished at a fraction of the cost of traditional systems. Harte Hanks utilizes Apache Drill on the MapR Platform to gain immediate insights from new digital sources of data, including survey data and social media.

Community

Learn and contribute to the community

On-Demand Training

Apache Drill Training

All Training Information

Whitepaper

Delivering Fastest Time-to-Value for SQL-on-Hadoop

Blog

More about Apache Drill

Tutorial

Get started with Drill on the MapR Sandbox.

Technical

Drill Documentation.

Video

Apache Drill with Tableau - Demo