MapR Data Science Refinery is the scalable data science notebook offering with native platform access, superior out-of-the-box security, and automatic model storage with mirroring and sharing. With the MapR Data Science Refinery, our vision is to provide you with a suite of tools to enable you to distill insights from your data and turn them into operational next-gen applications that lead to actionable changes for your business.
Apache Drill is an open source, low-latency query engine for big data that delivers secure and interactive SQL analytics at petabyte scale. With the ability to discover schemas on-the-fly, Drill is a pioneer in delivering self-service data exploration capabilities on data stored in multiple formats in files or NoSQL databases. Drill is fully ANSI SQL compliant and integrates seamlessly with visualization tools.
Apache Spark is a general-purpose engine for large-scale data processing. It supports rapid application development for big data and allows for code reuse across batch, interactive, and streaming applications. The most popular use cases for Apache Spark include building data pipelines and developing machine learning models. The MapR Converged Data Platform is the choice for production Spark applications.
Hadoop is built to process large amounts of data from terabytes to petabytes and beyond. It delivers greater business impact when used as part of the MapR Converged Data Platform. The MapR Platform combines operational and analytical workloads that drive business insights in real time for complex integrations between disparate data silos.