Cask Data Application Platform (CDAP), the first unified integration platform for big data, lets developers, architects and citizen integrators focus on applications and insights rather than infrastructure and integration. CDAP accelerates time to value from big data through standardized APIs, configurable templates and visual interfaces.
CDAP provides a container architecture for your data and applications on Hadoop. High-level abstractions and deep integrations with diverse Hadoop technologies dramatically increase productivity and quality in order to accelerate development and reduce time-to-production to get your Hadoop projects to market faster.
CDAP Datasets provide a standardized, logical container and runtime framework for data in varied storage engines. They integrate with other systems for instant data access and allow the creation of complex, reusable data patterns.
CDAP Programs provide a standardized, logical container and runtime framework to compute in varied processing engines. They simplify testing and operations with standard lifecycle and operational and can consistently interact with any data container.
CDAP Applications provide a standardized packaging system and runtime framework for Datasets and Programs. They manage the lifecycle of data and apps and simplify the painful integration and operation processes in heterogeneous infrastructure.
Data Pipelines: CDAP provides a data ingestion service that simplifies and automates the difficult and time consuming task of building, running, and managing data pipelines.
Data Preparation: CDAP provides an easy and interactive way to visualize, transform, and cleanse data. It helps to derive new schemas and operationalize the data preparation with a few clicks.
App Development: As an integrated application development framework, CDAP provides standardization and deep integration with diverse big data technologies with easy-to-use APIs to build, deploy and manage complex data analytics applications in the cloud or on-premises.
Metadata & Lineage: CDAP automatically captures technical, business and operational metadata and tracks lineage by understanding changing datasets and flow of data. It provides an audit log for easy traceability for data quality and compliance needs.
Security & Operations: CDAP offers sophisticated security, authentication, authorization and encryption and integrates with LDAP, AD, Kerberos, JASPI, Apache Sentry and Apache Ranger. It provides a robust and portable production runtime environment for secure deployment and management of data lakes and data applications on Hadoop and Spark.
MapR Distribution | 4.1 | HDFS MapR Distribution | 4.1 | MapReduce MapR Distribution | 4.1 | HBase MapR Distribution | 4.1 | YARN MapR Distribution | 4.1 | Spark MapR Distribution | 4.1 | Zookeeper MapR Distribution | 4.1 | Hive
Application Version: CDAP 3.1.0
CDAP provides details installation guide at the website:
CDAP provides extensive documentation to show use cases and examples to take advantage of the platform:
As an open source offering, CDAP provides basic support via developer mailing list and user mailing list. CDAP is backed by Cask which provides commercial support when needed. CDAP also provides JIRA to file and track issues and features development.
CDAP dev group: https://groups.google.com/forum/#!forum/cdap-dev
CDAP user group: https://groups.google.com/forum/#!forum/cdap-user
CDA issues updates: https://groups.google.com/forum/#!forum/cdap-issues
CDAP Issue tracker: https://issues.cask.co/browse/CDAP