Partner App: Waterline Data

Waterline Data Catalog

Waterline Data Catalog builds a complete inventory of your data assets in Hadoop, automatically and securely, and lets enterprise users find, understand, and help govern Hadoop data. As a result, Data Engineers and Data Scientists no longer need to struggle to get the right data for deep analytics; Business Analysts can leverage Hadoop data for ad-hoc reporting; Data Governance Teams can find sensitive data easily and ensure that Hadoop data is in compliance; and Big Data IT can deploy an economical and scalable data self-service platform that allows IT to stay ahead of the fast-growing needs of the business.

Application Description

Waterline Data Catalog resides on an edge node of a Hadoop network and crawls and profiles all Hadoop data, automatically and securely. The profiling process parses files to compute or infer detailed properties, including field-level data quality metrics, data distribution, and tags. It then uses the profiling results to build a complete inventory of the Hadoop data assets, identifying lineage, temporary files, and sensitive data. It also aids data scientists’ efforts by propagating the tags they apply to files and fields as suggestions for other files and fields that have the same characteristics.
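To make the idea of field-level profiling concrete, the following is a minimal Python sketch that computes a few per-field properties (empty-value count, distinct count, inferred type, and most common values) for a small CSV file. It is an illustration only and does not represent how Waterline Data Catalog is implemented; the function names `infer_type` and `profile_csv`, the chosen metrics, and the sample data are all assumptions made for this example.

```python
import csv
import io
from collections import Counter


def infer_type(values):
    """Guess a simple type for a list of non-empty strings (hypothetical helper)."""
    def all_parse(cast):
        for value in values:
            try:
                cast(value)
            except ValueError:
                return False
        return True

    if not values:
        return "unknown"
    if all_parse(int):
        return "integer"
    if all_parse(float):
        return "float"
    return "string"


def profile_csv(text):
    """Compute per-field metrics for CSV text that starts with a header row."""
    reader = csv.DictReader(io.StringIO(text))
    columns = {name: [] for name in (reader.fieldnames or [])}
    for row in reader:
        for name in columns:
            # Missing trailing fields come back as None; treat them as empty.
            columns[name].append((row.get(name) or "").strip())

    profile = {}
    for name, raw in columns.items():
        present = [value for value in raw if value != ""]
        profile[name] = {
            "rows": len(raw),
            "empty": len(raw) - len(present),      # simple data quality metric
            "distinct": len(set(present)),         # cardinality of the field
            "type": infer_type(present),           # inferred, not declared
            "top_values": Counter(present).most_common(3),  # value distribution
        }
    return profile


# Made-up sample data, for illustration only.
sample = "id,state,amount\n1,CA,10.5\n2,,7\n3,CA,\n"
result = profile_csv(sample)
for field, metrics in result.items():
    print(field, metrics)
```

A real profiler would run at scale on the cluster and handle many file formats; this sketch only shows the kind of per-field summary that an inventory could be built from.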


Component    Version    Connection Method
MapR         4.0+       HDFS API, MapReduce, Hive
Application Version: V1.1

Download App

Installation instructions

Installation Instructions are here


Use Instructions

Tutorials:

  1. How to Hadoop Effortlessly with Waterline Data Inventory
  2. Tag, Annotate, and Propagate Tags
  3. Search and Browse
  4. File Lineage
  5. User Guide

http://www.waterlinedata.com/solutions

Support Information

Waterline Data Inventory provides three support levels:

  1. Community: Online portal with a product knowledge base, forum, and issue submission.
  2. Professional: Community access plus phone and email support during business hours, 9 a.m. to 6 p.m. Pacific time.
  3. Enterprise: Community access plus phone and email support 24x7.

For support level details, see Support Overview. To access the community portal, go to http://support.waterlinedata.com and register.


suzie@waterlinedata.com