7 min read
In this blog series, we’re showcasing the top 10 reasons customers are turning to MapR in order to create new insights and optimize their data-driven strategies. Here’s reason #4: MapR provides true multi-tenancy with job isolation, volumes, quotas, data and job placement control, including for YARN.
Multi-tenancy is the ability of a single instance of software to serve multiple tenants. A tenant is a group of users that have the same view of the system. Hadoop, as an enterprise data hub, naturally demands multi-tenancy. Creating different instances of Hadoop for various users or functions is not acceptable as it makes it harder to share data across departments and creates silos.
From an administrator’s perspective, multi-tenancy requirements are to
The MapR multi-tenant architecture provides a way for you to address these requirements using industry-leading capabilities.
Per tenant policy controls
Typical use cases include volumes for specific users, projects, departments, development, and production environments. For example, if you need to organize data for a special project, you can create a specific volume for the project. The figure below shows two lines of businesses (retail and trading), each having their own volumes. Additionally each retail and trading user could have their own volume as well.You can mount volumes under other volumes to build a structure that reflects the needs of your organization. The volume structure defines how data is distributed across the nodes.
Volumes are great at providing policy management at a logical level. Volumes can be used to:
Establish ownership and accountability. Following specific permissions can be granted to other users or groups.
Enforce Quotas. You can associate a standard volume with an accountable entity and set quotas. Quotas can be advisory and enforced.
ExpressLane feature allows for small jobs to get ahead if cluster is extremely busy.
In summary, providing multi-tenancy on Hadoop cannot be bolted on. It has to be built into the foundation. These capabilities allow you to run your multi-tenant, multi-service cluster to create a shared services infrastructure that can be a foundation of your competitive advantage.
Be sure to check out the complete top 10 list here.
Stay ahead of the bleeding edge...get the best of Big Data in your inbox.