After installing the MapR core and any desired Hadoop components, you might need to perform additional steps to ready the cluster for production. Review the topics below for next steps that might apply to your cluster.
- Setting up the MapR Metrics Database
- Setting up Topology
- Setting Up Volumes
- Setting Up Central Configuration
- Designating Subnets for MapR
- Setting up MapR NFS
- Configuring Authentication
- Configuring Permissions
- Setting Usage Quotas
- Configuring alarm notifications
- Setting up a Client to Access the Cluster
- Working with Mirror Volumes
Setting up the MapR Metrics Database
In order to use MapR Metrics you have to set up a MySQL database where metrics data will be logged. For details see Setting up the MapR Metrics Database.
Setting up Topology
Your node topology describes the locations of nodes and racks in a cluster. The MapR software uses node topology to determine the location of replicated copies of data. Optimally defined cluster topology results in data being replicated to separate racks, providing continued data availability in the event of rack or node failure. For details see Setting Up Node Topology.
Setting Up Volumes
A well-structured volume hierarchy is an essential aspect of your cluster's performance. As your cluster grows, keeping your volume hierarchy efficient maximizes your data's availability. Without a volume structure in place, your cluster's performance will be negatively affected. For details see Managing Data with Volumes.
Setting Up Central Configuration
MapR services can be configured globally across the cluster, from master configuration files stored in a MapR-FS, eliminating the need to edit configuration files on all nodes individually. For details see Central Configuration.
Designating Subnets for MapR
If multiple NICs are present on nodes, you can configure MapR to use one or more of them, depending on the cluster's need for bandwidth. For details on configuring NICs, see Designating Subnets for MapR. Review Planning the Cluster for details on provisioning NICs according to data workload.
Setting up MapR NFS
The MapR NFS service lets you access data on a licensed MapR cluster via the NFS protocol. You can mount the MapR cluster via NFS and use standard shell scripting to read and write live data in the cluster. NFS access to cluster data can be faster than accessing the same data with the
hadoop fs commands. For details, see Setting Up MapR NFS. You also might also be interested in Setting Up VIPs for NFS.
If you use Kerberos, LDAP, or another authentication scheme, make sure PAM is configured correctly to give MapR access. See PAM Configuration.
By default, users are able to log on to the MapR Control System, but do not have permission to perform any actions. You can grant specific permissions to individual users and groups. See Setting User Permissions.
Setting Usage Quotas
You can set specific quotas for individual users and groups. See Setting Quotas for Users and Groups.
Configuring alarm notifications
If an alarm is raised on the cluster, MapR sends an email notification. For example, if a volume goes over its allotted quota, MapR raises an alarm and sends email to the volume creator. To configure notification settings, see Checking Alarms.
To configure email settings see Setting Up Alarm Notifications.
Setting up a Client to Access the Cluster
You can access the cluster either by logging into a node on the cluster, or by installing MapR client software on a machine with access to the cluster's network. For details see Setting Up the Client.
Working with Mirror Volumes
If you need to access multiple clusters or mirror data between clusters, see Working with Mirror Volumes.