MapR 4.0.x Documentation : Sqoop

Sqoop transfers data between MapR-FS and relational databases. You can use Sqoop to transfer data from a relational database management system (RDBMS) such as MySQL or Oracle into MapR-FS and use MapReduce on the transferred data. Sqoop can export this transformed data back into an RDBMS. For more information about Sqoop, see the Apache Sqoop Documentation.

Hadoop MRv1 and MRv2 Builds of Sqoop 1.4.5

The MapR Maven repository contains two sets of jar files for Sqoop 1.4.5:

You can download and use the build that is appropriate for your application.

Installing Sqoop

The following procedures use the operating system package managers to download and install from the MapR Repository. If you want to install this component manually from packages files, see Packages and Dependencies for MapR Software.

To install Sqoop on an Ubuntu cluster:

  1. Execute the following commands as root or using sudo.
  2. Perform this procedure on a MapR cluster. If you have not installed MapR, see the Advanced Installation Topics.
  3. Update the list of available packages:

    apt-get update
  4. On each planned Sqoop node, install mapr-sqoop:

    apt-get install mapr-sqoop

To install Sqoop on a Red Hat or CentOS cluster:

  1. Execute the following commands as root or using sudo.
  2. Perform this procedure on a MapR cluster. If you have not installed MapR, see the Advanced Installation Topics.
  3. On each planned Sqoop node, install mapr-sqoop:

    yum install mapr-sqoop

Using Sqoop

For information about configuring and using Sqoop, see the following documents: