Install Sqoop2

Sqoop2 includes two packages:
  • The client package, mapr-sqoop2-client (install on each node that will act as a client)
  • The server package, mapr-sqoop2-server (install on at least one node in the cluster)
The Sqoop2 server also acts as a MapReduce client, so each node where you install the mapr-sqoop2-server package must also have Hadoop installed and configured. If many clients will connect, you can install the server package on several nodes so that no single server is overloaded.
Warning: Sqoop2 packages cannot be installed on the same nodes as Sqoop1 packages. However, you can use both versions in the same Hadoop cluster by installing Sqoop1 and Sqoop2 on different nodes.
Execute the following commands as root or using sudo:
  1. Update the list of available packages:
    On Ubuntu
    apt-get update
    On RedHat and CentOS
    yum clean all
  2. On each Sqoop2 server node, install mapr-sqoop2-server:
    On Ubuntu
    apt-get install mapr-sqoop2-server
    On RedHat and CentOS
    yum install mapr-sqoop2-server
  3. On each Sqoop2 client node, install mapr-sqoop2-client:
    On Ubuntu
    apt-get install mapr-sqoop2-client
    On RedHat and CentOS
    yum install mapr-sqoop2-client
  4. If you installed Sqoop2 1.99.3 (a version prior to the 1507 release) and the cluster runs in YARN mode (MRv2), update the path to the Hadoop configuration directory in the sqoop.properties file. By default, the file contains the path for MRv1.
    Note: As of Sqoop2 1.99.6-1507, Sqoop2 automatically sets the Hadoop configuration directory based on the Hadoop version that the cluster uses.
    In /opt/mapr/sqoop/sqoop-2.0.0/server/conf/sqoop.properties, set the following path for YARN:
    # Hadoop configuration directory
    org.apache.sqoop.submission.engine.mapreduce.configuration.directory=/opt/mapr/hadoop/hadoop-2.x.x/etc/hadoop/
  5. Start the Sqoop2 server on each Sqoop2 server node:
    maprcli node services -name sqoop2 -action start -nodes <space delimited list of nodes>
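The property change in step 4 can also be scripted. The following is a minimal sketch, not a MapR-provided tool: set_hadoop_conf_dir is a hypothetical helper, and the MRv1 value in the demonstration is a placeholder. The sketch runs against a scratch file, so it does not touch a real installation; to apply it for real, you would pass the sqoop.properties path from step 4 and the actual Hadoop directory in place of hadoop-2.x.x, after backing up the file.

```shell
#!/bin/sh
# Property name taken from step 4 of this procedure.
PROP="org.apache.sqoop.submission.engine.mapreduce.configuration.directory"

# Hypothetical helper: set PROP to the given directory in a properties file.
# Usage: set_hadoop_conf_dir <properties file> <Hadoop configuration directory>
set_hadoop_conf_dir() {
    props_file="$1"
    conf_dir="$2"
    tmp_file="${props_file}.tmp"
    if grep -q "^${PROP}=" "$props_file"; then
        # Replace the existing value. '|' is the sed delimiter because paths contain '/'.
        sed "s|^${PROP}=.*|${PROP}=${conf_dir}|" "$props_file" > "$tmp_file" \
            && mv "$tmp_file" "$props_file"
    else
        # The property is missing, so append it.
        printf '%s=%s\n' "$PROP" "$conf_dir" >> "$props_file"
    fi
}

# Demonstration on a scratch file. The initial value is a placeholder, not a real MRv1 path.
demo_file="$(mktemp)"
printf '%s\n' "# Hadoop configuration directory" \
    "${PROP}=/path/to/mrv1/conf/" > "$demo_file"

set_hadoop_conf_dir "$demo_file" "/opt/mapr/hadoop/hadoop-2.x.x/etc/hadoop/"
cat "$demo_file"
```

The helper rewrites the file through a temporary copy rather than relying on in-place sed options, which differ between platforms. Because mv replaces the original file, check its ownership and permissions afterward.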