Contributor: James Casaletto

James is a Principal Solutions Architect for MapR, where he develops and deploys big data solutions with Apache Hadoop.

Blog Posts by James Casaletto

February 26, 2015 | By James Casaletto

How to: Using Non-Java Programs or Streaming for MapReduce Jobs

In this blog post, we introduce the concept of using non-Java programs or streaming for MapReduce jobs. MapReduce’s streaming feature allows programmers to write their MapReduce programs in languages other than Java, such as Perl or Python. You can...

February 25, 2015 | By James Casaletto

How to: Launching MapReduce Jobs

In this post, we look at the different approaches for launching multiple MapReduce jobs, and analyze their benefits and shortfalls. Topics covered include how to implement job control in the driver, how to use chaining, and how to work with Oozie to manage...

February 17, 2015 | By James Casaletto

How to Use the MapReduce API

Hadoop MapReduce is a framework that simplifies the process of writing big data applications running in parallel on large clusters of commodity hardware. The MapReduce framework consists of a single master ResourceManager, one slave NodeManager per cluster...

February 10, 2015 | By James Casaletto

Managing, Monitoring, and Testing MapReduce Jobs: Managing Jobs and Tasks

In this post, we will discuss how to use the MapR Control System (MCS) to monitor MRv1 jobs. We will also see how to manage and display jobs, history, and logs using the command line interface. In part 1 of this post, we focused on how to work with built...

February 05, 2015 | By James Casaletto

Managing, Monitoring, and Testing MapReduce Jobs: How to Work with Counters

In this post, we detail how to work with counters to track MapReduce job progress. We will look at how to work with Hadoop’s built-in counters, as well as custom counters. In part 2, we will discuss how to use the MapR Control System (MCS) to monitor...

February 03, 2015 | By James Casaletto

How to: Job Execution Framework MapReduce V1 & V2

In this blog post, we compare MapReduce v1 to MapReduce v2 (YARN), and describe the MapReduce Job Execution framework. We also take a detailed look at how jobs are executed and managed in YARN and how YARN differs from MapReduce v1. Note: The material...

January 29, 2015 | By James Casaletto

How to Write a MapReduce Program

In this blog post, we detail how data is transformed as it moves through the MapReduce framework; how to design and implement the Mapper, Reducer, and Driver classes; and how to execute the MapReduce program. Note: The material from this blog post is from our free...

December 10, 2014 | By James Casaletto

How to Configure the Network for the MapR Sandbox for Hadoop - #WhiteboardWalkthrough

Editor's Note: In this week's Whiteboard Walkthrough, James Casaletto walks you through how to configure the network for the MapR Hadoop Sandbox. Whether you use VirtualBox, VMware Fusion, VMware Player, or pretty much any hypervisor on your laptop...
