Machine Learning with Apache Spark

The goal of Spark’s machine learning (ML) library is to make practical machine learning scalable and easy. Decision trees are widely used for two common machine learning tasks: classification and regression.

In this Free Code Friday post, I’ll give an overview of machine learning with Apache Spark’s MLlib, and I'll show you how to use decision trees to predict flight delays.

We'll go over:

  • An overview of machine learning with Apache Spark MLlib
  • A machine learning classification workflow
  • Predicting flight delays with Apache Spark MLlib decision trees
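
To give a feel for what a decision tree does before we get to Spark, here is a minimal sketch in plain Python of how a tree might pick its first split. This is not the MLlib API and not the code from the session. The toy flight rows, the labels, and the helper names `gini` and `best_split` are all made up for illustration, and Gini impurity is assumed as the split criterion.

```python
from collections import Counter


def gini(labels):
    """Gini impurity of a list of class labels (0.0 means the list is pure)."""
    n = len(labels)
    if n == 0:
        return 0.0
    return 1.0 - sum((count / n) ** 2 for count in Counter(labels).values())


def best_split(rows, labels):
    """Try every feature/threshold pair and return the one with the lowest
    weighted Gini impurity as (feature_index, threshold, impurity)."""
    best = None
    n = len(rows)
    for f in range(len(rows[0])):
        for t in sorted({r[f] for r in rows}):
            left = [y for r, y in zip(rows, labels) if r[f] <= t]
            right = [y for r, y in zip(rows, labels) if r[f] > t]
            if not left or not right:
                continue  # a split must put rows on both sides
            score = (len(left) * gini(left) + len(right) * gini(right)) / n
            if best is None or score < best[2]:
                best = (f, t, score)
    return best


# Hypothetical toy data: (scheduled departure hour, distance in miles)
flights = [(6, 300), (7, 1200), (9, 450), (17, 500), (18, 2100), (20, 350)]
# Hypothetical labels: 1 = delayed, 0 = on time
delayed = [0, 0, 0, 1, 1, 1]

feature, threshold, impurity = best_split(flights, delayed)
print(f"split on feature {feature} at <= {threshold} (impurity {impurity})")
```

In this toy data, the departure hour separates delayed from on-time flights cleanly, so the sketch chooses it as the first split. A full decision tree repeats the same search on each side of the split, and a distributed library such as MLlib performs that search over data spread across a cluster.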

Related Blog Post: Apache Spark Machine Learning Tutorial

Carol McDonald

Carol has extensive experience as a developer and architect building complex, mission-critical applications in the banking, health insurance, and telecom industries. As a Java Technology Evangelist at Sun Microsystems, Carol traveled all over the world speaking at Sun Tech Days, JUGs, companies, and conferences. She is a recognized speaker in Java communities.

Want to join future Free Code Fridays? Check out the lineup here.

Additional Resources