Contributor: Dong Meng

MapR Converge Blog author, Dong Meng

As a Data Scientist for MapR, Dong helps customers solve their business problems by leveraging his years of experience in statistical machine learning, data mining, and big data product development.

Blog Posts by Dong Meng

July 27, 2017 | By Dong Meng

Deploy Distributed Deep Learning QSS on MapR GPU Cluster, Part 2

MapR volume as persistent storage and running distributed Tensorflow with GPU Editor's Note: This is the fifth installment in our blog series about deep learning. In this series, we will discuss the deep learning technology, available frameworks/tools...

Read more
July 27, 2017 | By Dong Meng

Deploy Distributed Deep Learning QSS on MapR GPU Cluster, Part 1

A Step-By-Step Guide with Kubernetes 1.7 and MapR 5.2.1 Editor's Note: This is the fifth installment in our blog series about deep learning. In this series, we will discuss the deep learning technology, available frameworks/tools, and how to scale...

Read more
May 23, 2017 | By Dong Meng

Distributed Deep Learning on the MapR Data Platform

This is the third installment in our blog series about deep learning. In this series, we will discuss the deep learning technology, available frameworks/tools, and how to scale deep learning using big data architecture. Read Part 1 and Part 2. Introduction...

Read more
May 02, 2017 | By Dong Meng

Scaling Time Series Analysis on the MapR Data Platform

Introduction A time series is a collection of observations (x~t~), where x is the event recorded at time t. Common motivations for time series analysis include forecasting, clustering, classification, point estimation, and detection (in signal process...

Read more
August 30, 2016 | By Dong Meng

How to Integrate Apache PredictionIO with MapR for Actionable Machine Learning

Introduction PredictionIO is an open source machine learning server, and is a recent addition to the Apache family. PredictionIO allows you to: Quickly build and deploy an engine as a web service in production with customizable templates Respond to dynamic...

Read more
August 04, 2016 | By Dong Meng

How to Speed Up Ad-hoc Analytics with SparkSQL, Parquet, and Alluxio

How to Speed Up Ad-hoc Analytics with SparkSQL, Parquet, and Alluxio In the big data enterprise ecosystem, there are always new choices when it comes to analytics and data science. Apache incubates so many projects that people are always confused as to...

Read more
January 07, 2016 | By Dong Meng

How to Set Up Distributed XGBoost on MapR XD (Formerly MapR-FS)

XGBoost is a library that is designed for boosted (tree) algorithms. It has become a popular machine learning framework among data science practitioners, especially on Kaggle, which is a platform for data prediction competitions where researchers post...

Read more
Categories

50,000+ of the smartest have already joined!

Stay ahead of the bleeding edge...get the best of Big Data in your inbox.


Get our latest posts in your inbox

Subscribe Now