6 min read
“Big Data” is no longer a buzzword. Businesses big and small that don’t invest now in big data technologies risk getting left behind as the marketplace becomes more and more data-driven. In fact, a recent McKinsey and Company report suggested that companies that invest in big data and analytics consistently outperform their peers in both productivity and revenue.
The advent of the Internet of Things means there is going to be more data collected and transmitted than ever before. Even small local businesses will need to invest in ways to collect, store, and process data in order to understand more about their customers and their products.
But where to start? Investing in big data infrastructure doesn’t have to be an overwhelming proposition. Here are a few things that you need to know when making the leap from antiquated databases of old to the modern big data platform.
Start small (and free). Hadoop is the open-source software framework of choice for many in the big data game. It’s built to scale, and can run on single servers to thousands of machines, and it is designed to handle failures at the application level rather than at the hardware level. Though it is an open source technology spearheaded by Apache, enterprise deployments built for business can make it easier to get off the ground.
When I was building out the framework for one of the first real-time bidding engines in digital advertising, I took advantage of the free version of MapR, and successfully used it until we grew out of it and needed to invest in a larger deployment to support our exponentially growing business. It was the perfect low-risk way to try out a platform without needing to spend hundreds of thousands of dollars. I learned how to use and manage the service, so that when we invested in the higher level enterprise platform, there were no surprises.
Get familiar with the ecosystem. The big data world can be a daunting place for those new to the game, but the architecture around your platform can make a big difference in how effective and efficient your business can be. On top of a big data distribution (MapR, Cloudera, and Hortonworks being some of the most used in the space), Hadoop is integrated with a number of tools to make it easier to manage, understand, and use data. This includes Apache Drill for querying large data sets, Apache Hive for ad-hoc queries and data summarization, and Cascading as a Java framework for building machine-learning and other data processing applications.
To manage this suite of tools most effectively, look for a provider who packages these different solutions and makes them easy to experiment with and test. Ask about any training materials or support available to make the most of these types of tools.
At Altitude Digital, we often experiment with newer tools like Spark (a new data processing engine) on a small set of data, and when it proves to be valuable, we’ll expand it to our platform more broadly.
Recognize the stakeholders. As the data side of the business begins to scale, you will have to invest in the platform and infrastructure. Thankfully, hardware has become a commodity and, as a result, is more affordable than ever before. The numerous cloud services too have provided an option for those who don’t need or want to invest in their own hardware. That doesn’t mean this piece of the business still isn’t a significant capital investment.
Understand the various stakeholders and their motivations when selling this investment internally. Convince the CFO that your big data platform will actually increase revenue and provide a high return on investment by providing early results and what you would expect from continued investment. Work with the CEO to provide statistics on how this investment will differentiate your business from competitors and make you stand out in the long run.
Invest in people. All the data in the world won’t help you if you don’t have someone who can understand and interpret it, and then use it to make better business investments. This can actually be one of the toughest parts of creating a big data strategy, as the talent you need is few and far between. In an ideal world, you’d hire someone that knows the market that your business is in and has a big data platforms expertise. That way they can apply market expertise to the problems you need to solve as you scale.
You can also hire someone who’s had experience with big data platforms in the past — but don’t underestimate the importance and necessity for them to learn the ins and outs of your market. This real-world understanding of the problems you are trying to solve will ensure you structure your platform in the ways where it will produce maximum value for your business.
Investing in data is no longer a choice—the right platform and strategies will give a competitive advantage to the companies that use it best.
*This was originally published on Forbes here.
Stay ahead of the bleeding edge...get the best of Big Data in your inbox.