6 min read
What is big data? There are various definitions, nearly all of which focus appropriately on the concept of “big data”, and not on the data itself, whose volume is undeniably quite BIG and thus not particularly informative as a defining characteristic! Most definitions of the big data concept, therefore, revolve around either: (a) the 3 V’s that characterize it (Volume, Velocity, and Variety); or (b) the staunch belief that big data simply refers to data that’s not the same as the data we previously collected. I have a better definition, which defines what big data really means to the world today. I will explain what that is, after examining the two choices above.
Big Data is Big
Using definition (a) above, which simply lists characteristics of big data (in a very restrictive manner by the way), we have violated the first rule of definitions that we all learned in grade school: defining “how something is different” is not the same as defining “what something is.” Example: What is a guépard? Answer: A guépard is the world’s fastest land mammal. But… what is it? Note that I have also contributed to the “3 V’s” mnemonic characterization of big data by introducing my own Top 10 list of the 10 V’s that characterize the main big data challenges – but, again, these are characteristics, not a definition.
Big Data is Unlike Previous Data
Using definition (b) from the opening paragraph, which is also restrictive, we end up again with another relative description (in this case, a negative comparison) – this is not an actual description or definition. Example: What is a wolverine? A wolverine is not a wolf. So… what is it?
A common extension to definition (b) states that big data refers to data that’s so big, so complex, and moving at such a high rate that it exceeds our existing resources for data acquisition, storage, processing, analysis, and interpretation. This is good, but again it is a comparative definition (relative to something else), not an actual definition. In fact, using this definition, one could easily argue that even the ancient Romans had big data! As a consequence of this mindset, there are many folks, especially in their online resumes, who conveniently claim to have done big data for decades! But I say: “Today’s Big Data is Not Yesterday’s Big Data!”
Big Data is Your Ticket to Data-Driven Decisions and Discovery
My current, best definition of big data, and the one that I prefer (not entirely because I created it, but mostly because I truly believe in it) is this: big data is everything, quantified and tracked_._ Let’s pick that apart:
All of these quantified and tracked data streams will enable smarter decisions, better products, deeper insights, greater knowledge, optimal solutions, customer-centric products, increased customer loyalty, more automated processes, more accurate predictive and prescriptive analytics, and better models of future behaviors and outcomes in business, government, security, science, healthcare, education, and more.
So, don’t be left out of the big data revolution because the terminology seems vague or daunting. Focus on your business goals, what you are trying to achieve, and big data’s three D2D’s (Data-to-Decisions, Data-to-Discovery, and Data-to-Dollars). You will then arrive at big data’s biggest meaning: big value and big ROI = Return on Innovation!
In conclusion, I was very impressed recently with the record growth of MapR. Score one for the new definition of big data: everything, quantified and tracked!
Stay ahead of the bleeding edge...get the best of Big Data in your inbox.