Big v. Small Data – Why Marketers Today Need to Know the Difference

108 Shares 108 Shares ×

More cool facts about social media

This is a guest post by Erin Robbins O’Brien.

What is “big data” and why should it matter to marketers?

I recently posited the question, “Do marketers know what ‘big data’ is?” at a networking event for marketers, and the response I got was unexpected: “Do they really need to?”

I was taken aback, likening the response to questioning whether or not we should include social media strategies in marketing campaigns today.  Marketers today, whether they are aware of it or not, constantly work with data so of course they should know what “big data” is and how it differs from “small data.”

Then I realized that, until recently, there was no WAY for marketers to understand big data. The shift of data availability beyond statisticians and IT managers is a recent development.

The surge in conversation about “big data” is the result of a sequence of events so let’s talk about how “big data” came about.

Up until the invention of the PC, data was a cruel mistress, demanding hand-written calculations, myriad sheets of paper, and a lot of patience to analyze.

The rise of the PC was a critical first step in giving data a home and a way to sort through thousands of items. Users can now access vast amounts of data easily (via cloud computing, computational capabilities, etc.).

As a result of the ability to compute large quantities of information and display the output graphically, the number of channels we use to market our products and services has grown causing an increase in the amount of data we can collect.

Finally, based on availability and technological capability to manipulate and illustrate our data, the business intelligence community has risen along with products to make it easier.

So why does any of this matter to marketers? Data today refers to a set of three characteristics: condition, location, and population.

Condition

This refers to the data’s readiness for use. For instance, if you have a list of email addresses that have been confirmed through a Captcha validation system and have opted in to receiving communication from you, then that data is likely ready to use (often called “clean”).

Clean data is small data. Contrast that with a list of purchased email addresses that must be validated as correct, relevant to your organization, and willing to receive messages from you. This data is not well conditioned and requires time and cost to clean, making it “big data.”

Location

This refers to where the data originates and its compatibility with a usable format.

Using our email example, data that lives in an email distribution client such as Marketo or MailChimp has a single location and is compatible with the format that it needs to be sent from, making it “small data.”

Data that requires merging from multiple sources in a variety of formats or with differing variables is “big data.”

Population

This refers to the individuals that have qualities in common to the need in consideration – in this instance, your email list.

A “small data” set includes a known population that is not expected to have changes to its composition in the short term, which allows marketers to use this data to answer a specific question or need.

For example, when looking to market a product, consider a list of users who recently purchased a similar item from you.

Since they purchased a similar item it is safe to say that emailing them is fine. Conversely, “big data” would represent your large purchased email list with unknowns, possible duplications, and unsubscribes. This list cannot be used for targeted email marketing sends in its current form (at least not by a good marketer).

So why should marketers be concerned with data size? Here are a few reasons:

Bad Data = Bad Marketing

You need to know when someone is giving you bad data. Data can be (and is) manipulated to say what people need it to say.

This can be a powerful tool if you are the one controlling the data, but receiving large data sets or outcomes from data can be daunting if you trust others to provide you with accurate information to do your job.

This is not to say that people always willingly provide incorrect data or outcomes – often times they may not know themselves that it is inaccurate. This leads to number two.

Maslow’s Hammer

“If you have a hammer in hand, you eventually start to see a nail.” This means that if you are looking for something, you naturally increase your odds in finding it.

Data will always have structures or characteristics that naturally group items together. When working with “big data” it’s important to understand that these groupings may not be the result of anything other than chance, and trends should be determined by testing.

For example, an email campaign may appear to be opened more frequently in one geographic location than another. This could cause a decision to market more heavily in that area, when the grouping of email opens could have, in fact, been random.

Aggregation is Power

Having access to the data your organization gathers means you can unearth powerful information about your users, potential customers, and own capabilities.

What you may discover is a ton of data that needs a better way to be analyzed for your needs, or that you need to be capturing more data from particular sources to round out your set. Either way, knowledge is power so get your data and get going!

Beyond the reasons mentioned above, I know that investing in big data is expensive, and analyzing any data has a cost – whether it’s via purchased tools or expended man-hours.

The pros of big data are many, but one of the most important for marketing professionals is big data’s ability to enable you to discover new, relevant patterns that may not be noticed when using a small data sample.

Much of the data collected by marketers does involve an understanding of evolving behavioral patterns, making small data outdated quickly. Investing in big data and a way to manage it can have a great impact on your organization.

The most important thing to remember is that whether your data is big or small, it’s the correct analysis that counts.

Erin Robbins O’BrienAbout the author: Erin Robbins O’Brien is the Director of Business Intelligence at Viralheat. She spends her days sifting through data and making it relevant to product development and marketing. She can be found online as @TexasGirlErin and at erin at viralheat dot com.

Join over 11,000 smart subscribers

Get my latest updates delivered straight to your Inbox as soon as they're published (I respect your privacy and will never spam you)

108 Shares Twitter 61 Facebook 4 Google+ 6 Buffer 10 Buffer LinkedIn 27 Email -- Email to a friend 108 Shares ×
About Danny

Danny Brown is Chief Technologist at ArCompany and an award-winning marketer and blogger. His blog is recognized as the #1 marketing blog in the world by HubSpot. Danny is also co-author of Influence Marketing: How to Create, Manage and Measure Brand Influencers in Social Media Marketing.

15 comments
Sort: Newest | Oldest
John Swain
John Swain

Erin,  

I have to say that I agree with the marketeers; why should they know any more about Big Data than IP addressing schemes or transistor densities? I totally agree with you that they should be aware of the issues you then go on to talk about, but I am not sure it is really about Big Data as opposed to big data or difficult data.

Big Data is usually defined by the 3Vs (Volume, Variety, Velocity) with sometimes others thrown in (Veracity, Value, Virtual...). What is is for sure is Big Data is data that currently exceeds the capability available to deal with it using conventional tools and hardware. For some organisations that might be 100 of millions of rows for others billions.

Unfortunately I think the term Big Data has become so widely used to mean other things it is losing any sense of currency.

Big Data is not new, just the term. This is my favorite history lesson on the subject: http://www.youtube.com/watch?v=eC6nw229CEo&feature=youtu.be

Which is kind of a test of whether marketers should be interested - I can't see any being interested in this!

js

Bhaskar Sarma
Bhaskar Sarma

Erin , this was fascinating. I am by no means any sort of expert on Big Data but one of my earliest clients who needed work done their website worked with stuff like Hadoop. In the course of research, I came to know a fair bit about Big Data, and the challenges associated with it. One of the things that I still remember was about structured and unstructured data and how the majority of big data out there is actually unstructured From the POV of a marketer, making sense of unstructured data like tweets, forum posts or customer chat logs is probably one of the biggest challenges right now. Sure, there are things like sentiment analysis, but in your opinion just how mature these tools and features have become in providing a bird's eye view of the entire picture?

Meagan Dahl
Meagan Dahl

Great, great post and one that I think marketers need to pay close attention to. I have the luxury of working in a small-ish digital strategy agency where I work in close proximity to our search department (about 20 ft away). We are in a constant state of collaboration over keywords and content, metrics, tracking, et al. Although our agency believes in a blended approach, often our clients don't understand the need to throw money at big data, they don't see it as a tangible investment. Trying to prove the merits of big data to our clients has been the biggest challenge, and any insights on how to do so are much appreciated. Infographs perhaps?

Erin Robbins O'Brien
Erin Robbins O'Brien

Hi Meagan, First off, Buzzshift looks awesome, like the kind of thing I would join if I ever go back to agency life. :) In terms of proving the merits of "big data" (or any data in general) to clients in order to move them to invest it... that's a recurring issue I hear about. One of the things I have done with my own investment into data is to invest some of my own time up front with free tools (yeah, I mean spreadsheets and good old fashioned manual labor) to prove the types of insights that can be gleaned from data and how those insights can be leveraged for smarter marketing. Maybe you uncover a new audience segment, a cross-pollination between two channels, or a messaging strategy that worked particularly well in one location but not another, etc. From there, I usually show how long it took to garner that information by hand and how much ROI that insight could have. The natural progression is (hopefully) that you say, "Imagine if I had a tool that could help make hundreds of these insights possible every week/month and how our ROI would grow exponentially." Once you get one case study or example like that for a client then you can typically leverage it (maybe throw some dummy data in there for privacy measures) across your organization. Not sure if this helps, but if you want to chat you can always ping me at Erin at Viralheat dot com or @texasgirlerin

Meagan Dahl
Meagan Dahl

Thanks, it's a pretty cool place to hang! Spreadsheets, yes. Also I think case studies and client-specific examples are the best solution to the big data messaging problem, your suggestion totally helped. I talked to our head of search and he said he was thinking about doing it for a large client who is finally, finally considering the benefits of key word research on their website. Showing them an example SEO'd page on the site with accompanying analytics spike will be a pretty good argument for implementation across their properties. Side-note, fellow texas girl here :) @hiuhime

Hugh Anderson
Hugh Anderson

Hi Erin & Danny, I agree with you that it is very important to the future of marketing for the reasons you suggest: many more important digital channels, plus advances in technology. I have 2 suppositions. Firstly, that CMOs and their teams will only really get to grips with it when the technology solution providers (presumably including Viralheat) translate the big data sets into meaningful, readily consumable outputs for them; and secondly, that the next few years will see some big advances in what it is possible to derive from big data by using more sophisticated techniques such as network science. So the marketers can focus on the marketing and the tech providers can spend the millions on development. Too simplistic, but hopefully you get my general drift.

Erin Robbins O'Brien
Erin Robbins O'Brien

I hear ya Hugh! There's a lot at work when making data "compatible and consumable" across the myriad types of marketing, community and branding opportunities that exist now, but that's what keeps me employed. ;) Getting beyond raw numbers to actual insights and delivering that to the right people is top of mind and what keeps me coming back for more every day.

Barrett Rossie
Barrett Rossie

Erin, Danny: I'm starting to see the term "big data" pop up everywhere suddenly. There seems to be pros and cons associated with it concerning privacy. Is this an accurate observation? If so, can either of you provide a quick overview of both sides of the privacy issue?

Erin Robbins O'Brien
Erin Robbins O'Brien

Hi Barrett! Thanks for the question, one I am getting more frequently. :) I think that "big data" has been lumped in with some other issues and may be getting a bad rap. When it comes to data, bigness refers to the number of variables, the "cleanliness" and the accessibility of the data, and less to a "Big Brother" type collection of intel about the unknowing public. In terms of privacy, much of what we're talking about with regards to marketers using big data is information consumers have opted into sharing or is readily available on channels that are publicly accessible. The data itself is (in the majority of cases) collected from a variety of sources and where many of the privacy issues I hear about come from is the aggregation of the data into a single place then analyzed to make inferences about consumer preference. My hope is that companies will continue to find ways to offer consumers choices about what information they want made public, where and how they want to be marketed to, and who it is ok to share their preferences with - so that they feel safe and companies feel they are best servicing their needs. At least, that's what I'm trying to do... ;) As always, ping me at Erin at Viralheat dot com or @texasgirlerin if you would like to discuss.

Danny
Danny

Hey there Erin, Absolutely fantastic post, and thanks so much for sharing your insights here. One of the biggest questions we face is manpower - with the amount of data that's out there, is it even realistic to think we can harness it without allocating millions of dollars and man hours to the task? What would your opinion be on this and how do different-sized companies overcome this hurdle?

Erin Robbins O'Brien
Erin Robbins O'Brien

Hi Danny, Such a big topic... the issue of data, how much of it there is, what to do with it and how to use it. Manpower and budget are often perceived barriers to making data work for organizations, especially organizations that don't have departments dedicated to dealing with it! I think that when it comes to data, doing something is better than doing nothing and there are many ways to get insights and impact out of your marketing efforts without spending a lot of time and money. Some go-to's for me are things like Google Analytics, Viralheat, my email marketing tool (currently Marketo, but MailChimp is a great less expensive option), and KissMetrics. These at least allow you to monitor and compare the efficacy of different campaigns, target audiences, types of marketing, etc. The important thing to remember here is that each of these is measuring a different type of interaction, often on a different channel. What's the value of a Tweet, a Facebook post, an email response or a website form fill to your company? That's where your team's qualification comes in. Always happy to give insight or help out with additional questions. Erin at Viralheat dot com or @texasgirlerin

RCONNORIII
RCONNORIII

This one is way out of my league Danny. Have a great day on purpose!

Robert Dempsey
Robert Dempsey

Great post Erin. As marketers, we need more data, not less. And I mean data of every size. Being a geek, I've learned a bit of the R programming language, am digging into Wolfram Mathematica, am learning SOFA statistics, and we've created our own in-house tools for natural language analysis and statistics. My question for you is this - are there any "easy to use" data analysis tools for marketers that produce actionable insights, and don't require a PhD in Mathematics or Statistics to use?

Erin Robbins O'Brien
Erin Robbins O'Brien

Hi Robert, Glad you enjoyed the post. Per your question, I think that "ease is in the eye of the beholder" when it comes to analytics. Although I get what you're asking. ;) In terms of an analysis tool that doesn't require an advanced stats degree - there are tons of ways to look at data that don't require anything more than a spreadsheet and the perseverance to set it up. Moving beyond simple tools and channel-specific data providers (which often don't allow you to compare efforts across various mediums) I would say a lot of it is dependent on what you're trying to measure. Is it marketing outcomes, usability, sales leads, product efficacy, some combination of all those? Just typing this response, I realize I might have found the fodder for my next post! I love chatting about this stuff so if you'd like to have a deeper discussion, ping me at erin at viralheat dot com. I'm around!


108 Shares Twitter 61 Facebook 4 Google+ 6 Buffer 10 Buffer LinkedIn 27 Email -- Email to a friend 108 Shares ×