6 'data' buzzwords you need to understand

Fear not: The lingo needn't be mysterious. What's tricky is making it all pay off.

Take one major trend spanning the business and technology worlds, add countless vendors and consultants hoping to cash in, and what do you get? A whole lot of buzzwords with unclear definitions.

In the world of big data, the surrounding hype has spawned a brand-new lingo. Need a little clarity? Read on for a glossary of sorts highlighting some of the main data types you should understand.

1. Fast data

The shining star in this constellation of terms is "fast data," which is popping up with increasing frequency. It refers to "data whose utility is going to decline over time," said Tony Baer, a principal analyst at Ovum who says he coined the term back in 2012.

It's things like Twitter feeds and streaming data that need to be captured and analyzed in real time, enabling immediate decisions and responses. A capital markets trading firm may rely on it for conducting algorithmic or high-frequency trades.

"Fast data can refer to a few things: fast ingest, fast streaming, fast preparation, fast analytics, fast user response," said Nik Rouda, a senior analyst with Enterprise Strategy Group. It's "mostly marketing hype," but it "shows the need for performance in a variety of ways."

Increased bandwidth, commodity hardware, declining memory prices and real-time analytics have all contributed to the rise of fast data, Baer said.

2. Slow data

At the opposite end of the spectrum is "slow data," or data that might trickle in at a comparatively leisurely pace, warranting less-frequent analysis. Baer points to a device that monitors ocean tides as an example -- for most purposes, real-time updates aren't needed.

In general, this kind of data is better-suited for capture in a data lake and subsequent batch processing.

3. Small data

"Small data" is "anything that fits on one laptop," said Gregory Piatetsky-Shapiro, president of analytics consultancy KDnuggets.

Essentially, the term recognizes the fact that "a lot of analysis is still done on one or a few data sources, on a laptop, using lightweight apps -- sometimes even just Excel," Rouda said.

4. Medium data

As for "medium data," well, it's in between.

When you're talking about many petabytes of data, that's big data, and you'd likely use technologies such as Hadoop and MapReduce to analyze it, Baer said. But "most analytic problems don't involve petabytes," he added. When analyses involve data on a more intermediate scale, that's medium data, and you'd likely use Apache Spark.

5. Dark data

"Dark data" is typically data that gets overlooked and underused.

"People don’t know it’s there, don’t know how to access it, aren’t allowed access, or the systems haven’t been set up to leverage it yet," Rouda explained. It crops up "all too often" in databases, data warehouses and data lakes, he said.

Such restricted or poorly documented pools of data are often referred to as the "dark web." Bringing them to light is generally the domain of data-discovery services, often using machine-learning algorithms, Baer said.

6. Dirty data

Last but not least, "dirty data" is nowhere near as fun as it sounds. Rather, it's simply a data set before it gets cleaned up.

"A matter of nature is that things are dirty until you clean them," Baer said. "Unless you've performed some operation on it, data is not going to be clean."

Those operations can include preparation, enrichment and transformation, Rouda noted. "Otherwise a lot of wrong answers are possible."

One more thing...

Using data to grow your business is a lot more than just understanding the lingo.

"There's a gap between all the data that has become available and our ability to use it for insight," said Brian Hopkins, a vice president with Forrester.

Bridging that gap could be a matter of using Hadoop, or it could be accomplished through simple self-service tools, Hopkins said. Either way, it's the link that has to be made in order for meaningful action to result.

"Vendors and analysts are great at creating new buzzwords," he said. Rather than getting bogged down in terms, "my advice for CIOs is to stay laser-focused on outcomes that will transform your business."

Join the CMO newsletter!

Error: Please check your email address.
Show Comments

Supporting Association

Blog Posts

Top tips to uncovering consumer insights for business innovation

An in-depth understanding of consumers sits at the heart of what we all need to do, but we know it’s not always easy to uncover insights that will unlock a true innovation opportunity.

Matt Whale

Managing director, How To Impact

Is your customer experience program suffering bright shiny object syndrome?

You may have heard of ‘bright shiny object syndrome’. The term is used to describe new initiatives undertaken by organisations that either lack a strategic approach, or suffer from a failure to effectively implement.

Leveraging technology to stand out in the sea of sameness

The technology I'm talking about here is data and marketing automation. Current digital marketing methodology, much as it is practiced at Bluewolf, dictates the need for a strategy that does four things: Finds the right audience, uses the right channel, delivers the right content, and does all of that at the right time.

Eric Berridge

CEO and co-founder of Bluewolf, an IBM Company

Lead Management is very important part of the process. For anyone running Facebook Lead Ads I would recommend using this service.Get your...

Dirk Lo

How this fintech startup is improving content marketing and lead generation

Read more

I am agreeing with Mr. Tyron Hayes that a measured test-and-learn approach could be missing opportunities to not only better engage custo...

rush essay reviews

CMO interview: How Curtin University’s marketing chief is using test and learn to cope with complexity

Read more

Excellent!

Dr Sadasivan,US

Shakespeare shows data and creativity aren’t Montagues and Capulets

Read more

Great article! Agreed with all... Matthew Lerner, Deeps De Silva... When a company has a great product that solves customers needs, a gre...

James Tyler

Why marketers are embracing growth hacking techniques

Read more

Very good article, Social media analytics helps in problem identification. They can serve as an early warning system for negative custome...

BizVinu

Four ways to use social media to boost customer loyalty

Read more

Latest Podcast

More podcasts

Sign in