Big Data. Big Decisions
InformationWeek
Special Coverage Series


Big Data Myths Persist

Big data hype has reached a level where some customers are actually avoiding labeling projects as big data, Tibco executives say.

13 Big Data Vendors To Watch In 2013
13 Big Data Vendors To Watch In 2013
(click image for larger view and for slideshow)

The rush to put a "big data" label on every new use of data risks diluting the meaning of the phrase and, worse, may lead some organizations down a disappointing road, say two seasoned IT executives.

More Insights

Webcasts

More >>

White Papers

More >>

Reports

More >>

"I noticed our customers doing what a lot of people would consider to be 'big data' will often say, "We don't call it big data,'" Chris Taylor of Tibco Software told InformationWeek in a phone interview. Taylor leads product marketing for business event and in-memory computing products at Tibco.

These customers didn't want their work associated with "needle in a haystack" efforts, and felt such a description was far too limiting, he said.

Also on the call was Taylor's Tibco colleague, company CTO Matt Quinn. A 14-year Tibco veteran, Quinn said it was also important to put big data in a historical context. He said this has been a 20- or 30-year trend of businesses picking out some data, normalizing it and using it to make decisions.

[ Taylor and Quinn will do a deep dive into this topic during Big Data Myths and Real Decisions, at Interop Las Vegas, May 6-10. Use Priority Code DIPR02 by March 2 to save up to $500. ]

"It's a facsimile of a facsimile of a facsimile of what's actually going on out there in the real world," he said. "The challenge people are having at the moment is that business, like life, doesn't fit in nice, clean boxes." Trying to see what's really going on, in all its variety and messiness, "is proving to be a very complex task," he said.

Along with wanting to dispel the idea that data started growing recently, and not over decades, another myth Quinn wants to explode is the well-worn notion of big data's so-called three Vs: volume, velocity and variety.

"I feel a little heretical when I say it, but there's always been variety and a relative amount of volume," he said. "What's happened now is people have started to develop tools to manage and mine that data."

Another myth is that Hadoop is somehow synonymous with big data, Taylor said. "It turns big data into a very batch and offline process," he said, adding this is a mistake because it misses the possibility of managing data "in a real-time scenario."

Another myth: The prevailing idea that these data questions can't be answered without a data scientist on the scene, and that their small number will be a huge constraint on businesses wanting to conduct big data projects.

"I tend to believe this is a misleading discussion," Taylor said. Analytical tools will become more powerful, putting these abilities "into the hands of regular people," he said.

Quinn went further, suggesting a data scientist who lacks business context may actually cause more harm than good.

"This has parallels to the banking crisis," he said. "You had a bunch of quants ... who made some incredibly poor decisions because they didn't have the context."

Rather than becoming enamored of the enabling technologies for big data, Quinn suggested business leaders should first sit back and ask, "What's the No. 1 question that you've wanted to have answered but you were always told it was impossible? Start from there. Don't start with what data you have, start with the important question."

Just as important as finding answers is making them actionable and "operationalizing" them, he added.

Quinn does allow that one category of data, machine-generated data, is exploding. Up until now, companies have typically analyzed just a tiny fraction of any activity.

For example, transactions captured into a database on a retailer's website may represent only 2% of all the server's activity, Quinn said. The rest goes untapped or may be discarded. Now it is possible to analyze this activity, "which may have more value than the transactions" for retailers trying to better model the behavior of visitors, he said.

Not an overhyped myth, according to Quinn, is the impact big data initiatives are having on company organizational structures. "For all the warts and all the good stuff, the lingering success of big data is ultimately going to be measured not in how successful the technology was, but it's going to be how successful the organizational transformation was."

Just like service-oriented architecture before it, Quinn said, the big data revolution is forcing organizational change, including breaking down barriers and silos, and prompting fundamental questions about ownership and governance.

As large healthcare providers test the limits, many smaller groups question the value. Also in the new, all-digital Big Data Analytics issue of InformationWeek Healthcare: Ask these six questions about natural language processing before you buy. (Free with registration.)



Related Reading




Currently we allow the following HTML tags in comments:

Single tags

These tags can be used alone and don't need an ending tag.

<br> Defines a single line break

<hr> Defines a horizontal line

Matching tags

These require an ending tag - e.g. <i>italic text</i>

<a> Defines an anchor

<b> Defines bold text

<big> Defines big text

<blockquote> Defines a long quotation

<caption> Defines a table caption

<cite> Defines a citation

<code> Defines computer code text

<em> Defines emphasized text

<fieldset> Defines a border around elements in a form

<h1> This is heading 1

<h2> This is heading 2

<h3> This is heading 3

<h4> This is heading 4

<h5> This is heading 5

<h6> This is heading 6

<i> Defines italic text

<p> Defines a paragraph

<pre> Defines preformatted text

<q> Defines a short quotation

<samp> Defines sample computer code text

<small> Defines small text

<span> Defines a section in a document

<s> Defines strikethrough text

<strike> Defines strikethrough text

<strong> Defines strong text

<sub> Defines subscripted text

<sup> Defines superscripted text

<u> Defines underlined text

BYTE encourages readers to engage in spirited, healthy debate, including taking us to task. However, BYTE moderates all comments posted to our site, and reserves the right to modify or remove any content that it determines to be derogatory, offensive, inflammatory, vulgar, irrelevant/off-topic, racist or obvious marketing/SPAM. BYTE further reserves the right to disable the profile of any commenter participating in said activities.

Disqus Tips To upload an avatar photo, first complete your Disqus profile. | View the list of supported HTML tags you can use to style comments. | Please read our commenting policy.

Follow InformationWeek

By The Numbers

What Are Your Primary Concerns About Using Big Data Software?

Base: 417 respondents at organizations using or planning to deploy data analytics, BI or statistical analysis software
Data: InformationWeek 2013 Analytics, Business Intelligence and Information Management Survey of 541 business technology professionals, October 2012

What Do You Think?

What's your attitude about SQL analysis on top of Hadoop?
We want fast, standard SQL analysis capabilities on Hadoop ASAP
Hadoop is for unstructured data; SQL is for relational databases
We'll give SQL on Hadoop a try, but relational DBs will remain the mainstay
Given strong SQL support on Hadoop, we'd nix the data warehouse
We're not interested in Hadoop
No opinion



Related Content

From Our Sponsor

Five Big Data Challenges and How to Overcome Them with Visual Analytics

Five Big Data Challenges and How to Overcome Them with Visual Analytics

Business leaders often need a visual snapshot of data to quickly grasp and use it. This paper identifies five challenges in presenting data and how visual analytics can resolve them. Solutions are suggested to overcome the challenges of: speed, data clarity, data quality, displaying meaningful results, and dealing with outliers.

Game-Changing Analytics: How IT Executives Can Use Analytics to Create Innovation and Business Success

Game-Changing Analytics: How IT Executives Can Use Analytics to Create Innovation and Business Success

Today's competitive advantage requires a deeper understanding of your business, your market and your customers. As an IT executive, you can drive that knowledge transformation. In this white paper, learn how to make decisions as a strategic business leader and three steps to begin an analytics initiative within your enterprise.

Data Visualization Techniques: From Basics to Big Data with SAS Visual Analytics

Data Visualization Techniques: From Basics to Big Data with SAS Visual Analytics

High-performance data visualization turns sophisticated analyses into meaningful graphics, leading to faster and smarter decision making. In this white paper, learn how visual analytics can transform big data, with additional features such as real-time functionality, mobile compatibility, robust applications for technical groups and accessibility for nontechnical users.

Big Data: Lessons from the Leaders

Big Data: Lessons from the Leaders

Financial performance, competitive advantage, operational efficiency, strategic decision making - every business goal can extract value from big data, and the time for doubt or inaction has long passed. In this Economist Intelligence Unit report, in-depth interviews with data pioneers reveal the link between the effective use of big data and the bottom line among other results.

Decision-Driven Data Management: A Strategy for Better Decisions with Better Data

Decision-Driven Data Management: A Strategy for Better Decisions with Better Data

Which came first, the data or the decision? This white paper makes the case for having a decision in mind, then tailoring big data's volume, variety and velocity to achieve business results such as overcoming customer dissatisfaction or creating well-informed strategies in real time.

Informationweek Reports

Research: The Big Data Management Challenge

Research: The Big Data Management Challenge

The challenge of big data is real, but most organizations don't differentiate 'big data' from traditional data, and nearly 90% of respondents to our survey use conventional databases as the primary means of handling data. We'll help you understand what constitutes big data (it's not just size) and the numerous management challenges it poses.