Don't Confuse Big Data With Storage - InformationWeek

InformationWeek is part of the Informa Tech Division of Informa PLC

This site is operated by a business or businesses owned by Informa PLC and all copyright resides with them.Informa PLC's registered office is 5 Howick Place, London SW1P 1WG. Registered in England and Wales. Number 8860726.

Data Management // Big Data Analytics
09:08 AM
Connect Directly

Don't Confuse Big Data With Storage

A large part of big data management is knowing what data to analyze, what to back up and what to dump, says disaster recovery expert.

Big Data's Surprising Uses: From Lady Gaga To CIA
Big Data's Surprising Uses: From Lady Gaga To CIA
(click image for larger view and for slideshow)
How much big data should your organization save? And how much should you back up?

Big data plays an important role in today's business world, but it's not up there with mission-critical applications that are essential to an organization's day-to-day operations. That's according to Michael de la Torre, VP of product management for SunGard Availability Services, an IT services company that provides, among other things, disaster recovery services.

Always remember that not all data has equal value, de la Torre advised. And in most cases, big data is just another business application.

"For most companies it's more of a business-critical app," de la Torre told InformationWeek in a phone interview. "It really doesn't need to be up all of the time -- but you're going to lose business opportunities if it isn't up and running."

This isn't to say, however, that big data doesn't matter. On the contrary, de la Torre sees big data as the next generation of business analytics.

"You have all this non-structured or minimally structured data. There's a lot of it. And it's coming from different sources that you would typically think are outside of the business warehouse," he said. "As such, you need new tools and techniques to get value out of that data."

And part of that decision-making process is figuring out what information needs to saved, and what is expendable.

"Don't just save everything to save everything. That makes very little sense," said de la Torre.

[ Confused by the buzz around data analytics and visualization? See 5 Data Sources For Visualization Beginners. ]

For instance, social media streams -- a classic big data example of high volume, velocity and variety--don't necessarily need to be hoarded for eternity. But other forms of big data may provide great value many years down the line.

"When you think about social (media), so much of the value of that data is that it's very time-dependent. It's very volatile, and it loses its value almost immediately," de la Torre said. "Other data such as weather, where you're doing long-term correlations, will potentially remain viable for years."

OK, so all big data isn't created equal. But what's worth saving?

One solution is to store summary data from a particular time period or event, along with a small amount of anecdotal information. That's better than "saving a million logs," de la Torre advised. "Do you need the summary, or do you need all the detail?"

Obviously, the summary data method is more cost effective and easier to manage than the save-everything approach. It also works with sensor-generated information, a big data category that includes data from field equipment in remote locations.

"Manufacturing companies figured this out a long time ago. You don't store the data from spinning equipment," said de la Torre. "You don't want to pay for the bandwidth costs. You don't need all that data."

The solution: "Put an expert system in place. And ultimately that's what big data is: an expert system that makes meaning out of data," he added.

Like many data professionals, de la Torre believes the term "big data" is mostly marketing hype. "It's advanced business analytics using new sources of data," he noted. "And somebody said, 'Hey, let's call it big data.'"

While it may not be a mission-critical app, big data can provide a lot of value to organizations. For instance, it can help companies find "interesting ways to use their proprietary data, and to create business opportunities from it," said de la Torre.

For more of de la Torre's insights on big data backup, read his February 2013 Data Informed article, "Six Points to Consider for Disaster Recovery and Big Data."

This Interop webcast, Data Centers Then And Now, will explore how the requirements are changing, have changed, current data center trends, and what needs to change moving forward to meet future business needs. It happens April 18. (Free registration required.)

We welcome your comments on this topic on our social media channels, or [contact us directly] with questions about the site.
Comment  | 
Print  | 
More Insights
What Becomes of CFOs During Digital Transformation?
Joao-Pierre S. Ruth, Senior Writer,  2/4/2020
Fighting the Coronavirus with Analytics and GIS
Jessica Davis, Senior Editor, Enterprise Apps,  2/3/2020
IT Careers: 10 Job Skills in High Demand This Year
Cynthia Harvey, Freelance Journalist, InformationWeek,  2/3/2020
White Papers
Register for InformationWeek Newsletters
Current Issue
IT 2020: A Look Ahead
Are you ready for the critical changes that will occur in 2020? We've compiled editor insights from the best of our network (Dark Reading, Data Center Knowledge, InformationWeek, ITPro Today and Network Computing) to deliver to you a look at the trends, technologies, and threats that are emerging in the coming year. Download it today!
Flash Poll