Software // Information Management
News
6/4/2013
02:42 PM
Connect Directly
LinkedIn
Twitter
Google+
RSS
E-Mail
50%
50%

Cloudera Declares End Of Data Warehousing Era

Cloudera CEO Mike Olson urges companies to reconsider their data-management approach as the "center of gravity" shifts toward Hadoop.

5 Big Wishes For Big Data Deployments
5 Big Wishes For Big Data Deployments
(click image for larger view and for slideshow)
Cloudera CEO Mike Olson took the stage Tuesday at the Cloudera Forum in San Francisco to extol companies to "unaccept the status quo" of enterprise data warehousing (EDW).

"The place that enterprises will store their data is shifting toward Hadoop," Olson told InformationWeek last week in a preview of the speech. "We're seeing customers not replace, but, rather, rationalize their enterprise data warehouse investments by adding Hadoop alongside."

Olson also announced a new Cloudera Search capability built on Apache Solr, but the larger purpose of the presentation was to question assumptions about enterprise data management. EDWs are "increasingly costly and difficult to maintain," Olson said, because the volume and variety of data now encountered is "totally out of whack" with what relational data warehouses were designed to handle in the 1990s.

EDW costs are upward of $20,000 per terabyte, versus $1,000 to $2,000 per terabyte for Hadoop clusters, including hardware, according to Olson. Thus the time is right, he said, to reconsider where most data is stored, transformed, cleaned, prepared and interactively queried.

[ Want more on changes in data management? Read Big Data Debate: Will Hadoop Become Dominant Platform? ]

"All of those workloads are sucking cycles away from the stuff that the EDW platform does very well," Olson said, citing high-powered analytics and cube-based analyses as the key roles that EDWs will continue to handle.

The announcement of Cloudera Search expands the list of Hadoop platform capabilities. Tapping the open-source Apache Solr search engine, Cloudera said it will support natural language keyword searches and faceted navigation of data stored in the Hadoop Distributed File System (HDFS) and Apache HBase. The tool runs on the Hadoop cluster and will be useful for exploring data and finding subsets of information that might be targeted for large-scale MapReduce processing, Olson said.

"When you have petabytes of data, folders don't work anymore, and we've all learned from Google that, when you need to find some bit of information, you just go search for it," he said. "Anybody can use this and you don't need to define ontologies or taxonomies to set it up."

Cloudera Search has been in private beta for several months, with Monsanto cited as a company using the software to support a high-scale search application. The software will be distributed as part of Cloudera's Hadoop distribution, but management capabilities for search will be an add-on offering that's part of the vendor's commercial Cloudera Manager software.

Cloudera competitor MapR recently announced its own answer to search-on-Hadoop, also based on Solr, but Olson discounted it as "an announcement not supported by shipping code." Cloudera Search is now available for download as part of a public beta test that's expected to last three months.

As for Cloudera's premise of offloading ETL and basic BI workloads and data volumes from more expensive EDWs onto Hadoop, the idea is not shocking or new. The strategy of turning Hadoop into the enterprise data hub has been articulated by outspoken practitioners such as Phil Shelley, CTO at Sears Holdings.

The topic also has been openly debated by vendors, with leading database suppliers such as Teradata and IBM shrugging off Hadoop as just one more arrow in the data-management quiver that enterprises will need to address big-data opportunities.

With the costs of Hadoop what they are and the scale of data growing exponentially, there's little doubt that Hadoop's popularity will grow. Time will tell just how soon and to what extent it will displace EDWs from accustomed roles such as transforming and storing the bulk of historical data and supporting the basics of BI and reporting.

E2 is the only event of its kind, bringing together business and technology leaders across IT, marketing, and other lines of business looking for new ways to evolve their enterprise applications strategy and transform their organizations to achieve business value. Join us June 17-19 for three days of 40+ conference sessions and workshops across eight tracks and discover the latest insights in enterprise social software, big data and analytics, mobility, cloud, SaaS and APIs, UI/UX and more. Register for E2 Conference Boston today and save $200 off Full Event Passes, $100 off Conference, or get a FREE Keynote + Expo Pass!

Comment  | 
Print  | 
More Insights
Comments
Newest First  |  Oldest First  |  Threaded View
D. Henschen
50%
50%
D. Henschen,
User Rank: Author
6/5/2013 | 3:01:08 PM
re: Cloudera Declares End Of Data Warehousing Era
The point is NOT that data warehouses are going away. The point is that you can rationalize the spend while taking advantage of more data, gaining new insights.
The Agile Archive
The Agile Archive
When it comes to managing data, don’t look at backup and archiving systems as burdens and cost centers. A well-designed archive can enhance data protection and restores, ease search and e-discovery efforts, and save money by intelligently moving data from expensive primary storage systems.
Register for InformationWeek Newsletters
White Papers
Current Issue
InformationWeek Tech Digest - July10, 2014
When selecting servers to support analytics, consider data center capacity, storage, and computational intensity.
Flash Poll
Video
Slideshows
Twitter Feed
InformationWeek Radio
Archived InformationWeek Radio
Join InformationWeek’s Lorna Garey and Mike Healey, president of Yeoman Technology Group, an engineering and research firm focused on maximizing technology investments, to discuss the right way to go digital.
Live Streaming Video
Everything You've Been Told About Mobility Is Wrong
Attend this video symposium with Sean Wisdom, Global Director of Mobility Solutions, and learn about how you can harness powerful new products to mobilize your business potential.