1/23/2012
11:56 AM
Doug Henschen

12 Hadoop Vendors To Watch In 2012

Promising low cost and unheard-of scalability, Hadoop has been called the next-generation platform for data processing. Check out the vendors taking Hadoop to the next level.
Hadoop has been called the next-generation platform for data processing because it offers low cost and the ultimate in scalability. But Hadoop is still immature and will need serious work by the community--including the 12 vendors described here--to turn this fledgling baby elephant into an industry colossus.

Hadoop is at the center of this decade's big data revolution. This Java-based framework is actually a collection of software and subprojects for distributed processing of huge volumes of data. The core approach is MapReduce, a technique used to boil down tens or even hundreds of terabytes of Internet clickstream data, log-file data, network traffic streams, or masses of text from social network feeds.
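To make the "boil down" idea concrete, here is a minimal sketch of the MapReduce pattern in plain Python, run on a single machine. It does not use Hadoop or any Hadoop API. The log format (a user ID followed by a URL) and the function names `map_phase`, `shuffle_phase`, `reduce_phase`, and `count_clicks` are assumptions made up for illustration. In a real Hadoop job, the map and reduce steps would run in parallel across many nodes and the framework would handle the grouping step in between.

```python
from collections import defaultdict


def map_phase(log_lines):
    """Map: emit a (url, 1) pair for every well-formed click record."""
    for line in log_lines:
        parts = line.split()
        if len(parts) >= 2:  # skip malformed records
            yield parts[1], 1


def shuffle_phase(pairs):
    """Shuffle: group all emitted values by their key."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(grouped):
    """Reduce: collapse each key's list of values into one total."""
    return {key: sum(values) for key, values in grouped.items()}


def count_clicks(log_lines):
    """Run the three phases in sequence over hypothetical clickstream lines."""
    return reduce_phase(shuffle_phase(map_phase(log_lines)))


if __name__ == "__main__":
    # Hypothetical sample data: "<user_id> <url>" per line.
    sample = ["u1 /home", "u2 /home", "u1 /pricing", "bad"]
    print(count_clicks(sample))
```

Because each map call looks at one record and each reduce call looks at one key, the work can be split across as many machines as there are records or keys, which is where the scalability comes from.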

Excitement has been building around Hadoop since its release as an Apache open source project in 2008, thanks to its combination of low cost, scalability, and flexibility to handle any data without building predefined schemas. Many people see in Hadoop the potential to usher in a whole new generation of data-processing capabilities, just as Structured Query Language (SQL) ushered in a revolution in data computing more than 30 years ago.

But Hadoop is immature and, in some ways, downright crude compared to SQL. Pioneers, most of whom started working on the framework at Internet giants such as Yahoo, have already put at least six years into developing Hadoop. Success, however, has brought mainstream demand for stability, robust administrative and management capabilities, and the kind of rich functionality available in the SQL world.

All eyes are now on Hadoop vendors, a fast-growing community, to deliver robust tools, capabilities, and innovations. Leading lights in that community include Cloudera and Amazon Web Services. Cloudera was the first and is now the largest source of Hadoop software with its CDH distribution and accompanying management software. It's also the largest provider of enterprise support and training for Hadoop. Amazon was an early mover in running Hadoop in a public cloud with its Amazon Elastic MapReduce service.

In 2011, MapR and Hortonworks, the latter a Yahoo spinoff, burst onto the scene with announcements about their own distributions of Hadoop software along with support, training services and, in MapR's case, proprietary twists aimed at delivering high performance. Competition is part of what it will take to improve Hadoop, so the availability of more distributions and of new support and training options should benefit everyone.

Data processing is one thing, but what most Hadoop users ultimately want to do is analyze the data. Enter Hadoop-specialized data access, business intelligence, and analytics vendors such as Datameer, Hadapt, and Karmasphere.

The clearest sign that Hadoop is headed mainstream is the fact that it was embraced by five major database and data management vendors in 2011, with EMC, IBM, Informatica, Microsoft, and Oracle all throwing their hats into the Hadoop ring. IBM and EMC released their own distributions last year, the latter in partnership with MapR. Microsoft and Oracle have partnered with Hortonworks and Cloudera, respectively. Both EMC and Oracle have delivered purpose-built appliances that are ready to run Hadoop. Informatica has extended its data-integration platform to support Hadoop, and it's also bringing its parsing and data-transformation code directly into the environment. Read on to learn more about what these influential vendors are doing with Hadoop.
