Forrester's analysis and the weightings of the rankings are detailed in a scorecard on 15 criteria. The emphasis on considering more than just Hadoop made sense to Mike Maxey, senior director of product management EMC's Greenplum division, but he questioned EMC's low software packaging score given that it two Hadoop distributions: Greenplum HD based entirely on open source Apache Hadoop software and Greenplum MR based on MapR's partially proprietary distribution.
EMC was also surprised that two services-heavy vendors led the rankings, Maxey said. "Amazon doesn't even offer a [Hadoop] distribution... and IBM tends to push for a full, professional-services-led engagement," Maxey told InformationWeek. "There is value in services, but we find that many enterprises don't want to have all their data locked into a services engagement."
Kirk Dunn, Cloudera's Chief Operating Officer, said Forrester's strategy scoring, which counted for 50% of the vendor rankings, seemed to discount the importance of the vendor's Cloudera Manager admin and management software, which supports system deployment, ongoing monitoring, job and configuration management, and system optimization.
[ Want more background on how your peers are using Hadoop? Read Hadoop Spurs Big Data Revolution. ]
"Nobody has a product anything like [Cloudera Manager]," Dunn said. "We think that's highly strategic and one of the most important things that organizations are looking for when they try to make dependable, predictable use of Hadoop."
Dunn also quibbled with Forrester's placement of competitor MapR above Cloudera on the vertical "current offering" axis, questioning MapR's security capabilities and the fit and portability of the vendor's proprietary components in the "democratic environment" of open source Apache Hadoop.
A third surprise in the Forrester Wave report was the absence of Microsoft, Oracle, and Teradata, a deficiency noted in numerous Twitter comments since the report was released. Two of those absences are easily explained, as both Microsoft and Oracle failed to meet Forrester's August-September 2011 research deadline. Both companies announced their plans for Hadoop in early October.
Oracle recently partnered with Cloudera to bundle its Hadoop software distribution with its Big Data Appliance. Microsoft has introduced a cloud-based Hadoop service on its Azure platform, but release dates have not been set for the Windows-compatible Hadoop software distribution in plans to release in partnership with Hortonworks.
Teradata supports MapReduce processing by way of last year's acquisition of Aster Data. But that vendor's proprietary SQL-MapReduce approach is built on the vendor's relational database and did not fit Kobielus' definition because the mapping and reducing functions aren't performed on the Apache Hadoop code base.
Though they took exception to a few of the finer points of the ranking, both Dunn of Cloudera and Maxey of EMC gave Kobielus credit for his analysis of a very fragmented vendor set. "It's a snaphot in time of a fast-moving market, but [the report] has definitely created a fertile discussion," Dunn said. He predicted the report will lead to clearer vendor segmentations and much-needed definition around the term "big data."
Maxey of EMC advised researchers to look beyond the report's Wave ranking chart. "There's a lot of good information in there, so people should make a point of reading the entire report," he said.