Software // Information Management
Commentary
5/10/2011
03:53 PM
Doug Henschen
Doug Henschen
Commentary
Connect Directly
LinkedIn
Twitter
Google+
RSS
E-Mail
50%
50%

EMC's Hadoop Move Points To Analysis Arms Race

Planned Greenplum appliance will bridge structured and unstructured data, and it's easy to see the industry's top vendors will follow with their own all-purpose analytic platforms.

The advantage of the Greenplum HD Data Computing Appliance will be its ability to run a relational database and Hadoop on a single appliance. That will not only save money by eliminating the need for separate hardware platforms, it will also speed insight and minimize storage demands by streaming Hadoop analyses directly into the Greenplum database. In this approach, data doesn't have to be created and stored in one environment and then copied and moved into another.

"One team can handle both environments, and the streaming capabilities will drastically reduce the amount of duplicate data," Lonergan said.

EMC's Enterprise Edition software features a proprietary replacement of the Hadoop Distributed File System (HDFS). Developed by EMC partner MapR, EMC says this HDFS alternative delivers two- to five-times faster performance than the standard HDFS.

The MapR file system also provides capabilities not supported in standard HDFS, including snapshots and replication for backup and recovery, simple data loading and access via a network file system, automatic failure detection and notification for improved reliability, and multi-site management features.

Blending Analyses

As noted, EMC's planned appliance is not the first to support mixed structured and unstructured data analysis. Greenplum itself can already query HDFS from within the Greenplum database. And AsterData, which was recently acquired by Teradata, has won most of its customers on the strength of its SQL-MapReduce capabilities, which enable developers to handle many types of unstructured data query and processing jobs, though not quite to the degree supported by Hadoop.

Given the fast-moving state of Hadoop developments, there will undoubtedly be more novel combinations of Hadoop aimed at blended data-analysis capabilities. In fact, Lonergan acknowledged DataStax’s achievement and predicted that within two to three years, single platforms -- including EMC's -- will handle the mix of unstructured data and Hadoop-style analysis, structured data query with SQL analysis and data mining, and real-time, low-latency in-memory analysis of high volumes of information.

EMC has the first two covered and is "working aggressively" to cover the third, Lonergan said. SAP is tackling the second and third domain with its in-memory strategy, and SAP’s BusinessObjects analytics initiatives could lead to interest in unstructured-data analysis.

DataStax has addressed unstructured and real-time with Brisk, and it could add other open-source software for SQL-relational analysis.

Oracle has talked up the blend of transactional and analytics support, but it's currently an either-or proposition when it comes to configuring Exadata. Real-time, in-memory loading and analysis is also not part of the picture as yet with Exadata.

If innovators pioneer all-purpose analytic databases, it's easy to see that IBM, Oracle, Teradata and all other database contenders will have to respond.

Previous
2 of 2
Next
Comment  | 
Print  | 
More Insights
The Agile Archive
The Agile Archive
When it comes to managing data, donít look at backup and archiving systems as burdens and cost centers. A well-designed archive can enhance data protection and restores, ease search and e-discovery efforts, and save money by intelligently moving data from expensive primary storage systems.
Register for InformationWeek Newsletters
White Papers
Current Issue
InformationWeek - July 21, 2014
Our new survey shows fed agencies focusing more on security, as they should, but they're still behind the times with cloud and overall innovation.
Flash Poll
Video
Slideshows
Twitter Feed
InformationWeek Radio
Live Streaming Video
Everything You've Been Told About Mobility Is Wrong
Attend this video symposium with Sean Wisdom, Global Director of Mobility Solutions, and learn about how you can harness powerful new products to mobilize your business potential.