9/30/2013
04:16 PM

Hadoop's Second Generation Offers More To Enterprises

The first Hadoop tools weren't easy to deploy or manage. But the second-wave tools deliver great advances in usability.

Hadoop is one of the most disruptive recent innovations in enterprise IT. Its promise is to turn the ever-growing tide of data into profit. In my own industry alone, telecommunications and media, Hadoop supports analytic uses in areas as diverse as network planning, customer support, security operations, fraud detection and targeted advertising.

Yet realizing this potential has been challenging for many mainstream enterprises. Many started experimenting with some of the 13 functional modules that make up Apache Hadoop, a set of technologies that required large teams and several years for the early wave of Hadoop adopters such as eBay, Facebook and Yahoo to master.

The first wave of Hadoop technology, the 1.x generation, was easy neither to deploy nor to manage. The many moving parts that make up a Hadoop cluster were difficult for new users to configure. Seemingly minor details, patch versioning for instance, mattered a lot. As a result, services failed more often than expected, and many problems showed up only under severe load. Skills were and still are in short supply, although there is no shortage of good training available from leading vendors such as Hortonworks and Cloudera.

Fortunately, the second generation of Hadoop, which Hortonworks calls HDP 2.0 and which was announced at Hadoop Summit 2013, fills in many of the gaps. Manageability is a key expectation, particularly for the more business-critical use cases that service providers experience. Hadoop has made great advances here with Ambari, an intuitive Web user interface that makes it much easier to provision, manage and monitor Hadoop clusters. Ambari allows the automation of initial installation, rolling upgrades without service disruption, high availability and disaster recovery, all critical to efficient IT operations.

Moreover, the independent software vendor ecosystem that supports Hadoop distributions is broadening and deepening. This is important for two reasons. First, in our experience, much of a buying decision boils down to how Hadoop fits with existing technology assets; in most cases, that means products from traditional business intelligence and data warehouse vendors. Second, a broader ecosystem alleviates concerns over the skills shortage.
