Big Data // Big Data Analytics
Commentary
2/13/2014
09:57 AM
Doug Henschen
Doug Henschen
Commentary
Connect Directly
Google+
LinkedIn
Twitter
RSS
E-Mail
50%
50%

9 Key Big Data Developments From Strata

We analyze the important news from SAS, Hortonworks, MetaScale, and others at the Strata conference, as big data seeks a productive next chapter.

Hadoop and beyond
Technical announcements are inevitable at a big data, er, "data at work" conference. Strata 2014 saw more than a few, but here's a short list:

MetaScale Appliances: In a bit of surprise announcement, MetaScale, the subsidiary of Sears Holdings, announced that it's offering a "line" of branded Hadoop appliances that will run Cloudera, Hortonworks, or another distribution of the customer's choice. Mind you, Cloudera, Hortonworks, and others have hardware partners that offer everything from recommended configurations to single-SKU, software-preinstalled options (as in the case of the Oracle Big Data Appliance running Cloudera, for example).

[ Join InformationWeek's Doug Henschen in a Feb. 13, 2 pm ET interview on "16 Top Big Data Analytics Platforms." Video will be archived. ]

I had it in my mind that MetaScale was a consulting organization aimed at helping big companies exploit their data with help from Hadoop and related tools, technologies, and analytics. Its expertise, developed first at Sears, is particularly relevant to companies paying big bucks for mainframe compute cycles. That impression was formed after spending a day with MetaScale executives in their offices and reporting on achievements at Sears. Okay, that was 16 months ago and executives ranks, and perhaps priorities, have since changed.

Maybe MetaScale wants to get its foot in the door earlier in the process by helping you with the basics of deploying a Hadoop cluster. But we understood this outfit's real value to be delivered higher up in the stack, helping customers to understand how to take advantage of data and reinvent legacy processes with the aid of a big data platform.

Couchbase 2.5: This upgrade of the highly scalable, NoSQL database promises better performance through Rack Awareness for high availability and better security through cross-data-center data encryption. With Couchbase Server 2.5's Rack Awareness, administrators can create logical groupings of Couchbase Server nodes and replica copies of data that are automatically distributed across server nodes on different racks. This ensures that data is secure despite disruptions such as power outages or switch or rack failure, according to Couchbase. Building on existing cross-datacenter replication capabilities, the 2.5 update adds a secure data-encryption option whereby data moving across wide area networks can be transmitted using SSL encryption between datacenters.

InfinDB 4.5: Before we get to the technology news, the company formerly know as Calpont has been renamed InfiniDB. This matches the name of the company's massively parallel processing database management system, which has been made to run in the cloud and on Apache Hadoop as well as MPP clusters.

We were on to the name change when we included "InfiniDB," alphabetically, in our 16 Top Big Data Analytics Platformscollection. The high points of the InfiniDB 4.5 release announced this week include new Hadoop capabilities such as fast bulk loading for HDFS, Apache Sqoop integration with parallel extraction for bi-directional data load/unload. A new InfiniDB Enterprise Manager provides a unified console for monitoring and managing sources and system resources. New REST APIs support integration into various enterprise systems.

InfiniDB joins Pivotal (with HAWQ) in the camp of vendors running relational database engines on Hadoop. MapR and HP Vertica also joined that camp this week in a separate, Strata-related announcement covered Tuesday. The payoff is a fast SQL-on-Hadoop option that's likely to beat Impala and Hive on query speeds, but we have yet to see benchmarks or tests that prove that performance.

InformationWeek 2014 Healthcare IT Priorities Survey: Healthcare providers are under pressure from Meaningful Use Stage 2, ICD-10 implementation, and the transition to new population health/accountable care business models, all of which have big impacts on information technology needs. We'd like to know how your organization is responding. Take the InformationWeek 2014 Healthcare IT Priorities Survey today and be eligible to win a great prize. Survey ends Feb. 14.

Doug Henschen is Executive Editor of InformationWeek, where he covers the intersection of enterprise applications with information management, business intelligence, big data and analytics. He previously served as editor in chief of Intelligent Enterprise, editor in chief of ... View Full Bio

Previous
3 of 3
Next
Comment  | 
Print  | 
More Insights
Comments
Newest First  |  Oldest First  |  Threaded View
D. Henschen
50%
50%
D. Henschen,
User Rank: Author
2/13/2014 | 3:24:34 PM
Re: Seems smart, but late?
Good questions, Lorna. I'd say Autonomy and Vertica have capabilities that aren't matched by open-source products. Maybe parts of what they do, but not head-to-head competition from purely open source products. Maybe I'm missing an option, though, so I welcome comment on alternatives.
Lorna Garey
50%
50%
Lorna Garey,
User Rank: Author
2/13/2014 | 1:02:46 PM
Seems smart, but late?
HP opening up Autonomy and Vertica as platforms seems like a smart move, but what do you think are the odds it can get the developer ecosystem at this point that it needs to make a big play? What's the benefit for a developer to buy into Autonomy's code vs. going a more reliably open route?
RobPreston
50%
50%
RobPreston,
User Rank: Author
2/13/2014 | 11:30:44 AM
All Data Not Big Data
The industry has started to use "big data" as almost synonymous with analytics, no matter the size of the data pool being analyzed. Nice to see a conference organizer grounding things in reality.
Laurianne
50%
50%
Laurianne,
User Rank: Author
2/13/2014 | 11:15:37 AM
Big data talent
I am eager to hear about how the big data talent situation has evolved since last year's Strata conference. Anyone on the ground at the conference want to weigh in here?
6 Tools to Protect Big Data
6 Tools to Protect Big Data
Most IT teams have their conventional databases covered in terms of security and business continuity. But as we enter the era of big data, Hadoop, and NoSQL, protection schemes need to evolve. In fact, big data could drive the next big security strategy shift.
Register for InformationWeek Newsletters
White Papers
Current Issue
InformationWeek Tech Digest, Dec. 9, 2014
Apps will make or break the tablet as a work device, but don't shortchange critical factors related to hardware, security, peripherals, and integration.
Video
Slideshows
Twitter Feed
InformationWeek Radio
Archived InformationWeek Radio
Join us for a roundup of the top stories on InformationWeek.com for the week of December 14, 2014. Be here for the show and for the incredible Friday Afternoon Conversation that runs beside the program.
Sponsored Live Streaming Video
Everything You've Been Told About Mobility Is Wrong
Attend this video symposium with Sean Wisdom, Global Director of Mobility Solutions, and learn about how you can harness powerful new products to mobilize your business potential.