We analyze the important news from SAS, Hortonworks, MetaScale, and others at the Strata conference, as big data seeks a productive next chapter.
Hadoop and beyond Technical announcements are inevitable at a big data, er, "data at work" conference. Strata 2014 saw more than a few, but here's a short list:
MetaScale Appliances: In a bit of surprise announcement, MetaScale, the subsidiary of Sears Holdings, announced that it's offering a "line" of branded Hadoop appliances that will run Cloudera, Hortonworks, or another distribution of the customer's choice. Mind you, Cloudera, Hortonworks, and others have hardware partners that offer everything from recommended configurations to single-SKU, software-preinstalled options (as in the case of the Oracle Big Data Appliance running Cloudera, for example).
I had it in my mind that MetaScale was a consulting organization aimed at helping big companies exploit their data with help from Hadoop and related tools, technologies, and analytics. Its expertise, developed first at Sears, is particularly relevant to companies paying big bucks for mainframe compute cycles. That impression was formed after spending a day with MetaScale executives in their offices and reporting on achievements at Sears. Okay, that was 16 months ago and executives ranks, and perhaps priorities, have since changed.
Maybe MetaScale wants to get its foot in the door earlier in the process by helping you with the basics of deploying a Hadoop cluster. But we understood this outfit's real value to be delivered higher up in the stack, helping customers to understand how to take advantage of data and reinvent legacy processes with the aid of a big data platform.
Couchbase 2.5: This upgrade of the highly scalable, NoSQL database promises better performance through Rack Awareness for high availability and better security through cross-data-center data encryption. With Couchbase Server 2.5's Rack Awareness, administrators can create logical groupings of Couchbase Server nodes and replica copies of data that are automatically distributed across server nodes on different racks. This ensures that data is secure despite disruptions such as power outages or switch or rack failure, according to Couchbase. Building on existing cross-datacenter replication capabilities, the 2.5 update adds a secure data-encryption option whereby data moving across wide area networks can be transmitted using SSL encryption between datacenters.
InfinDB 4.5: Before we get to the technology news, the company formerly know as Calpont has been renamed InfiniDB. This matches the name of the company's massively parallel processing database management system, which has been made to run in the cloud and on Apache Hadoop as well as MPP clusters.
We were on to the name change when we included "InfiniDB," alphabetically, in our 16 Top Big Data Analytics Platformscollection. The high points of the InfiniDB 4.5 release announced this week include new Hadoop capabilities such as fast bulk loading for HDFS, Apache Sqoop integration with parallel extraction for bi-directional data load/unload. A new InfiniDB Enterprise Manager provides a unified console for monitoring and managing sources and system resources. New REST APIs support integration into various enterprise systems.
InfiniDB joins Pivotal (with HAWQ) in the camp of vendors running relational database engines on Hadoop. MapR and HP Vertica also joined that camp this week in a separate, Strata-related announcement covered Tuesday. The payoff is a fast SQL-on-Hadoop option that's likely to beat Impala and Hive on query speeds, but we have yet to see benchmarks or tests that prove that performance.
InformationWeek 2014 Healthcare IT Priorities Survey: Healthcare providers are under pressure from Meaningful Use Stage 2, ICD-10 implementation, and the transition to new population health/accountable care business models, all of which have big impacts on information technology needs. We'd like to know how your organization is responding. Take the InformationWeek 2014 Healthcare IT Priorities Survey today and be eligible to win a great prize. Survey ends Feb. 14.
Doug Henschen is Executive Editor of InformationWeek, where he covers the intersection of enterprise applications with information management, business intelligence, big data and analytics. He previously served as editor in chief of Intelligent Enterprise, editor in chief of ... View Full Bio
6 Tools to Protect Big DataMost IT teams have their conventional databases covered in terms of security and business continuity. But as we enter the era of big data, Hadoop, and NoSQL, protection schemes need to evolve. In fact, big data could drive the next big security strategy shift.
Big Data Brings Big Security ProblemsWhy should big data be more difficult to secure? In a word, variety. But the business won’t wait to use it to predict customer behavior, find correlations across disparate data sources, predict fraud or financial risk, and more.
Join us for a roundup of the top stories on InformationWeek.com for the week of December 14, 2014. Be here for the show and for the incredible Friday Afternoon Conversation that runs beside the program.