Microsoft Azure, SAS, MapR: Big Data Roundup

News from Microsoft, SAS, and MapR point to changes in how enterprises are buying, deploying, and learning to use big-data analytics.
15 Hot Skill Sets For IT Pros In 2015
15 Hot Skill Sets For IT Pros In 2015
(Click image for larger view and slideshow.)

The Strata + Hadoop World conference has brought a flurry of big-data announcements that hint that at least some enterprises are ready to tackle bigger issues around big data than storage, including interoperability, real-time usage, and finding ROI in analytics. Here’s a rundown of some of the announcements:

Microsoft announced updates to its Azure HDInsight service, bringing support to Storm and HortonWorks Data Platform 2.2, among others. They have also announced an HDInsight on Linux preview. Storm allows for more real-time analytics, and Microsoft is touting its Visual Studio as a great way to add ease-of-use and easier debugging to Storm. The announcement also brings Hadoop as a service to Linux.

Hortonworks Dataplatform 2.2 promises easier data interoperability because, as it claims in a press release, it is the only 100% open source Hadoop architecture that supports both Linux and Windows natively. This allows data to be moved to HDInsight much more easily, regardless of the architecture it resides on.

For more on Microsoft's Azure announcements you can read Kelly Sheridan's article right here.

SAS also announced its SAS Data Loader for Hadoop today. Data Loader allows for self-service data preparation, even for those with less training and data expertise. It allows for non-experts to use a visual interface to pull data from relational databases in Hadoop, freeing up the time for Hadoop developers and data scientists to do more advanced work. SAS says in a blog post that 80% of data scientist time is used in preparing and maintaining data, rather than in gaining insights. Data Loader is intended to give some of that time back. Here's a video if you want to see what it looks like.

Much like the Azure update, this is another tool that will allow enterprises faster and better access to data, and hopefully, bring more ROI.

The last announcement, from MapR, also pertains to real-time data. The latest version of the MapR distribution featuring Hadoop is designed to facilitate what MapR is calling the real-time, data-centric enterprise. The new release will allow for active-active clusters across databases with no table replication or bidirectional updates. This has major advantages for business recovery, but it also allows the blending of business processes and analytics processes by breaking down analytics silos.

[ Read about the best cities for IT pros in 2015.]

In an interview with InformationWeek, Jack Norris, MapR's Chief Marketing Officer, gave an instance of a use-case where this is being put into everyday practice. MapR's customer, Machine Zone, makers of the popular mobile Game of War, were stuck with batch processing of data. Their game servers would deliver data to their analytics clusters, which caused synchronization problems, data corruption, and sometimes even data loss. MapR was able to synchronize the analytics with the game platform so the data appeared in real-time. This allowed for Machine Zone to make better business decisions around monetizing Game of War.

MapR also announced improvements with table replication, data warehousing, security, and, perhaps most important, data ingestion. The group boasted that most clients could get apps up in less than a quarter and see ROI in less than a year.

Taken together, you can see the trend in big data right now is moving from "What do we do with all this stuff?" to "How can we bring this into our business?" It is a welcome change to a big data field which has been considered by many lately to be more hype than substance. Real-time data of high quality that can be used directly by the business has always been the goal. Now that some of the basic issues have been dealt with, it seems like we might soon be getting there.

Attend Interop Las Vegas, the leading independent technology conference and expo series designed to inspire, inform, and connect the world's IT community. In 2015, look for all new programs, networking opportunities, and classes that will help you set your organization’s IT action plan. It happens April 27 to May 1. Register with Discount Code MPOIWK for $200 off Total Access & Conference Passes.

Editor's Choice
Samuel Greengard, Contributing Reporter
Cynthia Harvey, Freelance Journalist, InformationWeek
Carrie Pallardy, Contributing Reporter
John Edwards, Technology Journalist & Author
Astrid Gobardhan, Data Privacy Officer, VFS Global
Sara Peters, Editor-in-Chief, InformationWeek / Network Computing