Big Data // Big Data Analytics
News
6/24/2013
12:26 PM
Connect Directly
Google+
LinkedIn
Twitter
RSS
E-Mail
50%
50%

Datameer Democratizes Advanced Big Data Analytics

Datameer 3.0 promises drag-and-drop machine learning with clustering, column-dependency, decision tree and predictive recommendations on top of Hadoop.

5 Big Wishes For Big Data Deployments
5 Big Wishes For Big Data Deployments
(click image for larger view and for slideshow)

There's storing big data and reporting against big data, and then there's gaining insights from big data with advanced analytics. The third level of maturity delivers the most value, and it's what Datameer is after with Datameer 3.0, announced Monday and set for general release this fall.

Datameer is a data-integration, data-management and self-service analytics platform that runs on top of Hadoop, and it's used by notable customers including Sears Holdings and Cardinal Health to bring together and analyze high-scale structured and unstructured data sets on Hadoop. The options for analysis have heretofore included a spreadsheet-style interface and a short list of data visualizations and packaged analytics.

Datameer 3.0 introduces four powerful options for advanced analytics: clustering, column-dependencies, decision trees and recommendation. What these four have in common is that they are machine-learning analyses driven by algorithms, and the data tells the analyst what's important.

[ Want more on Datameer in action? Read Why Sears Is Going All-In On Hadoop. ]

"With functional analytics, you as human being have to decide what you're going to look for, filter and analyze," Stefan Groschupf, CEO of Datameer, told InformationWeek. "As you integrate more diverse data and the larger the data sets become, the more you need machine learning to help you figure out what's important."

The four styles of analysis were chosen for their popularity. Clustering is used to find groups in data, as in segments of important customers. Column-dependency analysis uncovers important relationships among dimensions of data, such as age, income, location and product purchases, for example. Decision trees can be used to track conversion rates, for example, among different segments of customers in a sales funnel. And predictive recommendations are familiar to anyone who has seen Netflix movie recommendations or Amazon product-purchase suggestions.

Datameer calls the four new analysis options Smart Analytics because they don't require the complex data-preparation, sampling and scoring procedures associated with advanced analytics, according to Groschupf. With Datameer 3.0, users drag and drop data-set descriptions from a list of everything available on the Hadoop cluster. Preview analyses give users a sense of what they'll discover before the complete analysis is executed at scale behind the scenes. Datameer's software handles all the complexities of MapReduce processing without coding required by end users, according to Groshupf.

"One of our beta customers that was spending $1 million per month on Google Ad words used these analyses and found that they could cut that spend to $400,000 per month by focusing on the key words that were shown to be most likely to convert," Groshupf said.

The packaged functional analytics already available from Datameer include analyses such as Salesforce.com data in combination with Google Ad Words, Marketo leads, Web analytics or sentiment analysis against Twitter. More than 90 such packaged, template applications are available from Datameer's app store, with many having been developed by partners.

Datameer competes with Hadapt, Karmasphere, Platfora and other startups that offer business intelligence and analytics platforms designed to run on top of Hadoop. Groschupf said he isn't too worried about Cloudera Impala and other SQL-on-Hadoop options, such as Hortonworks Stinger, MapR-promoted Apache Drill or IBM Big SQL, because the universe of SQL-savvy professionals is in the low hundreds of thousands. Datameer's Smart Analytics, packaged analytics and spreadsheet tools, in contrast, are designed to be used by business analysts, he said.

"We're focused on the millions of business users who want easy-to-use tools and who don't want to have to wait for IT to help them make sense of that information," he said.

To understand how to secure big data, you have to understand what it is -- and what it isn't. In the Security Implications Of Big Data Strategies report, we show you how to alter your security strategy to accommodate big data -- and when not to. (Free registration required.)

Comment  | 
Print  | 
More Insights
6 Tools to Protect Big Data
6 Tools to Protect Big Data
Most IT teams have their conventional databases covered in terms of security and business continuity. But as we enter the era of big data, Hadoop, and NoSQL, protection schemes need to evolve. In fact, big data could drive the next big security strategy shift.
Register for InformationWeek Newsletters
White Papers
Current Issue
InformationWeek Tech Digest, Nov. 10, 2014
Just 30% of respondents to our new survey say their companies are very or extremely effective at identifying critical data and analyzing it to make decisions, down from 42% in 2013. What gives?
Video
Slideshows
Twitter Feed
InformationWeek Radio
Archived InformationWeek Radio
Join us for a roundup of the top stories on InformationWeek.com for the week of November 16, 2014.
Sponsored Live Streaming Video
Everything You've Been Told About Mobility Is Wrong
Attend this video symposium with Sean Wisdom, Global Director of Mobility Solutions, and learn about how you can harness powerful new products to mobilize your business potential.