Commentary

Doug Henschen
Executive Editor, InformationWeek  

SAS Teams With EMC And Teradata -- And Not IBM -- On High-Performance Analytics

New approach supports high-performance modeling and scoring against complete data sets, promising greater accuracy in predictive risk-analysis, fraud-detection and optimization scenarios.

Analytic software giant SAS on Tuesday announced what it describes as a breakthrough approach in high-performance big-data analysis.

And in a apparent realignment in the wake of last year's big-data consolidation, SAS is partnering most closely with EMC and Teradata. EMC last year acquired Greenplum while Teradata recently acquired Aster Data. EMC and Teradata are independents that do not compete head-on with SAS as does IBM, which is recent years has acquired SAS competitors Cognos and SPSS. Late last year IBM also acquired Netezza, a big-data appliance vendor (and SAS partner) that goes head to head with EMC Greenplum and Teradata.


More Insights

Webcasts

More >>

White Papers

More >>

Reports

More >>

In related news, EMC announced new appliances yesterday that will step up competition with IBM and Teradata, but more on that in moment.

SAS's new approach, called High-Performance Analytics (HPA), is an advance over the vendor's prior in-database processing approach in that it supports modeling as well as scoring procedures. Modeling and scoring are foundations of analytics, and in-database processing moves the scoring step to powerful databases rather than taking hours or even days to move huge data volumes into and out of the separate, low-powered servers used to run analytics software.

What's more, HPA supports modeling against complete data sets, eliminating the need for sampling and ensuring more accurate predictive models for risk analysis, fraud detection, optimization and other needs. It does so by relying on EMC and Teradata's powerful parallel-processing environments, which spread the analysis work across hundreds if not thousands of independent compute nodes.

"This gives statisticians an ad-hoc model-building environment where they can use all their data with all their variables," said Michelle Wilkie, high performance computing product manager. "Instead of making educated guesses about how to sample and do variable selection, they can develop the most accurate [prediction] of the truth across all their data and the variables they select."

The push from the business side these days is for predictive, analytic insight rather than after-the-fact reporting, but that requires analysis of the data. With terabyte-scale data volumes now being the rule in so many business domains -- banking, insurance, retail and marketing to name a few -- analytic professionals have been struggling to model and score the data.

In recent years SAS has partnered with Teradata, Netezza, IBM and Aster Data to address part of the problem by moving scoring to the processing platform with the in-database approach. SAS HPA builds on, rather than replaces in-database processing, but it adds techniques to support modeling as well as scoring.

SAS described the advance in a statement as "going beyond in-memory databases or in-memory OLAP," a clear swipe at developments such as SAP's HANA (High-Performance Analytic Appliance) -- with SAP being another player that's competing head-on with SAS by way of BusinessObjects.

Working with EMC and Teradata, SAS said it will enable customers to solve "big analytics" problems that formerly took hours or days in a matter of seconds.

In a lab-based test involving a single logistic regression, for example, a modeling procedure that took more than a day to run previously was processed in about 80 seconds, according to Wilkie. For statisticians who are constantly tuning and tweaking models, "this completely changes their world," she said.

As promising the development sounds, SAS HPA won't be released until the fourth quarter, and not all observers are convinced it will be a breakthrough.

"SAS has been trying for years to deliver in-database parallelized modeling," said independent analyst Curt Monash of Monash Research. "Maybe SAS will have more success this time, but we need to wait and see."


Page 2: 
 1 | 2 |Next Page » 

Related Reading




Currently we allow the following HTML tags in comments:

Single tags

These tags can be used alone and don't need an ending tag.

<br> Defines a single line break

<hr> Defines a horizontal line

Matching tags

These require an ending tag - e.g. <i>italic text</i>

<a> Defines an anchor

<b> Defines bold text

<big> Defines big text

<blockquote> Defines a long quotation

<caption> Defines a table caption

<cite> Defines a citation

<code> Defines computer code text

<em> Defines emphasized text

<fieldset> Defines a border around elements in a form

<h1> This is heading 1

<h2> This is heading 2

<h3> This is heading 3

<h4> This is heading 4

<h5> This is heading 5

<h6> This is heading 6

<i> Defines italic text

<p> Defines a paragraph

<pre> Defines preformatted text

<q> Defines a short quotation

<samp> Defines sample computer code text

<small> Defines small text

<span> Defines a section in a document

<s> Defines strikethrough text

<strike> Defines strikethrough text

<strong> Defines strong text

<sub> Defines subscripted text

<sup> Defines superscripted text

<u> Defines underlined text

InformationWeek encourages readers to engage in spirited, healthy debate, including taking us to task. However, InformationWeek moderates all comments posted to our site, and reserves the right to modify or remove any content that it determines to be derogatory, offensive, inflammatory, vulgar, irrelevant/off-topic, racist or obvious marketing/SPAM. InformationWeek further reserves the right to disable the profile of any commenter participating in said activities.

Disqus Tips To upload an avatar photo, first complete your Disqus profile. | View the list of supported HTML tags you can use to style comments. | Please read our commenting policy.
Subscribe to RSS

Resource Links