Big Data // Hardware/Architectures
News
5/9/2014
09:16 AM
Connect Directly
LinkedIn
Twitter
Google+
RSS
E-Mail
50%
50%

HP Offers Appliance For Microsoft Analytics Platform

HP ConvergedSystem 300 promises fast deployment of SQL Server 2014 and HDInsight Hadoop.

10 Hadoop Hardware Leaders
10 Hadoop Hardware Leaders
(Click image for larger view and slideshow.)

Name a popular software platform, and HP is probably developing an appliance for that. OK, you won't find IBM or Oracle examples, but Citrix, Microsoft, SAP, and VMWare are all on the list, and Microsoft is about to get another option.

HP on Thursday announced the ConvergedSystem 300 for the Microsoft Analytics Platform System (APS). Introduced in April, Microsoft APS combines SQL Server 2014 and, optionally, HDInsight, Microsoft's Apache Hadoop distribution based on the Hortonworks Data Platform (HDP) for Windows. Microsoft APS replaces the SQL Server 2012 PDW appliance, and it's the only on-premises option for HDInsight deployment. Those wishing to run Hadoop on Windows without SQL Server can use Hortonworks HDP for Windows.

[Have you heard about Helion? Read HP Bets Big On OpenStack And Hybrid Cloud.]

The ConvergedSystem 300 for Microsoft APS will ship with SQL Server Parallel Data Warehouse (PDW) configured to customer specifications. Optionally, customers can add HDInsight Hadoop nodes configured to run on the same box. The appliance combines servers, storage, and networking, and the whole idea is to save customers time and trouble.

"We're providing pre-optimized, pre-built building blocks that enable customers to deploy these new workloads quickly and without expertise," said Paul Miller, VP of marketing for HP ConvergedSystems, in a phone interview with InformationWeek. "You can go from order to up and running on Microsoft SQL Server in less than 30 days."

SQL Server PDW users can query the data on Hadoop nodes using PolyBase query capabilities for multi-structured data. Those Hadoop nodes could be HDInsight in the same rack or system or Azure HDInsight running in the Windows Azure cloud. PolyBase can also query against combinations of PDW and Apache, Cloudera, or Hortonworks HDP nodes.

The HP ConvergedSystem 300 for Microsoft APS starts with a 1/4-rack configuration for $275,000 (not including SQL Server PDW). Customers scale up in 1/4-rack increments, and they can mix SQL Server and HDInsight nodes. The entire system can span multiple racks for up to 6 petabytes of storage.

HP's other ConvergedSystem offerings include the ConvergedSystem 300 and 700x for Virtualization (either Microsoft or VMware), the ConvergedSystem 500 for SAP Hana, and the ConvergedSystem 100 for Citrix hosted desktops. The 300-series offerings use HP's rackmount servers, such as the DL360 and DL380. The 500 series is geared to high-memory deployments (thus the fit with Hana) using servers such as the DL580. The high-scale 700 series combines HP's blade architecture servers and 3Par storage.

Customers interested in the ConvergedSystem 300 for Microsoft APS consult with HP engineers on workloads and desired configuration specifications. "Most customers have an enterprise license agreement with Microsoft with a site license," said Miller. "When we ship we have a process to enable the software, and we pre-load and pre-test everything in the factory."

The ConvergedSystem 300 for Microsoft APS is expected to ship by mid summer.

Can the trendy tech strategy of DevOps really bring peace between developers and IT operations -- and deliver faster, more reliable app creation and delivery? Also in the DevOps Challenge issue of InformationWeek: Execs charting digital business strategies can't afford to take Internet connectivity for granted.

Doug Henschen is Executive Editor of InformationWeek, where he covers the intersection of enterprise applications with information management, business intelligence, big data and analytics. He previously served as editor in chief of Intelligent Enterprise, editor in chief of ... View Full Bio

Comment  | 
Print  | 
More Insights
Comments
Newest First  |  Oldest First  |  Threaded View
joseafq
50%
50%
joseafq,
User Rank: Apprentice
7/29/2014 | 12:37:58 PM
Re: Want a Hadoop Appliance? Go Build It Yourself
Replying comment by Mr. D. Henschen

 

Maybe Microsoft APS with HDInsight region(s) was designed:

- to take advantage of the internal rack network speed (54 GB/sec) when accessing non-structured data (Polybase allows to access structured and non-structured data just using regular T-SQL. You don't need to learn MapReduce, Hive, Pig, etc. to access non-structured data).

- or to comply with regulatory restrictions (like: "- All the sensitive data must be stored inside the country, not in a I-don't-know-where distributed storage farm")

- or just because accessing data outside on the appliance means to use corporate network which, many times, is not a dedicated network, and the performance is not as good as it should be.

- or because statistical information could be stored in the control node, so it's possible to create improved execution plans when accessing non-structured data

- or because the control node can manage the different workloads, providing better performance to end-user applications

- or because it's easier to manage the HDFS if it's on-prem

- or because it's easier to manage security (who access what) in structured and non-structured data.

- ...

So, IMHO, I guess there are many reasons to deploy such a "mixed" architecture in one rack.

As you mention, it's required, at least, 1 basic unit (2 compute nodes + storage in the HP appliance, or 3 compute nodes + storage in the Dell appliance) with SQL Server EE. All the other nodes could be HDInsight regions (in any flavour: Hadoop, Cloudera, etc.).
D. Henschen
50%
50%
D. Henschen,
User Rank: Author
5/9/2014 | 10:02:28 AM
Want a Hadoop Appliance? Go Build It Yourself
Microsoft APS follows the route of adding Hadoop -- in this case Microsoft's HDInsight distirbution -- to what is otherwise a database appliance -- in this case Microsoft SQL Server 2014 Parallel Data Warehouse. Want a Hadoop-only deployment? You'll have to use Hortonworks Data Platform for Windows. HP does have suggested hardware configurations for HDP, but if you want a Hadoop appliance, it's probably going to throw HP Vertica Community Edition into the bundle -- just as Pivotal likes to mingle Greenplum with the Pivotal HD Hadoop distribution.

I'm not sure these guys are thinking these things through to scale. Relational databases and Hadoop do work together, but if you're doing a data lake with Hadoop, I'm thinking you're only going to need one rack with RDBMS while the Hadoop cluster is going to potentially grow across many racks. Or maybe you would have a couple of RDBMS racks and a handful for Hadoop. Why intermix on a single rack unless your not likely to move beyond the confines of a single rack? PolyBase connects the two worlds no matter where the clusters might be: on separate racks or even in the cloud on Azure. Opinions?  
In A Fever For Big Data
In A Fever For Big Data
Healthcare orgs are relentlessly accumulating data, and a growing array of tools are becoming available to manage it.
Register for InformationWeek Newsletters
White Papers
Current Issue
InformationWeek Tech Digest - September 10, 2014
A high-scale relational database? NoSQL database? Hadoop? Event-processing technology? When it comes to big data, one size doesn't fit all. Here's how to decide.
Flash Poll
Video
Slideshows
Twitter Feed
InformationWeek Radio
Sponsored Live Streaming Video
Everything You've Been Told About Mobility Is Wrong
Attend this video symposium with Sean Wisdom, Global Director of Mobility Solutions, and learn about how you can harness powerful new products to mobilize your business potential.