Microsoft Ditches Dryad, Focuses On Hadoop

Microsoft drops LINQ to HPC big-data processing project, in favor of Apache's open source Hadoop platform.

Just one month after announcing plans to release Windows Azure- and Windows Server-based implementations of open source Apache Hadoop, Microsoft quietly changed course on LINQ to Windows High Performance Computing (HPC), codenamed "Dryad."

Microsoft has been working on Dryad for more than five years, but a November 11 post on the Windows HPC Team Blog revealed that the project has been discontinued before ever seeing a production release. The move to drop Dryad was first reported Wednesday by Microsoft-watcher Mary Jo Foley.


More Software Insights

White Papers

More >>

Reports

More >>

Webcasts

More >>

Dryad was intended to run big-data jobs across HPC, Microsoft's clustered server environment. But such a release would have presented a proprietary and competing alternative to Hadoop, which is rapidly emerging as the leading platform for distributed data processing.

As InformationWeek reported October 12, Microsoft's embrace of Hadoop follows in the footsteps of EMC and IBM, which each introduced their own distributions of the software this year. And on October 3, Oracle announced plans for yet another Hadoop release, this one expected next year along with a supporting Oracle Big Data Appliance.

[ Want more background on Hadoop? Read Hadoop Spurs Big Data Revolution. ]

In the face of this growing support, a single-vendor platform like Dryad faced long odds of success. "Hadoop has emerged as a great platform for analyzing unstructured data or large volumes of data at low cost," acknowledged Don Pattee, senior program manager, on the HPC Team Blog. "It also has a vibrant community of users and developers eager to innovate on this platform."

Microsoft has partnered with Hortonworks, a Yahoo! spinoff that specializes in Hadoop, to help develop software distributions for Windows Server and Windows Azure. A preview version is expected to appear on the Azure cloud computing platform by the end of this year.

Interest in Hadoop is driven by demands for high scalability (into the petabytes) at low cost, and the need to flexibly handle loosely structured data such as social network feeds, email, and documents or inconsistently structured data from clickstreams, Web logs, and sensors. These data types cannot be effectively managed in a relational database such as Microsoft SQL Server.

"We're seeing significant changes in the data landscape, with businesses encountering more types of data--more shapes, more sizes--than ever before," Doug Leland, general manager of product management for SQL Server, told InformationWeek. "To address those changes we need a new data platform."

Microsoft's software direction is now clear, but there has been no indication whether there will be supporting appliances on third-party hardware, as there are for the SQL Server Parallel Data Warehouse (PDW.) Hewlett-Packard offers a PDW appliance in partnership with Microsoft and it's also an HPC partner. It's a safe bet that Microsoft will tap HP and possibly others to develop turnkey appliances to answer EMC's Greenplum Modular Data Computing Appliance, released in September, and Oracle's planned Big Data Appliance.

Most companies have little control over their heaps of unstructured data, our research reveals. It's time for a content management strategy. Get the new, all-digital InformationWeek supplement. Download it now. (Free with registration.)

Related Reading




Currently we allow the following HTML tags in comments:

Single tags

These tags can be used alone and don't need an ending tag.

<br> Defines a single line break

<hr> Defines a horizontal line

Matching tags

These require an ending tag - e.g. <i>italic text</i>

<a> Defines an anchor

<b> Defines bold text

<big> Defines big text

<blockquote> Defines a long quotation

<caption> Defines a table caption

<cite> Defines a citation

<code> Defines computer code text

<em> Defines emphasized text

<fieldset> Defines a border around elements in a form

<h1> This is heading 1

<h2> This is heading 2

<h3> This is heading 3

<h4> This is heading 4

<h5> This is heading 5

<h6> This is heading 6

<i> Defines italic text

<p> Defines a paragraph

<pre> Defines preformatted text

<q> Defines a short quotation

<samp> Defines sample computer code text

<small> Defines small text

<span> Defines a section in a document

<s> Defines strikethrough text

<strike> Defines strikethrough text

<strong> Defines strong text

<sub> Defines subscripted text

<sup> Defines superscripted text

<u> Defines underlined text

InformationWeek encourages readers to engage in spirited, healthy debate, including taking us to task. However, InformationWeek moderates all comments posted to our site, and reserves the right to modify or remove any content that it determines to be derogatory, offensive, inflammatory, vulgar, irrelevant/off-topic, racist or obvious marketing/SPAM. InformationWeek further reserves the right to disable the profile of any commenter participating in said activities.

Disqus Tips To upload an avatar photo, first complete your Disqus profile. | View the list of supported HTML tags you can use to style comments. | Please read our commenting policy.
T-Shirt Giveaway T-Shirt Giveaway: Each week we're selecting one great comment from our readers. The author of the comment will receive an InformaitonWeek Community t-shirt. So get posting!
Subscribe to RSS

Resource Links