Cloudera Director 2.0 Debuts, Google Donates To Apache Foundation: Big Data Roundup - InformationWeek

InformationWeek is part of the Informa Tech Division of Informa PLC

This site is operated by a business or businesses owned by Informa PLC and all copyright resides with them.Informa PLC's registered office is 5 Howick Place, London SW1P 1WG. Registered in England and Wales. Number 8860726.

Data Management // Software Platforms
10:06 AM
Connect Directly

Cloudera Director 2.0 Debuts, Google Donates To Apache Foundation: Big Data Roundup

Cloudera has updated a key tool for managing big data, Google has contributed its Cloud Dataflow platform to the Apache Foundation Incubator. We have this and more in our Big Data Roundup for the week ending January 24.

IoT 2016: 13 Hot Trends For Business
IoT 2016: 13 Hot Trends For Business
(Click image for larger view and slideshow.)

Hadoop distributor Cloudera has issued an update to one of its offerings, Google has submitted one of its efforts to the Apache Foundation as a potential incubator project, MariaDB has raised some funds, and Netflix CEO Reed Hastings talks about the limits of data. Plus, we look at how data could help you predict this year's Academy Awards, all in our Big Data Roundup for the week ending January 24.

Cloudera Director 2.0

Let's start with Cloudera's news. This Hadoop distribution company updated Cloud Director, its big data deployment and management tool. Cloudera said that version 2.0 simplifies running common Hadoop workloads in the cloud, such as ETL and modeling, business intelligence and analytics, and application delivery. The tool is designed for both scale and production environments in the cloud, according to Cloudera VP of products Charles Zedlewski.

Cloudera has added spot instance support to decrease hosting costs for transient workloads and automatic job submissions to spin up and terminate clusters on a per-job basis. Cloudera also said that the newest release of Cloudera, version 5.5, has introduced support for Apache Hive and Apache Spark on Amazon S3, so users can continue to use the their choice of tools, independent of where the data resides.

(Image: PonyWang/iStockphoto)

(Image: PonyWang/iStockphoto)

In addition, the new version adds cluster cloning and cluster repair to increase the end-user base and repair clusters without affecting end users. And for application delivery workloads, Cloudera Director 2.0 has integrated high availability and Kerberos configurations within the overall bootstrap workflow, making it easier to set up, Cloudera said.

[ Intel is a big Cloudera investor. Want to know more about Intel's big data efforts? Read Intel's TAP Big Data Platform Gains Healthcare Cloud Partners. ]

The new release works across major cloud platforms, including AWS and Google Cloud Platform, and it includes the Open Cloud Connector to enable integration with other preferred or private clouds. Users who want to deploy on Microsoft Azure can provision Cloudera Enterprise via the Azure Marketplace, the company said.

Google Cloud Dataflow

Google this week has sent a proposal for its Cloud Dataflow to be accepted as an Apache Foundation Incubator project. Google's Cloud Dataflow is a platform for processing big data in the cloud. It features an open source, Java-based SDK to help make it easy to integrate with other cloud-based analytics tools. That includes letting organizations use their existing tool investments and integrate them even as they adopt more advanced technologies.

Google announced the submission in a blog post this week. The search giant said that it has submitted the project along with participants from Cloudera, Data Artisans, Talend, Cask, and PayPal.

MariaDB Raises Funds

MariaDB, which offers an open-source relational database, has raised $9 million in equity funding for advanced technology development and to accelerate sales, the company announced this week. The round included investments from Intel Capital and California Technology Ventures. The company also announced the appointment of Michael Howard as its new CEO and Michael "Monty" Widenius as CTO.

AI, Machine Learning, And IoT

At InformationWeek, we've put together some interesting coverage in the last week about adoption of AI and machine learning in the enterprise, as well as some significant open source machine learning moves by big vendors in recent months. We also explored how businesses and other enterprises are using the Internet of Things (IoT) to drive value.

We also examined how companies are monetizing data now and in the months and years ahead.

Netflix And Data

Netflix is known for leveraging data to drive its business, including decisions on creating new original shows. CEO Reed Hastings recently offered a bit of a caveat, though. Speaking at the DLD Conference in Munich, Germany, last week, Hastings said: "We start with the data. But the final call is always gut. It's informed intuition," according to this report on VentureBeat. Hastings pointed to one Netflix exec, Ted Sarandos, as the man who has the "golden gut," but also cited a distributed group of executives and managers who make the final call over content.

And The Oscar Goes To...

Finally, FiveThiryEight provided some insights this week on how to game Academy Award predictions. We'd tell you all about it, but...Spoilers!

Jessica Davis has spent a career covering the intersection of business and technology at titles including IDG's Infoworld, Ziff Davis Enterprise's eWeek and Channel Insider, and Penton Technology's MSPmentor. She's passionate about the practical use of business intelligence, ... View Full Bio

We welcome your comments on this topic on our social media channels, or [contact us directly] with questions about the site.
Comment  | 
Print  | 
More Insights
Newest First  |  Oldest First  |  Threaded View
User Rank: Ninja
1/26/2016 | 5:13:47 AM
Re: AWS Spot Instances, a cost-effective way to go

Cloudera is trying to make the utilization of AWS Spot Instances much-much easier than it is today.

The point is are they really succeeding in what they have set out to achieve here today?The jury still seems to be out on this one.

If they do succeed then Awesome! If they don't we could do with less Complexity in Big Data Management today.

Charlie Babcock
Charlie Babcock,
User Rank: Author
1/25/2016 | 5:58:14 PM
AWS Spot Instances, a cost-effective way to go
Use of Cloudera on AWS Spot Instances is one of the most efficient options available, if you can master use of Spot Instances. Cloudera is wisely trying to make it easier.
What Becomes of CFOs During Digital Transformation?
Joao-Pierre S. Ruth, Senior Writer,  2/4/2020
Fighting the Coronavirus with Analytics and GIS
Jessica Davis, Senior Editor, Enterprise Apps,  2/3/2020
IT Careers: 10 Job Skills in High Demand This Year
Cynthia Harvey, Freelance Journalist, InformationWeek,  2/3/2020
White Papers
Register for InformationWeek Newsletters
Current Issue
IT 2020: A Look Ahead
Are you ready for the critical changes that will occur in 2020? We've compiled editor insights from the best of our network (Dark Reading, Data Center Knowledge, InformationWeek, ITPro Today and Network Computing) to deliver to you a look at the trends, technologies, and threats that are emerging in the coming year. Download it today!
Flash Poll