Big Data // Big Data Analytics
News
6/26/2014
02:46 PM
Connect Directly
LinkedIn
Twitter
Google+
RSS
E-Mail

Hortonworks Certifies Spark On YARN, Hadoop

Hortonworks catches up to Cloudera with YARN-managed implementation of Spark in-memory framework for machine learning on Hadoop.

Comment  | 
Print  | 
Comments
Newest First  |  Oldest First  |  Threaded View
srowen
50%
50%
srowen,
User Rank: Apprentice
7/4/2014 | 1:49:35 PM
Re: That's nice, but ...
I am from Cloudera and have committed about 50 patches to Spark. Same goes for a few other people here. What are you looking at?
ap2snoopy
50%
50%
ap2snoopy,
User Rank: Apprentice
7/4/2014 | 1:34:53 PM
Re: That's nice, but ...
None of the three 'race horses' (MapR, Cloudera, Hortonworks) seem to have contributed to Spark development.

UCB and Databricks (Spark is their main focus) seem to have the most commiters. 

https://cwiki.apache.org/confluence/display/SPARK/Committers

 

 
Charlie Babcock
50%
50%
Charlie Babcock,
User Rank: Author
6/30/2014 | 9:28:57 PM
Spark keeps Hadoop competitive
It looks like there is a healthy competition between these companies that will do much to keep their respective Hadoops systems competitive. MapR Spark, Hortonworks Spark on Yarn and Cloudera Manager's support for Spark are pushing the boundaries of big data.
srowen
50%
50%
srowen,
User Rank: Apprentice
6/26/2014 | 6:23:02 PM
That's nice, but ...
... would be nicer if more than 0 people from HortonWorks made any contribution to Spark. Or you could actually run Spark in production with HDP.
D. Henschen
50%
50%
D. Henschen,
User Rank: Author
6/26/2014 | 4:25:41 PM
Another case of commercial management tool versus open-source management tool
For more on Cloudera's options for implementing Spark, incuding on YARN, click here. Given Cloudera's use of YARN, the key difference between Cloudera and Hortonworks use of Spark seems to boil down to the management software used for deploying, monitoring, and managing the software (YARN does the workloads). In Cloudera's case it's commerical Cloudera Manager software. In Hortonworks' case it's open source Ambari software, but Ambari support is part of what Hortonworks is still working on at this point. Reading between the lines, I would expect HDP 2.1 to become generally available until this fall.
6 Tools to Protect Big Data
6 Tools to Protect Big Data
Most IT teams have their conventional databases covered in terms of security and business continuity. But as we enter the era of big data, Hadoop, and NoSQL, protection schemes need to evolve. In fact, big data could drive the next big security strategy shift.
Register for InformationWeek Newsletters
White Papers
Current Issue
InformationWeek Tech Digest September 24, 2014
Start improving branch office support by tapping public and private cloud resources to boost performance, increase worker productivity, and cut costs.
Video
Slideshows
Twitter Feed
InformationWeek Radio
Sponsored Live Streaming Video
Everything You've Been Told About Mobility Is Wrong
Attend this video symposium with Sean Wisdom, Global Director of Mobility Solutions, and learn about how you can harness powerful new products to mobilize your business potential.