Apache Spark: 3 Promising Use-Cases - InformationWeek

InformationWeek is part of the Informa Tech Division of Informa PLC

This site is operated by a business or businesses owned by Informa PLC and all copyright resides with them.Informa PLC's registered office is 5 Howick Place, London SW1P 1WG. Registered in England and Wales. Number 8860726.

IoT
IoT
Data Management // Big Data Analytics
Commentary
3/27/2015
11:36 AM
James Kobielus
James Kobielus
Commentary
Connect Directly
Twitter
RSS

Apache Spark: 3 Promising Use-Cases

Spark is the shiny new thing in big data, but how will it stand out? Here's a look at "fog computing," cloud computing, and streaming data-analysis scenarios.

Apache Spark supports SQL, machine-learning, graph, and streaming analysis against a range of data types, and in multiple development languages.

Apache Spark supports SQL, machine-learning, graph, and streaming analysis against a range of data types, and in multiple development languages.

Comment  | 
Print  | 
Comments
Threaded  |  Newest First  |  Oldest First
D. Henschen
50%
50%
D. Henschen,
User Rank: Author
3/27/2015 | 12:48:15 PM
Interesting left-handed compliments here
James doesn't really sound all that enthusiastic about Spark, describing it as unproven and underplaying, in my view, the interest, adoption, and number of proven applications on the platform. Interestingly, IBM sponsored Spark Summit East a couple of weeks ago, but it's probably hoping to sell its own commercial solutions to the attendees of that event. It is officially supporting Spark, but it doesn't have a distribution of or integration with that software as yet.

As I understand it, Spark has more than 500 enterprise adopters, and Spark promoter Databricks has more than 50 beta customers for its Databricks Cloud service based on Spark. Streaming data analysis is just one play for Spark, which makes it a competitor to IBM InfoSphere Streams. How "proven" is Streams, I wonder, and how many customers does it have? Is InfoSphere Streams really getting into the same conversations as Spark and Storm? Big data practitioners seem to have a strong bias toward open-source options, not commercial software. Maybe open source is the real "shiny new thing" that commercial vendors are competing against.

 
Thomas Claburn
50%
50%
Thomas Claburn,
User Rank: Author
3/27/2015 | 6:11:41 PM
Fog?
"Fog" computing just rubs me the wrong way. Why not aim for precision with "distributed computing" instead?
D. Henschen
50%
50%
D. Henschen,
User Rank: Author
3/28/2015 | 7:30:00 AM
Re: Fog?
I had not heard the term "fog" either, but apparently it's a phrase Cisco is using as we'll. Follow the link in James' column to read more about it. As for precision, is "cloud" any more precise? Or "big data"? Sometimes terms manage to catch on as shorthand for a big collection of things. Fog's not there yet, so it sounds a bit forced.
asksqn
50%
50%
asksqn,
User Rank: Ninja
3/30/2015 | 4:46:30 PM
Oorah for Spark
I'm looking forward to Spark given its above referenced versatility and the fact that is is open source.  I see nothing but very good things in Spark's future.
Slideshows
What Digital Transformation Is (And Isn't)
Cynthia Harvey, Freelance Journalist, InformationWeek,  12/4/2019
Commentary
Watch Out for New Barriers to Faster Software Development
Lisa Morgan, Freelance Writer,  12/3/2019
Commentary
If DevOps Is So Awesome, Why Is Your Initiative Failing?
Guest Commentary, Guest Commentary,  12/2/2019
White Papers
Register for InformationWeek Newsletters
Video
Current Issue
Getting Started With Emerging Technologies
Looking to help your enterprise IT team ease the stress of putting new/emerging technologies such as AI, machine learning and IoT to work for their organizations? There are a few ways to get off on the right foot. In this report we share some expert advice on how to approach some of these seemingly daunting tech challenges.
Slideshows
Flash Poll