Comments
Pentaho Preps Data On Hadoop, Analyzes On MongoDB
Lorna Garey,
User Rank: Author
7/8/2014 | 3:48:24 PM
Re: Data integration vendors are hot to get in on big data
How will Pentaho monetize this? The number of customers paying for enterprise support doesn't seem all that high.
D. Henschen,
User Rank: Author
7/8/2014 | 3:29:39 PM
Re: Pentaho system, ungainly or powerful?
Sorry, but I guess the headline is potentially misleading. Data prep on Hadoop is in service of predictive analysis (done with tools such as Pentaho Weka, R, or, soon, according to Pentaho, SAS or MATLAB). The support for MongoDB is a separate thing, only for BI/data-visualization-style analysis (not predictive work) on the data managed by MongoDB. The two are not connected other than the fact that they are both capabilities introduced in Pentaho 5.1.
Charlie Babcock,
User Rank: Author
7/8/2014 | 3:24:06 PM
Pentaho system, ungainly or powerful?
To "befriend the data scientist" is no easy task. It's all too easy to be a friend to few, stranger to many. The combination of Hadoop with YARN on top for data prep, with the results plugged into MongoDB, sounds like a powerful system -- as long as the movement between the two of them is smooth.
D. Henschen,
User Rank: Author
7/8/2014 | 1:13:09 PM
Data integration vendors are hot to get in on big data
When Hadoop first emerged, we all heard it would displace ETL. That's at least partially true for some transformation processing, but now data-integration vendors -- like Informatica, Paxata, and now Pentaho -- are saying their stuff is needed for all sorts of data prep and processing ahead of big-data analysis. It's another case of offering an alternative to clunky MapReduce processing, but I haven't talked to enough customers who have validated how useful these tools can be in big-data-analysis scenarios.

The "80% of the work" line above seems like a relic of relational data warehousing approaches, but I need to hear from more practitioners -- yes, this is a naked plea for comments from practitioners -- before passing this off as an overstatement or marketing ploy.

