Big Data // Big Data Analytics
News
1/30/2014
09:06 AM
Doug Henschen
Doug Henschen
Slideshows
Connect Directly
Google+
LinkedIn
Twitter
RSS
E-Mail
100%
0%

16 Top Big Data Analytics Platforms

Data analysis is a do-or-die requirement for today's businesses. We analyze notable vendor choices, from Hadoop upstarts to traditional database players.
Previous
1 of 17
Next

Revolutionary. That pretty much describes the data analysis time in which we live. Businesses grapple with huge quantities and varieties of data on one hand, and ever-faster expectations for analysis on the other. The vendor community is responding by providing highly distributed architectures and new levels of memory and processing power. Upstarts also exploit the open-source licensing model, which is not new, but is increasingly accepted and even sought out by data-management professionals.

Apache Hadoop, a nine-year-old open-source data-processing platform first used by Internet giants including Yahoo and Facebook, leads the big-data revolution. Cloudera introduced commercial support for enterprises in 2008, and MapR and Hortonworks piled on in 2009 and 2011, respectively. Among data-management incumbents, IBM and EMC-spinout Pivotal each has introduced its own Hadoop distribution. Microsoft and Teradata offer complementary software and first-line support for Hortonworks' platform. Oracle resells and supports Cloudera, while HP, SAP, and others act more like Switzerland, working with multiple Hadoop software providers.

In-memory analysis gains steam as Moore's Law brings us faster, more affordable, and more-memory-rich processors. SAP has been the biggest champion of the in-memory approach with its Hana platform, but Microsoft and Oracle are now poised to introduce in-memory options for their flagship databases. Focused analytical database vendors including Actian, HP Vertica, and Teradata have introduced options for high-RAM-to-disk ratios, along with tools to place specific data into memory for ultra-fast analysis.

Advances in bandwidth, memory, and processing power also have improved real-time stream-processing and stream-analysis capabilities, but this technology has yet to see broad adoption. Several vendors here complex event processing, but outside of the financial trading, national intelligence, and security communities, deployments have been rare. Watch this space and, particularly, new open source options as breakthrough applications in ad delivery, content personalization, logistics, and other areas push broader adoption.

Our slideshow includes broad-based data-management vendors -- IBM, Microsoft, Oracle, SAP -- that offer everything from data-integration software and database-management systems (DBMSs) to business intelligence and analytics software, to in-memory, stream-processing, and Hadoop options. Teradata is a blue chip focused more narrowly on data management, and like Pivotal, it has close ties with analytics market leader SAS.

Plenty of vendors covered here offer cloud options, but 1010data and Amazon Web Services (AWS) have staked their entire businesses on the cloud model. Amazon has the broadest selection of products of the two, and it's an obvious choice for those running big workloads and storing lots of data on the AWS platform. 1010data has a highly scalable database service and supporting information-management, BI, and analytics capabilities that are served up private-cloud style.

The jury is still out on whether Hadoop will become as indispensable as database management systems. Where volume and variety are extreme, Hadoop has proven its utility and cost advantages. Cloudera, Hortonworks, and MapR are doing everything they can to move Hadoop beyond high-scale storage and MapReduce processing into the world of analytics.

The niche vendors here include Actian, InfiniDB/Calpont, HP Vertica, Infobright, and Kognitio, all of which have centered their big-data stories around database management systems focused entirely on analytics rather than transaction processing. German DBMS vendor Exasol is another niche player in this mold, but we don't cover it here as its customer base is almost entirely in continental Europe. It opened offices in the U.S. and U.K. in January 2014.

This collection does not cover analytics vendors, such as Alpine Data Labs, Revolution Analytics, and SAS. These vendors invariably work in conjunction with platforms provided by third-party DBMS vendors and Hadoop distributors, although SAS in particular is blurring this line with growing support for SAS-managed in-memory data grids and Hadoop environments. We also excluded NoSQL and NewSQL DBMSs, which are heavily (though not entirely) focused on high-scale transaction processing, not analytics. We plan to cover NoSQL and NewSQL platforms in a separate, soon-to-be-published collection.

Now dig in and learn more about these analytics vendors and how they compare.

 

Previous
1 of 17
Next
Comment  | 
Print  | 
More Insights
Comments
Newest First  |  Oldest First  |  Threaded View
<<   <   Page 3 / 3
mhummel515
50%
50%
mhummel515,
User Rank: Apprentice
1/30/2014 | 1:09:00 PM
Re: ParStream - real-time database for big data analytics
Hi Doug,

I appreciate your interest in ParStream and will make sure to get in contact to provide insights into our product and customer base.

Unfortunately, some of our customers are "shy" and do not want to be named publicly. This unfortunately includes the US customers.

Looking forward to get in touch.



Mike
Laurianne
0%
100%
Laurianne,
User Rank: Author
1/30/2014 | 1:03:16 PM
Important Big Data Context
Doug has supplied important context for those people choosing between big data analysis vendors. Please tell us if there are aspects you would like more/less detail on when we do the next roundup. Readers, are you surprised by how much support Hadoop has won from the bigs in the last 24 months?
D. Henschen
50%
50%
D. Henschen,
User Rank: Author
1/30/2014 | 12:53:55 PM
Re: ParStream - real-time database for big data analytics
Thanks for your note. I don't hear much about ParStream, and your list of customers isn't studded with well-know companies. I excluded Exasol for much the same reason -- a number of customers in Germany, but not a presence in North America where we get the vast majority of our readership. Your technology is of interest, however, so feel free to contact me, particularly with customer case example.
mhummel515
100%
0%
mhummel515,
User Rank: Apprentice
1/30/2014 | 12:09:03 PM
ParStream - real-time database for big data analytics
Hi Doug,

I very much appreciate your update on the big data analytics vendor market. 

I am curious to find out why you did not include ParStream in your list of big data analytics players. Agreed that the majority of our customers is still in Europe but through our strong presence in Cupertino and Boston we have won numerous customers in the US.

Our focus is on Fast Data - continuous data import at very high bandwidth combined with sub-second response times on billions of data records. 

Through partnerships with leading front-end and ETL tool providers AND Hadoop we offer a super-fast analytics solution that outperforms all product on your list.

I am happy to share with you performance benchmarks and introduce you to our reference customers.

Best
Mike
CEO ParStream
D. Henschen
100%
0%
D. Henschen,
User Rank: Author
1/30/2014 | 11:52:39 AM
It's time for this update
It has been little more than two years since we published our 12 Top Big Data Analytics Players collection, but so much has changed and so many new players have emerged that we needed this update. Over the last 26 months, all the big data-management vendors -- IBM, Microsoft, Oracle, SAP, Teradata -- have really embraced Hadoop. And whether they're adding SQL-on-Hadoop options -- a la Actian, InfiniDB/Calpont, and Pivotal -- or exploiting unprecidented levels of RAM -- as with Kognitio and SAP -- database management system suppliers are changing the scope and speed of their analysis capabilities.

The biggest change, though, is that practitioners are considering the data that they have on hand, the data that they're currenlty throwing away, and the data that they could collect with sensors or smart phones. They're considering all-new applications and, in some cases, entirely new business models. Innovaters may not want to delay or get their hands too dirty with all this technology, however, so we're seeing cloud options from 1010data, Amazon Web Services and others gathering steam.

We may have reached the end of the beginning of the big data era. But it's time to move beyond the speculative hype and get down to the business of on creating breakthrough applications that deliver value. Let 2014 be the year we shift from focusing on what could be to what is actually happening in the world of big data analysis.  
<<   <   Page 3 / 3
6 Tools to Protect Big Data
6 Tools to Protect Big Data
Most IT teams have their conventional databases covered in terms of security and business continuity. But as we enter the era of big data, Hadoop, and NoSQL, protection schemes need to evolve. In fact, big data could drive the next big security strategy shift.
Register for InformationWeek Newsletters
White Papers
Current Issue
InformationWeek Government Tech Digest Oct. 27, 2014
To meet obligations -- and avoid accusations of cover-up and incompetence -- federal agencies must get serious about digitizing records.
Video
Slideshows
Twitter Feed
InformationWeek Radio
Archived InformationWeek Radio
Join us for a roundup of the top stories on InformationWeek.com for the week of October 26, 2014 and for the incredible Friday Afternoon Conversation that runs beside the program.
Sponsored Live Streaming Video
Everything You've Been Told About Mobility Is Wrong
Attend this video symposium with Sean Wisdom, Global Director of Mobility Solutions, and learn about how you can harness powerful new products to mobilize your business potential.