16 Top Big Data Analytics Platforms - InformationWeek

InformationWeek is part of the Informa Tech Division of Informa PLC

This site is operated by a business or businesses owned by Informa PLC and all copyright resides with them.Informa PLC's registered office is 5 Howick Place, London SW1P 1WG. Registered in England and Wales. Number 8860726.

IoT
IoT
Data Management // Big Data Analytics
News
1/30/2014
09:06 AM
Doug Henschen
Doug Henschen
Slideshows
Connect Directly
LinkedIn
Twitter
RSS
E-Mail

16 Top Big Data Analytics Platforms

Data analysis is a do-or-die requirement for today's businesses. We analyze notable vendor choices, from Hadoop upstarts to traditional database players.
10 of 17

Infobright focuses on machine data 
 
Analytical DBMS: Infobright. 
In-memory DBMS: None. 
Hadoop distribution: None.  
Stream-processing technology: None. 
Hardware/software systems: Infobright Infopliance.
Infobright is a focused analytical database provider with a column-store DBMS aimed at fast analysis of up to 50 terabytes per server; the database is designed for symmetric multiprocessor servers, not massively parallel processing. Fifty TB might sound small for big data, but Infobright's high compression ratio and data-skipping technology is particularly well suited to machine data such as clickstreams, mobile data, log files, and sensor data. 
Infobright is often used in concert with Hadoop or large-scale enterprise data warehouses. The larger store supports long-term, high-scale storage while Infobright, a MySQL engine, quickly drills down on the latest subsets of data of interest. Columnar design and data-skipping technology ensure that only the information that is relevant to each query is analyzed, ensuring sub-second response times. Data is indexed automatically on ingest without partitioning, and minimal tuning is required. It's a low-cost, fleet-footed alternative for fast ad-hoc anlaysis.

Infobright focuses on machine data

Analytical DBMS: Infobright.
In-memory DBMS: None.
Hadoop distribution: None.
Stream-processing technology: None.
Hardware/software systems: Infobright Infopliance.

Infobright is a focused analytical database provider with a column-store DBMS aimed at fast analysis of up to 50 terabytes per server; the database is designed for symmetric multiprocessor servers, not massively parallel processing. Fifty TB might sound small for big data, but Infobright's high compression ratio and data-skipping technology is particularly well suited to machine data such as clickstreams, mobile data, log files, and sensor data.

Infobright is often used in concert with Hadoop or large-scale enterprise data warehouses. The larger store supports long-term, high-scale storage while Infobright, a MySQL engine, quickly drills down on the latest subsets of data of interest. Columnar design and data-skipping technology ensure that only the information that is relevant to each query is analyzed, ensuring sub-second response times. Data is indexed automatically on ingest without partitioning, and minimal tuning is required. It's a low-cost, fleet-footed alternative for fast ad-hoc anlaysis.

10 of 17
Comment  | 
Print  | 
Comments
Newest First  |  Oldest First  |  Threaded View
Page 1 / 3   >   >>
shane88
50%
50%
shane88,
User Rank: Apprentice
10/9/2017 | 5:35:16 AM
Re: It's time for this update
This tool might also help: JSON formatter. Have a nice day!
UrvashiS073
50%
50%
UrvashiS073,
User Rank: Apprentice
4/27/2017 | 1:30:45 PM
IRI Voracity

I would add to this list IRI Voracity, the big data discovery, integration, migration, governance, and analytics platform introduced in 2016 to address the performance, security, quality, complexity and cost issues in legacy vendor and speciality tools.

eyu906
50%
50%
eyu906,
User Rank: Strategist
1/6/2015 | 12:36:09 PM
Drill-downs?
Dell Boomi is the #1 cloud integration platform.  Are you going to drill down to help users regarding technology strategy?
KenB037
100%
0%
KenB037,
User Rank: Apprentice
9/24/2014 | 10:27:52 PM
Great article! When is the next update?
Super overview article! I realize that it will be a lot of work, but it would be great if you decide to write an update sometime time soon.  I am already looking forward to it!
LesterK048
0%
100%
LesterK048,
User Rank: Apprentice
8/8/2014 | 2:51:40 AM
Re: It's time for this update
A smaller company which can process big JSON data for easier visualization is json-csv.com. You may want to check it out.
bigdatarelated
50%
50%
bigdatarelated,
User Rank: Apprentice
4/23/2014 | 11:24:38 AM
Re: A collection of marketing flyers from 16 vendors
Great article. I've added a link to it from  Bigdatarelated, a free big data community resource website.
Akon786
50%
50%
Akon786,
User Rank: Apprentice
2/20/2014 | 6:39:55 AM
Bedrock Data Management Platform 2.0
Comprehensive and well rounded article.

Where does Bedrock Data Management Platform 2.0 figure in the game?
D. Henschen
50%
50%
D. Henschen,
User Rank: Author
2/11/2014 | 1:28:26 PM
Re: Bravo
Thanks, Wayne. Coming from such an esteemed expert, I'm flattered.
weckerson
50%
50%
weckerson,
User Rank: Apprentice
2/6/2014 | 4:33:06 PM
Bravo
Doug, 

Well done. This is a ton of work and well done! A great resource. 

 

Wayne
D. Henschen
100%
0%
D. Henschen,
User Rank: Author
2/5/2014 | 9:18:53 AM
Re: What about Personalized Big Data Analytics?
Analytics tools and BI systems run on servers, but these systems are generally not scaled to handle big data. More often than not, these systems draw data from data warehouses or data marts. Increasingly, a larger-scale "platform" such as a massively parallel processing (MPP) database management system or Hadoop cluster is required to handle the volume and variety of data. Some analytics vendors, notably SAS but including others, are developing their own in-memory cluster software or implementations on top of Hadoop, but the vast majority of clients use analytics and BI software in combination with data-management platforms from third-party vendors like those covered in the collection above.

Confusing matters, many vendors above offer analytic capabilites -- IBM has SPSS and Cognos; SAP has BusinessObjects and Predictive Analysis; Oracle, Pivotal, and Teradata tap advanced SQL analytics, R and various partnerships with analytics vendors including SAS, etc. -- but they're not included in this collection because of those capabilites.

There are many options for smaller companies -- including cloud, price-competitive upstart vendors, and open source options. But where this is great data volume, variety, and velocity, there's a need for a high-scale platform or platforms to serve as the place where the analysis gets done (as with in-database or in-Hadoop analytics) or as the place from which subsets of data are drawn or analyzed (as in the case of Hadoop or data warehouse integration).

 
Page 1 / 3   >   >>
News
Can Cloud Revolutionize Business and Software Architecture?
Joao-Pierre S. Ruth, Senior Writer,  1/15/2021
Slideshows
10 IT Trends to Watch for in 2021
Cynthia Harvey, Freelance Journalist, InformationWeek,  12/22/2020
News
How CDOs Can Build Insight-Driven Organizations
Jessica Davis, Senior Editor, Enterprise Apps,  1/15/2021
White Papers
Register for InformationWeek Newsletters
Video
Current Issue
2021 Top Enterprise IT Trends
We've identified the key trends that are poised to impact the IT landscape in 2021. Find out why they're important and how they will affect you.
Slideshows
Flash Poll