Slideshows


12 Top Big Data Analytics Players


October 18, 2011 08:01 AM When data grows into the tens or even hundreds of terabytes, you need a special technology to quickly make sense of it all. From Hadoop to Teradata, check out the top platform options.
« Previous Page  | 1 | 2 |  3 | 4 | 5 | 6 | 7 | 8 | 9 | 10  | Next Page » 
  • E-mail

Hadoop And MapReduce Boil Down Really Big Data

Hadoop is a collection of open-source distributed data-processing components for storing and processing structured, semi-structured, or unstructured data at truly high scale (as in tens or hundreds of terabytes of even petabytes). Clickstream and social-media analysis applications are driving much of the demand, and of particular interest is MapReduce, a technique supported by Hadoop (and a few other environments) that is ideal for processing big data sets. MapReduce breaks a big data problem into sub-problems, distributes those onto dozens, hundreds, or even thousands of processing nodes and then combines the results into a smaller data set that's easier to analyze.

Hadoop runs on low-cost commodity hardware and it scales up at a fraction of the cost of commercial storage and data-processing alternatives. That has made it a staple at Internet giants including AOL, eHarmony, eBay, Facebook, Twitter, and Netflix. But even more traditional firms coping with big data, like JPMorgan Chase, are embracing the platform.

Recommended Reading

Big Data A Big Backup Challenge

Big Data: Informatica Tackles The High-Velocity Problem

EMC Tailors Storage Systems For Big Data

IBM Picks Hadoop To Analyze Large Data Volumes

Databases Alone Can't Conquer Big Data Problems

Oracle's Big Plans For Big Data Analysis

10 Lessons Learned By Big Data Pioneers

« Previous Page  | 1 | 2 |  3 | 4 | 5 | 6 | 7 | 8 | 9 | 10  | Next Page »