12 Top Big Data Analytics Players
October 18, 2011 08:01 AM When data grows into the tens or even hundreds of terabytes, you need a special technology to quickly make sense of it all. From Hadoop to Teradata, check out the top platform options.
Hadoop And MapReduce Boil Down Really Big Data
Hadoop is a collection of open-source distributed data-processing components for storing and processing structured, semi-structured, or unstructured data at truly high scale (as in tens or hundreds of terabytes of even petabytes). Clickstream and social-media analysis applications are driving much of the demand, and of particular interest is MapReduce, a technique supported by Hadoop (and a few other environments) that is ideal for processing big data sets. MapReduce breaks a big data problem into sub-problems, distributes those onto dozens, hundreds, or even thousands of processing nodes and then combines the results into a smaller data set that's easier to analyze.
Hadoop runs on low-cost commodity hardware and it scales up at a fraction of the cost of commercial storage and data-processing alternatives. That has made it a staple at Internet giants including AOL, eHarmony, eBay, Facebook, Twitter, and Netflix. But even more traditional firms coping with big data, like JPMorgan Chase, are embracing the platform.
Recommended Reading
Big Data A Big Backup Challenge
Big Data: Informatica Tackles The High-Velocity Problem
EMC Tailors Storage Systems For Big Data
IBM Picks Hadoop To Analyze Large Data Volumes
Databases Alone Can't Conquer Big Data Problems
Oracle's Big Plans For Big Data Analysis
10 Lessons Learned By Big Data Pioneers








