Big Data is This Year's Hot Topic

Hadoop is a fast-growing choice for handling large-scale-data (as in petabytes) as well as complex-data (as in XML, e-mail, images and more) and mixed-data (as in structured mashed with semi-structured data types).

Doug Henschen, Executive Editor, Enterprise Apps

June 30, 2010

2 Min Read
InformationWeek logo in a gray background | InformationWeek

Today's headline on Hadoop provides yet more evidence that big data is this year's hot topic in the world of information management, business intelligence and analytics. If you're involved in data warehousing, you surely know what Hadoop is by now. In case you've been under a rock, it's Java-based software framework for distributed processing of data-intensive transformations and analyses. The software automatically distributes processing of up to petabytes of data across thousands of nodes on low-cost, commodity hardware. Offering cost and speed advantages, Hadoop is a fast-growing choice for handling large-scale-data (as in petabytes) as well as complex-data (as in XML, e-mail, images and more) and mixed-data (as in structured mashed with semi-structured data types).

Image Gallery: Yahoo!'s Hadoop Implementation

(click for larger image and for full photo gallery)

I was quite impressed by the big crowd and mainstream-enterprise types attending last year's Hadoop World NYC conference. But as Hadoop-co-founder-turned-Cloudera-executive Doug Cutting and many others acknowledged at the event, much work had yet to be done to turn Hadoop into a familiar and easy-to-use environment. Today's story illustrates progress being made by Talend and Quest (in terms of technology) as well as by IBM (more symbolically).

Image Gallery: 11 Leading Data Warehousing Appliances

(click for larger image and for full photo gallery)

Of course, Hadoop is old news at Yahoo (where Cutting used to work). Check out Yahoo's Hadoop implementation in the image gallery at right.

Hadoop is not a place for conventional BI, reporting and analysis. For that, Hadoop users are bringing winnowed-down result sets into more accustomed data warehousing environments. There, too, times have changed, and new deployments are likely to be in one of the data warehousing appliances or configurations you'll see in the image gallery at left.

Enjoy the pictures (and the long weekend), and by all means share you insights and experiences with Hadoop or data warehouse appliance environments through the commenting function below.Hadoop is a fast-growing choice for handling large-scale-data (as in petabytes) as well as complex-data (as in XML, e-mail, images and more) and mixed-data (as in structured mashed with semi-structured data types).

About the Author

Doug Henschen

Executive Editor, Enterprise Apps

Doug Henschen is Executive Editor of InformationWeek, where he covers the intersection of enterprise applications with information management, business intelligence, big data and analytics. He previously served as editor in chief of Intelligent Enterprise, editor in chief of Transform Magazine, and Executive Editor at DM News. He has covered IT and data-driven marketing for more than 15 years.

Never Miss a Beat: Get a snapshot of the issues affecting the IT industry straight to your inbox.

You May Also Like


More Insights