5 Big Wishes For Big Data Deployments - InformationWeek

4/18/2013
01:27 PM
Doug Henschen

5 Big Wishes For Big Data Deployments

Big data project leaders still hunger for some key technology ingredients. Starting with SQL analysis, we examine the top five wants and the people working to solve those problems.


Wish 1: SQL Analysis At Big-Data Scale
You could compile a massive data set just by gathering all the stories and reports that have been written about the shortage of big-data talent. The most acute need is for data scientist types who know data and who also know how to write custom code, MapReduce jobs, and algorithms to gain insights from big data. But what if SQL-savvy professionals schooled in relational databases and business intelligence (BI) and analytics tools could do more of the heavy lifting? There are many more SQL professionals out there than there are data scientists, and most SQL pros would be eager to expand their career potential.

There's a big push to deliver SQL-analysis capabilities on top of Hadoop, and the talent shortage is just one reason. The second reason for the trend is that Apache Hive, Hadoop's incumbent data warehousing infrastructure, offers a limited subset of SQL-like query capabilities and suffers from slow performance tied to behind-the-scenes MapReduce processing.
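
To make the contrast concrete, here is a minimal sketch of the same per-group total computed two ways: once as a single declarative SQL query, and once as hand-written map, shuffle, and reduce steps. It is an illustration only. It uses Python's built-in sqlite3 module as a stand-in for a SQL engine, not Hive or Hadoop, and the `SALES` data and both function names are hypothetical.

```python
import sqlite3
from collections import defaultdict

# Hypothetical sample data: (region, sale amount) pairs.
SALES = [("east", 10.0), ("west", 5.0), ("east", 2.5), ("west", 7.5)]


def totals_with_sql(rows):
    """Declarative style: one GROUP BY query states the result wanted."""
    conn = sqlite3.connect(":memory:")  # in-memory stand-in, not Hive
    try:
        conn.execute("CREATE TABLE sales (region TEXT, amount REAL)")
        conn.executemany("INSERT INTO sales VALUES (?, ?)", rows)
        cursor = conn.execute(
            "SELECT region, SUM(amount) FROM sales GROUP BY region"
        )
        return dict(cursor.fetchall())
    finally:
        conn.close()


def totals_with_map_reduce(rows):
    """Procedural style: the author spells out map, shuffle, and reduce."""
    # Map: emit one (key, value) pair per input record.
    mapped = [(region, amount) for region, amount in rows]

    # Shuffle: collect all values that share a key.
    grouped = defaultdict(list)
    for key, value in mapped:
        grouped[key].append(value)

    # Reduce: collapse each key's values into a single total.
    return {key: sum(values) for key, values in grouped.items()}


if __name__ == "__main__":
    print(totals_with_sql(SALES))
    print(totals_with_map_reduce(SALES))
```

Both functions return the same totals. The SQL version is the kind of query a BI or relational database professional already writes, while the second version hints at the custom coding that MapReduce-style analysis demands.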

Answering the call for broader, faster SQL querying on Hadoop are projects and initiatives including Cloudera Impala, EMC's HAWQ query feature on the Pivotal HD distribution, Hortonworks Stinger, IBM Big SQL, MapR-supported Apache Drill, and Teradata SQL-H.

Even the NoSQL camp wants better, SQL-like querying. Last year 10Gen added a real-time data aggregation framework to its popular MongoDB NoSQL database. The aggregation framework lets users query data directly within MongoDB without writing and running complicated, batch-oriented MapReduce jobs. Further evidence comes from Acunu, which has developed a SQL-like language, AQL, to support querying on top of Cassandra.

The development of SQL querying capabilities is only the beginning. BI and analytics tools and systems native to big-data platforms are also emerging. Examples include Datameer, Hadapt, Karmasphere and Platfora, each of which offers its own distinctive query, analysis, data-visualization and monitoring capabilities on top of Hadoop.
