MapR Ships Drill For SQL Analysis Of Big Data - InformationWeek
9/16/2014
09:26 AM

MapR Ships Drill For SQL Analysis Of Big Data

MapR says Apache Drill SQL-on-Hadoop option supports flexible data exploration, more extensive SQL support than Cloudera Impala.


Plenty of Hadoop vendors and hangers-on are promising SQL-on-Hadoop capabilities, but in the process they're buying into the old, inflexible model-before-querying approach to data analysis.

Hadoop software distributor MapR on Tuesday announced it will start shipping Apache Drill software that it says delivers a more flexible, big-data-savvy data-exploration approach.

MapR says that, unlike Apache Hive and Cloudera's Impala option for SQL analysis on Hadoop, Drill, which is based on Google Dremel, does not require IT people to anticipate queries and set up data models in advance. Instead, Drill is designed for data exploration first. Compatible data sources include Hadoop sources such as HDFS, Hive, and HBase tables; NoSQL data from sources such as MongoDB and REST APIs; and self-describing data such as Avro, Parquet, and JSON files with nested structures.


"The model-first approach is the antithesis of the approach of exploring what big data is trying to tell you," said Jack Norris, MapR's chief marketing officer, in a phone interview with InformationWeek. "Drill allows schema discovery on the fly, support for modern data structures, and support for ANSI SQL."

Drill's approach is more flexible than that of Hive or Impala, said Norris, because data analysts can explore the data before they set up fixed schemas, ETL processes, or hardened production queries. Instead of fixing on a schema before the query engine can touch the data, Drill lets users explore first, and the engine automatically discovers source schemas and adjusts query plans accordingly as SQL queries are applied.
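To make the explore-first idea concrete, here is a minimal Python sketch of schema discovery at read time over self-describing JSON records with nested fields. It illustrates the general concept only and is not Drill's implementation or API; the function names `discover_schema` and `select`, the dotted-path notation, and the sample records are all hypothetical.

```python
import json


def discover_schema(records):
    """Merge field paths and observed value types across all records.

    Nested objects are flattened into dotted paths (an assumed notation).
    """
    schema = {}

    def walk(value, prefix):
        if isinstance(value, dict):
            for key, child in value.items():
                walk(child, f"{prefix}.{key}" if prefix else key)
        else:
            schema.setdefault(prefix, set()).add(type(value).__name__)

    for record in records:
        walk(record, "")
    return schema


def select(records, path):
    """Yield the value at a dotted path from each record, or None if absent."""
    for record in records:
        value = record
        for key in path.split("."):
            value = value.get(key) if isinstance(value, dict) else None
        yield value


# Hypothetical sample data: no schema is declared before reading it.
raw_lines = [
    '{"user": {"name": "ana", "city": "Lima"}, "clicks": 3}',
    '{"user": {"name": "bo"}, "clicks": 5, "referrer": "ad"}',
]
records = [json.loads(line) for line in raw_lines]

schema = discover_schema(records)          # fields found while reading
names = list(select(records, "user.name"))
cities = list(select(records, "user.city"))  # missing field yields None
```

The second record lacks `user.city` and adds `referrer`, yet both records can be queried without a model defined up front. That is the kind of flexibility Norris contrasts with the model-first approach.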

Source: MapR

In addition to providing an SQL query interface, Drill exposes an ODBC connector through which data sources can be explored with simple desktop tools, like Microsoft Excel or Tableau Software, or through more sophisticated business intelligence suites. Though it's currently in a 0.5 (pre-production-ready) beta release, Drill supports 15 of the 22 SQL queries used in the TPC-H performance benchmark, whereas Cloudera Impala supports only two of those queries, according to MapR executives.

Though Drill is described by MapR as an open community, MapR is its chief advocate, and it is the only Hadoop vendor distributing the software. Cloudera, the leading Hadoop distributor by customer numbers, is pushing Impala, while Hortonworks is advancing the capabilities of Apache Hive, the most popular SQL-on-Hadoop tool available.

Currently in early beta, Drill is far from recommended for production use, and MapR's announcement offered few beta customer references. Instead, partners and analysts offered their opinions on MapR's news.

"Apache Drill's ability to provide access to data in Hadoop without the need for centralized schemas and also NoSQL datasets with complex data structures including nested and repeated fields differentiates it from traditional approaches to SQL-on-Hadoop," stated Matt Aslett, research director, data platforms and analytics, 451 Research, in MapR's press release.


Doug Henschen is Executive Editor of InformationWeek, where he covers the intersection of enterprise applications with information management, business intelligence, big data and analytics. He previously served as editor in chief of Intelligent Enterprise, editor in chief of ...

Comments
D. Henschen,
User Rank: Author
9/16/2014 | 1:28:57 PM
"Few" end-users, not "no" end users.
This article was revised to reflect that in this early beta stage there are few beta customers for Drill (not "no" end users, as I originally had it). It's just that they're tech industry insiders, including Cisco and Solutionary. At the launch of Impala there were a handful of beta customers, including Monsanto, as I recall, who were able to discuss their use of the tool. It's always good to get a sense of things from users of the technology rather than the filtered, canned insights of vendors, partners, and analysts. Watch the Apache Drill community for more real-world testimonials.