Oracle Big Data SQL: 5 Key Points - InformationWeek

InformationWeek is part of the Informa Tech Division of Informa PLC

This site is operated by a business or businesses owned by Informa PLC and all copyright resides with them.Informa PLC's registered office is 5 Howick Place, London SW1P 1WG. Registered in England and Wales. Number 8860726.

Data Management // Big Data Analytics
01:42 PM
Doug Henschen
Doug Henschen
Connect Directly

Oracle Big Data SQL: 5 Key Points

Oracle's new big data tool won't cover all the analysis bases, but it will enable SQL-savvy professionals to query Hadoop and NoSQL sources.

3. DBAs get Oracle Database-style security controls. So Oracle Big Data SQL opens up table-based access to the data in Hadoop, but with access comes risk. Thus, Oracle gave this feature a way to apply the same kinds of grants, permissions, and policies that DBAs apply when they set up Oracle Database. You might have an "analyst" role defined in Oracle Database that is allowed to see and query some columns but not others, while certain fields of data might be redacted.

"If I want to expose that group of analysts to a set of data that's in Hadoop, I can create an external table in Oracle Database over that data in Hadoop and grant whatever permissions and policies you deem appropriate," McClary explained.

[Want more on the Spark option for big data analysis? Read Databricks Spark Plans: Big Data Q&A.]

4. Oracle Big Data SQL is not a SQL-on-Hadoop tool. This is an important distinction. Oracle Big Data SQL is not just a way to use Oracle SQL against Hadoop. It's a way to query Oracle Database, Hadoop, and NoSQL sources simultaneously.

"SQL on Hadoop is a great idea and we'll continue to ship solutions that provide that, including Impala, Hive, and future efforts to bring Hive on top of Spark," McClary said. "What we're trying to do here is solve a different and perhaps bigger problem, which is integrating big data with the rest of the enterprise architecture."

Oracle Big Data SQL is not a SQL-on-Hadoop option, it's a way to SQL query Oracle Database, Hadoop, and NoSQL sources simultaneously.
Oracle Big Data SQL is not a SQL-on-Hadoop option, it's a way to SQL query Oracle Database, Hadoop, and NoSQL sources simultaneously.

Describing Oracle Big Data SQL as "democratizing big data" and "making it consumable by people outside of Silicon Valley," McClary said the point is bringing the value found in big data sources "home into the business." Home, in this case, means into Oracle Database, where it can be analyzed by the many SQL-savvy professionals instead of just a priesthood of PhD-level data scientists.

5. Oracle Big Data SQL will not do everything. It was refreshing to hear Oracle grant that not everything can be expressed or discerned through SQL. Options like Apache Spark and the R language, for example, support machine learning and advanced analytical data manipulations and workflows that are "all valid," said McClary. "There's a place for SQL in reasoning and operating on large sets of data and there's a place for other languages in doing what they're best suited to handle," he said.

The point of Oracle Big Data SQL is accessing and analyzing data in Hadoop and NoSQL sources without requiring a new set of people with a new set of skills. "It's not enough to have big data experiments, you have to be able to operationalize it," said Mendelson. "That means that the people who are used to running your systems need to be able to provide secure access not just to the privileged few, but potentially to everyone."

InformationWeek's new Must Reads is a compendium of our best recent coverage of the Internet of Things. Find out the way in which an aging workforce will drive progress on the Internet of Things, why the IoT isn't as scary as some folks seem to think, how connected machines will change the supply chain, and more. (Free registration required.)

Doug Henschen is Executive Editor of InformationWeek, where he covers the intersection of enterprise applications with information management, business intelligence, big data and analytics. He previously served as editor in chief of Intelligent Enterprise, editor in chief of ... View Full Bio
We welcome your comments on this topic on our social media channels, or [contact us directly] with questions about the site.
2 of 2
Comment  | 
Print  | 
More Insights
Newest First  |  Oldest First  |  Threaded View
User Rank: Ninja
7/29/2014 | 9:05:58 AM
Re: Well in
The oracle data tool will not be able to cover the entire analyses basis but the comforting thing is that its advantages are more than the disadvantages. The other good thing is that it will be able to open up access to Hadoop data in the third quarter. The fascinating thing is that it will enable the oracle data shops to be able to use bigger data with the help and their expertise. The data storage feature will also be increased meaning that the user will be able to get more space to save their data. The oracle team will only need to elaborate more on the capabilities of the map reduce and the yarn too, Plus their capabilities.
D. Henschen
D. Henschen,
User Rank: Author
7/21/2014 | 3:14:32 PM
SQL and the post-MapReduce Era
There has been a lot said about moving on from MapReduce recently, but I think the big data world needs a clearer understanding of the capabilities of MapReduce, MapReduce 2.0 on YARN and alternatives. As for the Oracle Big Data SQL alternative, Oracle offered the example of correlating Twitter data with customer information, as described above, but I'm hoping to hear about many more examples of what's possible at Oracle Open World (which will, no doubt, be when this feature sees it's general release).
10 Ways to Prepare Your IT Organization for the Next Crisis
Cynthia Harvey, Freelance Journalist, InformationWeek,  5/20/2020
IT Spending Forecast: Unfortunately, It's Going to Hurt
Jessica Davis, Senior Editor, Enterprise Apps,  5/15/2020
Helping Developers and Enterprises Answer the Skills Dilemma
Joao-Pierre S. Ruth, Senior Writer,  5/19/2020
White Papers
Register for InformationWeek Newsletters
Current Issue
Key to Cloud Success: The Right Management
This IT Trend highlights some of the steps IT teams can take to keep their cloud environments running in a safe, efficient manner.
Flash Poll