Software // Information Management
News
5/1/2013
01:28 PM
Connect Directly
Google+
LinkedIn
Twitter
RSS
E-Mail
50%
50%
Repost This

MapR Brings Search To Hadoop

MapR brings new power to HBase, taps LucidWorks to integrate Apache Lucene/Solr search into M7 Hadoop distribution.

Last fall MapR set out to improve on HBase, Hadoop's built-in NoSQL database. On Wednesday it delivered on that promise and it announced a next move: integrating search capabilities with its M7 Hadoop distribution with partner LucidWorks.

With the latest MapR M7 release, available immediately, the company says it has delivered higher performance and easier administration for both Hadoop and HBase by forging its own path on certain aspects of Hadoop infrastructure and administration. Specifically, M7 does away with region servers, table splits and merges, and data compaction steps tied to standard Apache software. Instead it implements an architecture exclusive to MapR for snapshotting, high availability and system recovery.

"We've eliminated the tradeoffs that organizations face in terms of getting scale, consistency, reliability and continuous low-latency performance in one solution, but M7 works across all these dimensions," MapR VP of marketing Jack Norris told InformationWeek.

MapR points to advantages including instant recovery from hardware or software errors, the ability to do online schema modifications for HBase applications, and performance specs exceeding 1 million operations per second on a 10-node Hadoop cluster.

[ Want more on improvements to Hadoop's NoSQL database? Read MapR Promises A Better HBase. ]

To support search, MapR introduced the beta offering of LucidWorks Search software integrated with the M7 platform. The search technologies will be optional, and plans call for general release next quarter. LucidWorks offers a supported software distribution, consulting and training for open source Apache Lucene/Solr search, and it adds commercial development platforms designed to simplify and accelerate the building of search applications.

With search integrated directly with Hadoop, customers will have an easier time building out recommendation engines for retail scenarios, fraud-detection for financial transactions and predictive applications for any number of industries, according to Norris.

"You could do some of these applications in a MapReduce framework, but if you need online performance, MapReduce latency is a problem and having a search platform is extremely useful," Norris explained. MapR can stream data from Hadoop clusters into the search engine from NFS, the file system used in M7 in place of HDFS.

LucidWorks offers an enterprise-hardened and secured version of Apache Lucene/Solr. The software provides a REST-based API, ODBC connectivity, provisions for LDAP and NIS security, and connections to HDFS and NFS among other features.

MapR will provide first-level support for the new search option, but LucidWorks will be available for deeper problem solving when tougher problems emerge, according to Norris. The cost of the LucidWorks Search option was not disclosed.

E2 is the only event of its kind, bringing together business and technology leaders across IT, marketing, and other lines of business looking for new ways to evolve their enterprise applications strategy and transform their organizations to achieve business value. Join us June 17-19 for three days of 40+ conference sessions and workshops across eight tracks and discover the latest insights in enterprise social software, big data and analytics, mobility, cloud, SaaS and APIs, UI/UX and more. Register for E2 Conference Boston today and save $200 off Full Event Passes, $100 off Conference, or get a FREE Keynote + Expo Pass!

Comment  | 
Print  | 
More Insights
The Agile Archive
The Agile Archive
When it comes to managing data, donít look at backup and archiving systems as burdens and cost centers. A well-designed archive can enhance data protection and restores, ease search and e-discovery efforts, and save money by intelligently moving data from expensive primary storage systems.
Register for InformationWeek Newsletters
White Papers
Current Issue
InformationWeek Government, May 2014
Protecting Critical Infrastructure: A New Approach NIST's cyber-security framework gives critical-infrastructure operators a new tool to assess readiness. But will operators put this voluntary framework to work?
Video
Slideshows
Twitter Feed
Audio Interviews
Archived Audio Interviews
GE is a leader in combining connected devices and advanced analytics in pursuit of practical goals like less downtime, lower operating costs, and higher throughput. At GIO Power & Water, CIO Jim Fowler is part of the team exploring how to apply these techniques to some of the world's essential infrastructure, from power plants to water treatment systems. Join us, and bring your questions, as we talk about what's ahead.