Karmasphere Has High Hopes For Impala
Karmasphere provides a reporting, analysis and data-visualization platform for Hadoop. The company has been helping data professionals mine and analyze Web, mobile, sensor and social media data in Hadoop since 2010. The software also is available as a service on Amazon Web Services for use in conjunction with Elastic MapReduce.
Karmasphere uses Hive, the data warehousing component built on top of Hadoop. The company concedes that Hive has its flaws, like lack of speed tied to MapReduce batch processing. But Karmasphere is integrating its software with the Cloudera Impala real-time query framework as one way around those flaws. "Impala dramatically improves speed-to-insight by enabling users to perform real-time, interactive analysis directly on source data stored in Hadoop," stated Karmasphere in an October announcement about the partnership.
We'll see how quickly Impala will mature from private beta testing to proven production use, but if it delivers as promised, Karmasphere and others will see a huge leap forward in low-latency big data analysis.