4/18/2013
01:27 PM
Doug Henschen

5 Big Wishes For Big Data Deployments

Big data project leaders still hunger for some key technology ingredients. Starting with SQL analysis, we examine the top five wants and the people working to solve those problems.


Wish 3: Easier Paths To Advanced Analytics
Developing algorithms and predictive models is work that has to be carried out by hard-to-find, expensive data scientists. Or is it? Scarcity of talent is one reason big-data, analytics and business intelligence vendors are developing machine-learning approaches. Proven in applications including optical character recognition, spam filtering and computer security threat detection, machine learning uses learning algorithms that are trained by the data itself. If you show the algorithm thousands or tens of thousands of examples of scanned text characters, unsolicited email messages, or virus bots and malware, it can reliably find more examples.
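
To make "trained by the data itself" concrete, here is a minimal sketch of a spam filter that learns only from labeled examples. It assumes a simple naive Bayes word-count model with add-one smoothing, which is one common technique and not one the vendors named in this article necessarily use. The six training messages and the names `EXAMPLES`, `train`, and `classify` are hypothetical, and a real filter would need thousands of examples, as noted above.

```python
import math
from collections import Counter

# Hypothetical toy training set: (message text, label) pairs.
EXAMPLES = [
    ("win free money now", "spam"),
    ("free prize claim now", "spam"),
    ("cheap pills free shipping", "spam"),
    ("meeting moved to monday", "ham"),
    ("lunch at noon tomorrow", "ham"),
    ("project report attached", "ham"),
]

def train(examples):
    """Count how often each word appears under each label."""
    word_counts = {}
    label_counts = Counter()
    for text, label in examples:
        label_counts[label] += 1
        word_counts.setdefault(label, Counter()).update(text.lower().split())
    return word_counts, label_counts

def classify(text, model):
    """Return the label with the highest naive Bayes log-probability."""
    word_counts, label_counts = model
    vocabulary = set()
    for counts in word_counts.values():
        vocabulary.update(counts)
    total_examples = sum(label_counts.values())
    best_label, best_score = None, float("-inf")
    for label in label_counts:
        # Start from the prior: how common this label was in training.
        score = math.log(label_counts[label] / total_examples)
        words_in_label = sum(word_counts[label].values())
        for word in text.lower().split():
            # Add-one smoothing keeps unseen words from zeroing the score.
            numerator = word_counts[label][word] + 1
            denominator = words_in_label + len(vocabulary)
            score += math.log(numerator / denominator)
        if score > best_score:
            best_label, best_score = label, score
    return best_label

model = train(EXAMPLES)
print(classify("claim your free money", model))
print(classify("report for monday meeting", model))
```

No rule in the code says that "free" signals spam; that association comes entirely from the counts in the training examples, which is why adding more examples changes the model's behavior without any reprogramming.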

The same approach can be applied to spotting customers who are ready to churn or jet engines that are about to fail. With machine learning, trained models also can continue to learn from new data. Amazon.com and Netflix, for example, use algorithms to spot patterns in customer transactions so they can recommend other books or movies. When a new book or movie comes out, these companies can start recommending it as soon as their algorithms discern the preference pattern in the data.
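
The idea of recommending a new title once a preference pattern shows up in the data can be sketched with simple co-occurrence counting, a basic form of collaborative filtering. This is an illustrative assumption, not a description of how Amazon.com or Netflix actually build their recommendations; the function names and the sample titles in the usage note are hypothetical.

```python
from collections import defaultdict
from itertools import combinations

def build_cooccurrence(baskets):
    """Count how often each pair of items appears in the same transaction."""
    counts = defaultdict(lambda: defaultdict(int))
    for basket in baskets:
        for a, b in combinations(sorted(set(basket)), 2):
            counts[a][b] += 1
            counts[b][a] += 1
    return counts

def recommend(owned, counts, top_n=3):
    """Rank items the customer lacks by co-occurrence with items they own."""
    scores = defaultdict(int)
    for item in owned:
        for other, n in counts[item].items():
            if other not in owned:
                scores[other] += n
    # Highest score first; ties are broken alphabetically for stable output.
    return sorted(scores, key=lambda i: (-scores[i], i))[:top_n]

# Hypothetical purchase history before a new title is released.
history = [
    ["dune", "foundation"],
    ["dune", "foundation", "neuromancer"],
    ["dune", "neuromancer"],
]
print(recommend({"dune"}, build_cooccurrence(history)))

# Once buyers of "dune" start picking up the new release, it gets recommended.
history += [["dune", "new_release"]] * 3
print(recommend({"dune"}, build_cooccurrence(history)))
```

In this sketch the new title appears in recommendations only after it starts showing up in the same transactions as existing titles, which mirrors the point that the pattern has to be present in the data first.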

Apache Mahout is the leading route to deploying machine-learning-based clustering, classification and collaborative filtering algorithms on Hadoop, but these techniques are also supported by the R statistical programming language. Commercial vendors supporting or embedding machine-learning techniques include Alpine Data Labs, Birst, Causata, Lionsolver, Revolution Analytics and a growing list of others.
