4/22/2014
10:06 AM

Microsoft Bing Enters Prediction Business

Microsoft claims Bing can predict the future, but it is starting out with soft questions like who will win The Voice.

From Google's Flu Trends project to Nate Silver's election forecasts, data-driven attempts to predict the future continue to grab headlines. This week, Microsoft teased its interest in this emerging prognostication industry, releasing a new predictive tool for Bing. Currently, the feature only forecasts TV competition winners, but it could soon include sports matches, elections, concert ticket prices, and more.

In a blog post, the Bing Predictions Team said its technology can forecast results using Bing search queries and social media data from sources such as Facebook and Twitter. But researchers have recently criticized similar experiments, such as Google's aforementioned flu tracker, for methodological errors. Microsoft will have to prove it can turn its data into useful insights.

At launch, Bing's predictive tools apply only to The Voice, American Idol, and Dancing with the Stars. But a Microsoft website promises "predictions for winners of sporting events, most popular vacation destinations, whether concert ticket prices are going to go up, and more." The blog post contextualized Microsoft's interest around increased Bing traffic during elections, suggesting Bing's crystal ball will eventually predict political contests too.

Bing makes predictions about TV competitions by drawing inferences about a given competitor's popularity. Microsoft broadly defines popularity according to query volume, the positive or negative charge of search terms, and social media trends. The company said it calibrated Bing "to account for biases, such as regional preferences, and other measurable and observable trends." Microsoft also said Bing knows how to balance recent popularity against long term trends. According to the Bing Team's data, for example, winners on The Voice aren't necessarily those who performed best in the most recent episode, but rather those who've already developed a following.
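Microsoft has not published its model, so the following is only a rough sketch of how the signals described above might be combined. Every name, weight, and input value here is a hypothetical chosen for illustration: it assumes each signal has already been normalized, blends query volume, search-term sentiment, and social activity into a weekly score, then weighs the latest week against the long-term average. The regional-bias calibration Microsoft mentions is not modeled.

```python
from dataclasses import dataclass


@dataclass
class WeeklySignal:
    """One week of (hypothetical, pre-normalized) signals for a contestant."""
    query_volume: float     # search volume, scaled to 0..1
    sentiment: float        # net charge of search terms, -1 (negative) .. 1 (positive)
    social_mentions: float  # social media activity, scaled to 0..1


def weekly_score(s, w_query=0.4, w_sentiment=0.3, w_social=0.3):
    """Weighted blend of the three signals; the weights are illustrative guesses."""
    sentiment_01 = (s.sentiment + 1) / 2  # rescale -1..1 to 0..1
    return (w_query * s.query_volume
            + w_sentiment * sentiment_01
            + w_social * s.social_mentions)


def popularity(history, recency_weight=0.3):
    """Balance the most recent week against the long-term average score."""
    scores = [weekly_score(s) for s in history]
    long_term = sum(scores) / len(scores)
    return recency_weight * scores[-1] + (1 - recency_weight) * long_term


# Made-up data: an established favorite with a weak latest week, and a
# newcomer with little following but a strong latest week.
established = [WeeklySignal(0.8, 0.5, 0.8)] * 3 + [WeeklySignal(0.6, 0.2, 0.6)]
newcomer = [WeeklySignal(0.2, 0.0, 0.2)] * 3 + [WeeklySignal(0.9, 0.8, 0.9)]

for name, history in (("established", established), ("newcomer", newcomer)):
    print(name, round(popularity(history), 3))
```

With a low recency weight, the established contestant still ranks ahead despite the newcomer's stronger latest week, which mirrors the pattern the Bing Team describes for The Voice. Raising `recency_weight` toward 1 would flip the ranking.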

To activate the feature, Bing users can append "predictions" to the end of their searches -- e.g., "The Voice predictions." Results appear in a carousel above the normal search results.

Experts debate the extent to which search engines and social tools can predict the future. In 2008, Google claimed it could anticipate flu outbreaks by tracking queries related to influenza. The search giant has since made a variety of similar claims. In June, for example, Google said it can predict a film's box office performance with up to 94% accuracy. Researchers have also used Wikipedia and Twitter-derived user analytics to predict flu surges, while others have linked search results to stock market fluctuations.

Bing now predicts the outcome of The Voice and other TV competitions (Image credit: Microsoft)

In a 2010 study, five Yahoo researchers claimed "what consumers are searching for online can predict their collective future behavior days or even weeks in advance." The study cited opening weekend box office revenue, first-month video game sales, and Billboard Hot 100 rankings as examples. Interestingly, all five researchers now work for Microsoft Research, according to their respective LinkedIn profiles.

Microsoft has repeatedly touted predictive technologies in recent months, but usually in the context of machine learning, with examples ranging from Windows Phone 8.1's Cortana, to new Office tools that infer which emails the user tends to care about. With the company now including social-scale predictions in search results, it has opened up a new world of possibilities -- but first, it will have to convince skeptics.

Google's Flu Trends garnered criticism last fall, when the company revealed it had significantly overestimated influenza occurrences. At the time, Google said increased media attention had skewed the results by encouraging more people to enter flu-related search terms, even if they weren't feeling sick.

A March paper in the journal Science pushed the issue further, however. Researchers argued that projects such as Flu Trends can encourage a "big data hubris" in which companies ascribe empirical muster to conclusions spun from flawed number-crunching strategies. The study noted that during a 108-week stretch ending in September 2013, Google Flu Trends' weekly conclusions overestimated influenza outbreaks 100 times. The authors attributed the unreliable forecasts to a basic flaw in Google's methodology: People search for "flu" without necessarily knowing what influenza actually is, which means Google's data set is skewed by millions of users with flu-like ailments but not the flu itself.
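To put the study's figures in proportion, the share of weeks with an overestimate can be worked out directly. The two counts come from the paragraph above; the variable names are merely illustrative.

```python
# Figures as reported: 100 overestimates across a 108-week stretch.
weeks_observed = 108
weeks_overestimated = 100

# Fraction of weekly estimates that ran too high.
overestimate_rate = weeks_overestimated / weeks_observed
print(f"Overestimated in {overestimate_rate:.1%} of weeks")
```

That works out to roughly 93% of the weekly estimates in the period.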

Time will tell if Microsoft's Bing Team has learned to avoid such oversights.

Michael Endler joined InformationWeek as an associate editor in 2012. He previously worked in talent representation in the entertainment industry, as a freelance copywriter and photojournalist, and as a teacher. Michael earned a BA in English from Stanford University in 2005 ...

Comments
AnnFeeney,
User Rank: Apprentice
4/23/2014 | 8:42:28 AM
Re: Seems smart
With sporting events, solo sports (the closest equivalent to The Voice) will be relatively easy compared to team sports. Each additional player introduces so many more variables. In one sense, though, most sports are easier to calculate because there's an established ordinal definition of winning. In artistic performances, so many of the measures are subjective.

At some point, the system for predicting the winner of The Voice could become so fine-tuned that it could predict the winner from the first performance. A sufficiently advanced neural net, given enough data, should be able to do it. Perhaps in the near future, one of the judges would be a computer interface. (I'm imagining a kind of Kirk-Spock-McCoy face off, with the human McCoy snarling at the Spock interface that music is about human emotions, not algorithms.)

Studio executives would probably be delighted to have a system that could be applied to audition tapes, YouTube, and so on. 
Shane M. O'Neill,
User Rank: Author
4/22/2014 | 2:53:05 PM
Re: Seems smart
Sports bets are notoriously difficult to get right consistently -- even with all the data on the table. I'm not convinced an analysis of Bing search queries and Facebook and Twitter data would help predict that shocking NFL playoff upset. But I'll take it over my own lazy research, which usually consists of listening to a podcast and skimming a few articles. If Bing can give bettors an edge, it could disrupt an industry. Vegas definitely will not be happy.
Lorna Garey,
User Rank: Author
4/22/2014 | 11:18:30 AM
Seems smart
This plan lets MS hone and tune its predictive tool on prognostications that don't matter in the long run (except to the singers being eliminated!) while also getting some mainstream publicity. It'll be interesting to see what the Vegas odds-makers think of the effort, especially once it branches out to sporting events.