Google Big Data Tools Vs. The Flu Bug - InformationWeek
Data Management // Big Data Analytics
02:14 PM
Connect Directly
Ransomware: Latest Developments & How to Defend Against Them
Nov 01, 2017
Ransomware is one of the fastest growing types of malware, and new breeds that escalate quickly ar ...Read More>>

Google Big Data Tools Vs. The Flu Bug

Google Flu Trends mines aggregated search queries to estimate flu activity across much of the world.

 7 Big Data Solutions Try To Reshape Healthcare
7 Big Data Solutions Try To Reshape Healthcare
(click image for larger view and for slideshow)

This winter is turning into a severe flu season across large sections of the United States. According to the U.S. Centers for Disease Control (CDC), 29 states and New York City are reporting high levels of influenza-like illness (ILI), while another nine states are experiencing moderate flu levels.

What's the best way to monitor flu activity across the world? That's debatable, of course, but Google has an innovative solution: Use aggregated search data to track the flu in "near real-time," according to the company.

The Google Flu Trends site isn't new --, the company's philanthropic arm, launched it in 2008 -- but it's a good example of how organizations and governments can mine big data for valuable insights.

So why use search queries to track flu activity worldwide? After all, isn't that what global health agencies like the CDC do already? Yes, but Google Flu Trends, by analyzing aggregated queries, can detect disease outbreaks much faster than these agencies, Google claims. And while health reports are often updated weekly and limited to a single country, Google Flu Trends has a near-global reach: It gathers data from wherever people use Google search. And since it's updated daily, it delivers more timely information.

[ Learn how big data is expected to improve food and drug safety. See FDA Hops On Big Data Bandwagon. ] explains the connection between search queries and flu outbreaks:

"We have found a close relationship between how many people search for flu-related topics and how many people actually have flu symptoms. Of course, not every person who searches for 'flu' is actually sick, but a pattern emerges when all the flu-related search queries are added together."

By comparing query totals with data from conventional flu surveillance systems, Google has found that flu-related search queries are (not surprisingly) quite common during the flu season. And by counting the number of these queries, Google can then estimate flu activity in regions of the world that use its search engine.

Google determines flu activity levels -- intense, high, moderate, low or minimal -- by comparing current estimates from search data with official historic influenza information for a particular region. On January 8, 2013, for instance, it listed flu activity in the U.S. as "intense," a determination in line with the CDC reports of severe flu outbreaks across much of the country.

Flu Trends uses IP address information from Google's server logs to determine the origin of search users' queries.

Google doesn't position Flu Trends as a replacement for traditional data from health agencies, but rather as a complement that can help public health officials detect disease outbreaks early on, and hopefully limit the number of people affected.

In January 2008, for instance, Google Flu Trends detected a significant increase in flu activity in the U.S. Mid-Atlantic region. By comparison, published CDC reports were about two weeks behind, and hadn't yet shown this increase.

Conventional flu-surveillance reports typically come from doctors and health professionals. They're a good source of demographic data, which health authorities can't get from search queries.

Flu Trends' reach isn't truly global at this time; Google provides flu estimates for more than 25 countries from North and South America, Europe, Australia and parts of Asia. However, it doesn't include flu data for China, India, Indonesia, the Middle East and almost all of Africa (except for South Africa).

Of course, most search users don't want Google to keep track of every time they're (potentially) ill. The search giant addresses these privacy concerns by using aggregated, anonymized counts of weekly queries.

One thing Google Flu Trends can't do: Suggest the best recipe for chicken soup.

Predictive analysis is getting faster, more accurate and more accessible. Combined with big data, it's driving a new age of experiments. Also in the new, all-digital Advanced Analytics issue of InformationWeek: Are project management offices a waste of money? (Free registration required.)

Comment  | 
Print  | 
More Insights
Newest First  |  Oldest First  |  Threaded View
How Enterprises Are Attacking the IT Security Enterprise
How Enterprises Are Attacking the IT Security Enterprise
To learn more about what organizations are doing to tackle attacks and threats we surveyed a group of 300 IT and infosec professionals to find out what their biggest IT security challenges are and what they're doing to defend against today's threats. Download the report to see what they're saying.
Register for InformationWeek Newsletters
White Papers
Current Issue
2017 State of IT Report
In today's technology-driven world, "innovation" has become a basic expectation. IT leaders are tasked with making technical magic, improving customer experience, and boosting the bottom line -- yet often without any increase to the IT budget. How are organizations striking the balance between new initiatives and cost control? Download our report to learn about the biggest challenges and how savvy IT executives are overcoming them.
Twitter Feed
Sponsored Live Streaming Video
Everything You've Been Told About Mobility Is Wrong
Attend this video symposium with Sean Wisdom, Global Director of Mobility Solutions, and learn about how you can harness powerful new products to mobilize your business potential.
Flash Poll