Big Data. Big Decisions
InformationWeek
Special Coverage Series


Gnip Provides Twitter's Full Public Archive

Want a heaping helping of public opinion? Gnip has a deal for you: Every public tweet since Twitter launched in March 2006.

10 Tips For Tapping Consumer Sentiment On Social Networks
10 Tips For Tapping Consumer Sentiment On Social Networks
(click image for larger view and for slideshow)

Want a heaping helping of public opinion? Gnip has a deal for you: Every public tweet ever posted on Twitter since the social network launched in March 2006.

The company's new Historical PowerTrack for Twitter delivers all publicly available tweets from the past six years. It leaves out direct (private) messages and deleted comments, however.

Gnip, a social data provider based in Boulder, Colo., has a distribution license agreement with Twitter that allows it to provide historical tweets to its customers. It also delivers the full corpus of Twitter users' observations to the U.S. Library of Congress, which archives the tweets to preserve the public's real-time take on historical events.

But beyond the historical value of the Twitter fire hose, how can businesses benefit from this massive volume of data?

"We're going to see an impact on all industries. Obviously, a lot of business intelligence has focused on internal sales data, inventory--those kinds of things," Gnip president and COO Chris Moody said in a phone interview with InformationWeek.

[ Online comments don't always fall neatly into like or dislike. Learn 7 Ways Sentiment Is Hard To Decipher Online. ]

Marketers, for instance, can use the Twitter archive to refine their predictive analyses.

"You're doing a product launch, and 25% of the conversation is negative. You'll now be able to decide if 25% [is] good or bad. You can go back and look at your last five product launches and see how they compare," Moody said.

The full Twitter historical corpus allows businesses to study people's opinions on multiple generations of their product lines.

"So much product opportunity comes from understanding what customers think of your existing products, and of your competitors' products," Moody said.

Emergency services organizations could also benefit from historical tweets by studying human behavior during natural disasters. For instance, emergency planners could examine tweets from a previous hurricane.

"They can look at conversations geographically and understand what evacuation routes people took," Moody said. "And they can plan more effectively for the next disaster."

Historical tweets can benefit Wall Streeters as well. "Hedge funds have been trading on social data for a while, at least as an input to their high-frequency trading algorithms," said Moody. Having access to the full Twitter archive may help them develop better algorithms that more accurately determine the financial markets' next moves.

As Moody sees it, social data is "a new lens" to the world. "You've got this incredible view that was not available before," he said. "Understanding potential demand--and the world view--is what's interesting to me."

Other companies, some of which are Gnip's customers, also collect Twitter data. But Moody says Gnip's full historical corpus is an industry first. "We have every tweet, which, depending on your use case, can be very important," said Moody. "And we have the rights to re-syndicate that data."

Gnip uses a cloud-based architecture to deliver data in the JSON format to its customers. Its standard service uses a real-time streaming interface, but the historical Twitter data is delivered via download.

"Streaming has bandwidth limitations that could cause customers to wait a long time to receive all the data. By doing a download, we can get the data to them much faster," said Moody.

Will humankind benefit from the release of billions of 140-character observations?

"We don't know what the world's going to do. We don't have any idea of all of the amazing innovation that will come from this data," said Moody.

He added: "We hold this incredible data set, which really represents the world's thoughts at any one moment in time. If you have access to that data, what would you do with it? We're about to find out. That's why we think this is incredibly exciting."

In-memory analytics offers subsecond response times and hundreds of thousands of transactions per second. Now falling costs put it in reach of more enterprises. Also in the Analytics Speed Demon special issue of InformationWeek: Louisiana State University hopes to align business and IT more closely through a master's program focused on analytics. (Free registration required.)



Related Reading


More Insights




Currently we allow the following HTML tags in comments:

Single tags

These tags can be used alone and don't need an ending tag.

<br> Defines a single line break

<hr> Defines a horizontal line

Matching tags

These require an ending tag - e.g. <i>italic text</i>

<a> Defines an anchor

<b> Defines bold text

<big> Defines big text

<blockquote> Defines a long quotation

<caption> Defines a table caption

<cite> Defines a citation

<code> Defines computer code text

<em> Defines emphasized text

<fieldset> Defines a border around elements in a form

<h1> This is heading 1

<h2> This is heading 2

<h3> This is heading 3

<h4> This is heading 4

<h5> This is heading 5

<h6> This is heading 6

<i> Defines italic text

<p> Defines a paragraph

<pre> Defines preformatted text

<q> Defines a short quotation

<samp> Defines sample computer code text

<small> Defines small text

<span> Defines a section in a document

<s> Defines strikethrough text

<strike> Defines strikethrough text

<strong> Defines strong text

<sub> Defines subscripted text

<sup> Defines superscripted text

<u> Defines underlined text

BYTE encourages readers to engage in spirited, healthy debate, including taking us to task. However, BYTE moderates all comments posted to our site, and reserves the right to modify or remove any content that it determines to be derogatory, offensive, inflammatory, vulgar, irrelevant/off-topic, racist or obvious marketing/SPAM. BYTE further reserves the right to disable the profile of any commenter participating in said activities.

Disqus Tips To upload an avatar photo, first complete your Disqus profile. | View the list of supported HTML tags you can use to style comments. | Please read our commenting policy.

Follow InformationWeek

By The Numbers

What Are Your Primary Concerns About Using Big Data Software?

Base: 417 respondents at organizations using or planning to deploy data analytics, BI or statistical analysis software
Data: InformationWeek 2013 Analytics, Business Intelligence and Information Management Survey of 541 business technology professionals, October 2012

What Do You Think?

What's your attitude about SQL analysis on top of Hadoop?
We want fast, standard SQL analysis capabilities on Hadoop ASAP
Hadoop is for unstructured data; SQL is for relational databases
We'll give SQL on Hadoop a try, but relational DBs will remain the mainstay
Given strong SQL support on Hadoop, we'd nix the data warehouse
We're not interested in Hadoop
No opinion



Related Content

From Our Sponsor

Five Big Data Challenges and How to Overcome Them with Visual Analytics

Five Big Data Challenges and How to Overcome Them with Visual Analytics

Business leaders often need a visual snapshot of data to quickly grasp and use it. This paper identifies five challenges in presenting data and how visual analytics can resolve them. Solutions are suggested to overcome the challenges of: speed, data clarity, data quality, displaying meaningful results, and dealing with outliers.

Game-Changing Analytics: How IT Executives Can Use Analytics to Create Innovation and Business Success

Game-Changing Analytics: How IT Executives Can Use Analytics to Create Innovation and Business Success

Today's competitive advantage requires a deeper understanding of your business, your market and your customers. As an IT executive, you can drive that knowledge transformation. In this white paper, learn how to make decisions as a strategic business leader and three steps to begin an analytics initiative within your enterprise.

Data Visualization Techniques: From Basics to Big Data with SAS Visual Analytics

Data Visualization Techniques: From Basics to Big Data with SAS Visual Analytics

High-performance data visualization turns sophisticated analyses into meaningful graphics, leading to faster and smarter decision making. In this white paper, learn how visual analytics can transform big data, with additional features such as real-time functionality, mobile compatibility, robust applications for technical groups and accessibility for nontechnical users.

Big Data: Lessons from the Leaders

Big Data: Lessons from the Leaders

Financial performance, competitive advantage, operational efficiency, strategic decision making - every business goal can extract value from big data, and the time for doubt or inaction has long passed. In this Economist Intelligence Unit report, in-depth interviews with data pioneers reveal the link between the effective use of big data and the bottom line among other results.

Decision-Driven Data Management: A Strategy for Better Decisions with Better Data

Decision-Driven Data Management: A Strategy for Better Decisions with Better Data

Which came first, the data or the decision? This white paper makes the case for having a decision in mind, then tailoring big data's volume, variety and velocity to achieve business results such as overcoming customer dissatisfaction or creating well-informed strategies in real time.

Informationweek Reports

Research: The Big Data Management Challenge

Research: The Big Data Management Challenge

The challenge of big data is real, but most organizations don't differentiate 'big data' from traditional data, and nearly 90% of respondents to our survey use conventional databases as the primary means of handling data. We'll help you understand what constitutes big data (it's not just size) and the numerous management challenges it poses.