Commentary

John Foley
Editor, InformationWeek  

Amazon Debuts Public Data Cloud

Amazon.com has introduced a new service in which it hosts large data sets -- economic, demographic, scientific, and medical data, for example -- that are open for anyone to access. It's an interesting proposal, but one that casts Amazon in the potentially difficult role of having to be an information gatekeeper.

The new offering, hosted on Amazon's recently introduced Elastic Block Store, is called Amazon Web Services (AWS) Hosted Public Data Sets. Amazon first described the service a few weeks ago; today marks the official launch.

So far, Amazon has assembled a half dozen or so data sets from a variety of sources, including a repository of 3-D chemical structures, and census, labor, transportation, and economic stats from the U.S. government. It's looking to expand the number and types of data sets hosted on AWS.


Here's how it works: Data sets are hosted for free on Amazon's Elastic Block Store, and users then use a data set to create their own volume, which they can modify and manipulate. The catch is that users need to have an EC2 account, and they'll pay for any compute and storage resources consumed in the process. Amazon says most data sets range from 1 GB to 1 TB in size; it can accommodate larger data sets by divvying them up into 1 TB volumes.
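
To make the 1 TB limit concrete, here is a rough sketch of how a larger data set might be carved into volumes. It is only an illustration of the arithmetic, not Amazon's actual procedure: the function name `split_into_volumes` is hypothetical, and it assumes 1 TB means 1,024 GB.

```python
# Assumption: 1 TB is treated as 1,024 GB. Amazon's exact accounting may differ.
MAX_VOLUME_GB = 1024


def split_into_volumes(dataset_gb, max_volume_gb=MAX_VOLUME_GB):
    """Return the volume sizes (in GB) needed to hold a data set.

    Hypothetical helper for illustration only; it fills as many
    full-size volumes as possible and puts the remainder in a last one.
    """
    if dataset_gb <= 0:
        raise ValueError("dataset_gb must be a positive whole number of GB")
    full_volumes, remainder_gb = divmod(dataset_gb, max_volume_gb)
    sizes = [max_volume_gb] * full_volumes
    if remainder_gb:
        sizes.append(remainder_gb)
    return sizes


if __name__ == "__main__":
    # A 2,500 GB data set would need two full volumes plus a smaller third.
    print(split_into_volumes(2500))
```

Under those assumptions, a data set at or below the limit fits in a single volume, and anything larger spills into additional volumes, each of which would be billed as storage once a user creates a copy.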

What kind of data qualifies? Amazon says, vaguely, that the data must be "useful and interesting" and that the person or organization sharing it must have the right to do so. That seems straightforward enough with data made available by the feds, but it could get dicey depending on the nature of the data or its source. For example, many kinds of health care, financial, and demographic information may have privacy or governance implications, and there could be copyright issues with other kinds of content.

When I asked Amazon VP Adam Selipsky where the company would draw the line between what data gets accepted as a public data set and what doesn't, he admitted that Amazon doesn't have clear-cut guidelines. "We use judgment there," he said. It will be interesting to see how well Amazon adapts to this role as information steward and how long it takes before someone cries foul.






