Big Data // Big Data Analytics
Commentary
7/19/2013
03:59 PM
Wyatt Kash
Wyatt Kash
Commentary
Connect Directly
LinkedIn
Twitter
RSS
E-Mail
50%
50%
Repost This

Data.gov Gets Updated: A Closer Look

Version 2.0 of government's data portal relies heavily on open-source platforms, but finding usable data can still be like looking for buried treasure.

5 Big Wishes For Big Data Deployments
5 Big Wishes For Big Data Deployments
(click image for larger view and for slideshow)

The White House's release this week of a newly designed version of its government data portal, Data.gov, was greeted with predictable fanfare and generally underwhelming reviews. The site's introduction highlights the administration's latest steps to make government data more readily available to the public, but it also marks another step forward for the government's use of open-source software.

U.S. Deputy CTO Nick Sinai and Senior Advisor Ryan Panchadsaram, in announcing the new design of Data.gov, were upfront in declaring, "Next.Data.gov is far from complete (think of it as a very early beta)."

On the surface, the new portal is in fact a fresh start aimed at making it easier for data enthusiasts and application developers to find, visualize and reuse government data. Under the hood, however, it also embraces the government's expanding reliance on open-source software.

[ Want more on how agencies might meet President's open data mandate? Read Data Management Key To Federal Open Data Policy. ]

The new Data.gov site, for instance will be making use of Apache Solr for search, CKAN (Comprehensive Knowledge Archive Network) for its data management platform and WordPress for its content management system. It even makes use of open-source fonts.

The preview of the new site is a clear response to the President's technology-driven management agenda announced earlier this month and a White House executive order issued in May for agencies to make their data more accessible, including being machine readable by default.

But in many respects, the new site is also likely to disappoint die-hard data users as being not much more than a shiny new showroom attached to the same old government data warehouse, a warehouse still in need of operating improvements and accessible data.

The new edition of Data.gov does offer some improvements over the original site, which began in 2009 as a basic clearinghouse for federal agency datasets, many of which were not designed for the general population. Although Data.gov has been routinely criticized for this lack of accessibility, it deserves credit for spawning more than a dozen special interest communities around health, education, energy, safety and other data. It has also led, thanks to the vision of federal CTO Todd Park, to a collection of cottage industries that are putting government data to work for private enterprise.

The new site design clearly reflects an injection of new thinking from various sources. Among them are the Office of Science and Technology Policy, the General Services Administration, and private sector experts working through the Presidential Innovation Fellows program.

For instance, Next.Data.gov has fused the usual social media tools into the site to capture streams of comments that highlight how private enterprise and the public at large are taking advantage of government data. The design brings the public's ideas about government data front and center.

The result is slicker promotion of some of the government's best data harvesting successes, such as the release of a database showing the significant variations in what hospitals and healthcare providers charge across the nation for 100 most common inpatient services and 30 most common outpatient services.

The new site also uses D3.js, a Web-based JavaScript library that helps manipulate and visualize data in documents. This makes it possible to look beyond the card catalogue view that Data.gov generally provided, and actually view the data in Data.gov's repository dynamically.

The new version also has winnowed down Data.gov's offering, pulling together what would seem to be the most usable subset of Data.gov's total inventory of data sets and application programming interfaces. The new site features 75,713 datasets and 100 APIs compared to the 184,259 datasets 295 government APIs previously listed on Data.gov.

But the real test of the new design is whether users can find and make ready use of the government's vast data resources. And the early results suggest a lot more needs to be done to reduce the number of steps it takes to find and actually extract what in many cases remains buried data treasure.

Find out what government IT teams need to know to deliver new, more agile enterprise networks and services. Also in the new, all-digital Next-Gen Networks issue of InformationWeek Government: How the Navy cut the price tag for its newly awarded Next Generation Enterprise Network contract to HP by more than a billion dollars. (Free registration required.)

Comment  | 
Print  | 
More Insights
Comments
Newest First  |  Oldest First  |  Threaded View
anon3103639501
50%
50%
anon3103639501,
User Rank: Apprentice
7/24/2013 | 12:05:44 PM
re: Data.gov Gets Updated: A Closer Look
"But in many respects, the new site is also likely to disappoint die-hard data users as being not much more than a shiny new showroom attached to the same old government data warehouse, a warehouse still in need of operating improvements and accessible data."

It does disappoint and using the following click trail as an example:

Start at: http://next.data.gov/

pick a Community like Safety: http://next.data.gov/safety

pick Resources: http://next.data.gov/safety/sa...

pick the National Map: http://nationalmap.gov/viewer....

then Click here to go to The National Map Viewer

and Download Platform!: http://viewer.nationalmap.gov/...

and you finally get to the data and its display

Bottom line: This is yet another new interface to the old Data.gov interface that eventually takes you (if you are lucky enough to find it) to where the actual data has been for years!

My current efforts for OMB to fix this problem are just starting: http://semanticommunity.info/D...
Becca L
50%
50%
Becca L,
User Rank: Apprentice
7/22/2013 | 8:23:50 PM
re: Data.gov Gets Updated: A Closer Look
You said it. I have yet to find an organization less helpful or more cringe-worthy than .gov sites. I find them nearly impossible to navigate, woefully outdated and the number of broken links is astounding. Any upgrades are great steps in my book, but they have a long way to go. I'm encouraged that this issue is receiving any attention at all.
InformationWeek Elite 100
InformationWeek Elite 100
Our data shows these innovators using digital technology in two key areas: providing better products and cutting costs. Almost half of them expect to introduce a new IT-led product this year, and 46% are using technology to make business processes more efficient.
Register for InformationWeek Newsletters
White Papers
Current Issue
InformationWeek Government, May 2014
NIST's cyber-security framework gives critical-infrastructure operators a new tool to assess readiness. But will operators put this voluntary framework to work?
Video
Slideshows
Twitter Feed
Audio Interviews
Archived Audio Interviews
GE is a leader in combining connected devices and advanced analytics in pursuit of practical goals like less downtime, lower operating costs, and higher throughput. At GIO Power & Water, CIO Jim Fowler is part of the team exploring how to apply these techniques to some of the world's essential infrastructure, from power plants to water treatment systems. Join us, and bring your questions, as we talk about what's ahead.