Commentary

J. Nicholas Hoover
Senior Editor, InformationWeek  

Garbage In, Garbage Out In Washington

Problems abound with data about stimulus spending being reported on the federal government's stimulus-tracking Website, recovery.gov, as a cursory search for "stimulus" and "errors" on your favorite news search engine should tell you. While many of the numbers have been, quite simply, incorrectly reported by recipients, others are a symptom of greater problems in government IT.

Problems abound with data about stimulus spending being reported on the federal government's stimulus-tracking Website, recovery.gov, as a cursory search for "stimulus" and "errors" on your favorite news search engine should tell you. While many of the numbers have been, quite simply, incorrectly reported by recipients, others are a symptom of greater problems in government IT.Those problems lay at the nexus of interoperability, data standardization, and data accuracy. Earlier this year, I wrote a story about information stovepipes in the government, but I only touched the top of the smokestack. In reporting that story, when I asked sources to point me toward the biggest stovepipes in government, they repeatedly responded that there were more than they could count.

Take the Department of Defense, for example. It has between 15 and 20 ERP systems, which are only patched together as an afterthought, if at all. "Each one of them is a starship unto itself, but where is the enterprise?" LeAntha Sumpter, deputy director for program development and implementation for defense procurement and acquisition policy at the DoD, asked at a conference in D.C. on Thursday. "Where does it come together?" Likewise, the Department of Transportation, one of the biggest recipients of stimulus money, has about a dozen ERP systems.


More Government Insights

White Papers

More >>

Reports

More >>

Webcasts

More >>

Just within the Department of Defense, said Sumpter, who's also the agency lead for posting data on usaspending.gov, there are somewhere between six and 10 different standards for country codes. That means that KS (Kansas) is also KS (South Korea). And to think, there you were, wondering why we sent military aid to Topeka.

That problem is exacerbated by a lack of training, education and oversight about data quality, said Marv Langston, former Navy CIO and deputy DoD CIO, citing an odd problem where a federal acquisition system at one time showed the Department of Defense as a huge buyer of soybeans. It turned out that the product code for soybeans used to be nothing but a series of ones, i.e. 11111. "People are lazy, it's just as simple as that," he said.

Combine those problems and mix them with a White House-led data transparency initiative, and you're bound for trouble. Sumpter noted that federal agencies are getting daily calls from reporters questioning stimulus data. "When you're looking at transparency from the federal level, we're getting killed at the operational level," she said.

Posting incorrect data online may cause a public relations headache, but it also exposes the flawed processes underneath, rather than hiding the problems via a thorough and time-consuming scrubbing that takes months. It just goes to show that the government needs to continue looking for ways to push for data standards such as the National Information Exchange Model or NIEM, and work on accurate data reporting off the bat, rather than having to do incredibly hard work of scraping over data for months before release. It also shows that citizen-reported data might require extra care.

"We're not inept, we just haven't done the due diligence," Sumpter said. "Unless you want to go down to the data level and look at integrating data at core levels, the biggest problem we have federal government-wide is definitions of data, the contexts in which they're used, and the processes around them."


Related Reading




Currently we allow the following HTML tags in comments:

Single tags

These tags can be used alone and don't need an ending tag.

<br> Defines a single line break

<hr> Defines a horizontal line

Matching tags

These require an ending tag - e.g. <i>italic text</i>

<a> Defines an anchor

<b> Defines bold text

<big> Defines big text

<blockquote> Defines a long quotation

<caption> Defines a table caption

<cite> Defines a citation

<code> Defines computer code text

<em> Defines emphasized text

<fieldset> Defines a border around elements in a form

<h1> This is heading 1

<h2> This is heading 2

<h3> This is heading 3

<h4> This is heading 4

<h5> This is heading 5

<h6> This is heading 6

<i> Defines italic text

<p> Defines a paragraph

<pre> Defines preformatted text

<q> Defines a short quotation

<samp> Defines sample computer code text

<small> Defines small text

<span> Defines a section in a document

<s> Defines strikethrough text

<strike> Defines strikethrough text

<strong> Defines strong text

<sub> Defines subscripted text

<sup> Defines superscripted text

<u> Defines underlined text

InformationWeek encourages readers to engage in spirited, healthy debate, including taking us to task. However, InformationWeek moderates all comments posted to our site, and reserves the right to modify or remove any content that it determines to be derogatory, offensive, inflammatory, vulgar, irrelevant/off-topic, racist or obvious marketing/SPAM. InformationWeek further reserves the right to disable the profile of any commenter participating in said activities.

Disqus Tips To upload an avatar photo, first complete your Disqus profile. | View the list of supported HTML tags you can use to style comments. | Please read our commenting policy.
T-Shirt Giveaway T-Shirt Giveaway: Each week we're selecting one great comment from our readers. The author of the comment will receive an InformaitonWeek Community t-shirt. So get posting!
Subscribe to RSS

Resource Links