IRS Database System Makes Tax Deadline, Finally - InformationWeek


05:45 PM

IRS Database System Makes Tax Deadline, Finally

After long delays and multiple false starts, IRS delivers on a new database to speed up returns processing and delivery of refunds, plus improve fraud detection.

CADE 2 almost didn't happen. Following schedule delays and budget overruns in earlier versions of the system, the agency scaled back its plans and extended its timeline for delivery, with plans stretching into the 2020s. As a result, CADE 2 deals only with individual taxes, not business taxes or those related to retirement plans. But the project completion date was accelerated, and daily processing went live in January, in time for this year's tax season.

The agency began moving the processing of simple returns like the 1040EZ form to CADE 2's predecessor system, a relational database developed with help from CSC. When Shulman took on the commissioner job in 2007, he made it a priority to push the project through to completion. "We took the IT portfolio and shut down some other projects," he said. "We put the A-team on it."

In November 2008, the IRS hired CTO Terry Millholland, a former tech exec with Visa, Boeing and EDS, to oversee its tax systems, including the CADE project. The agency created a new governance plan for the program, headed up by an associate CIO and overseen by multiple oversight boards.

The IRS functioned as systems integrator for CADE 2, rather than contracting out that job. Millholland's strategy has been to "get the data right, and the functionality will follow."

That meant creating a data model for 30,000 data elements, then extracting all the structured and unstructured data from the Individual Master File, converting it, and loading it into CADE 2. And data integrity had to be such that "it balances to the penny," said Millholland.
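As a rough illustration of what a "balances to the penny" check could look like after an extract, convert, and load run, here is a minimal sketch. It is not the IRS's actual process; the function name `reconcile`, the record layout (account ID plus amount), and the sample values are all assumptions made for the example. It uses exact decimal arithmetic so that no rounding error can hide a one-cent discrepancy.

```python
from decimal import Decimal


def reconcile(source_rows, target_rows):
    """Compare per-account totals between a source and a target dataset.

    Both arguments are iterables of (account_id, amount) pairs, with the
    amount given as a string such as "10.10" (hypothetical layout).
    Returns a dict mapping each mismatched account to
    (source_total, target_total). An empty dict means the load balances.
    """
    def totals(rows):
        out = {}
        for account_id, amount in rows:
            # Decimal keeps cents exact; binary floats would not.
            out[account_id] = out.get(account_id, Decimal("0")) + Decimal(amount)
        return out

    src, tgt = totals(source_rows), totals(target_rows)
    mismatches = {}
    for account_id in sorted(set(src) | set(tgt)):
        s = src.get(account_id, Decimal("0"))
        t = tgt.get(account_id, Decimal("0"))
        if s != t:
            mismatches[account_id] = (s, t)
    return mismatches


# Hypothetical sample data: account "B" is off by one cent after loading.
legacy = [("A", "10.10"), ("A", "0.20"), ("B", "5.00")]
loaded = [("A", "10.30"), ("B", "5.01")]
print(reconcile(legacy, loaded))
```

In this sketch an empty result means every account total matches exactly; any entry points to an account whose converted records need to be investigated before the new database can be trusted.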

In its budget request for fiscal year 2013, the IRS cited phase two of CADE 2 development as among its planned areas of investment. The agency is looking to retire the IMF and rewrite large chunks of machine code in Java. And it plans increased use of analytics tools with CADE 2 to support its compliance efforts. It already applies filters to scour tax returns for inaccuracies or fraud.

IRS watchdogs are keeping an eye on the project. Last September, the IRS agreed to take steps to improve the project's management, after the inspector general complained that the agency wasn't consistently implementing system development practices, that too many risks were undocumented, and that test plans were insufficiently developed.

The agency faces other tech challenges. Increased call volumes have led to a 48% increase in call wait times since 2008, a problem that will likely require new approaches to customer service, self-service, and automation. And the IRS is still tweaking its electronic filing systems, which have experienced data transmission problems at times.

Attend InformationWeek's IT Government Leadership Forum, a day-long venue where senior IT leaders in government come together to discuss how they're using technology to drive change in federal departments and agencies. It happens in Washington, D.C., May 3.

4/23/2012 | 11:20:26 PM
re: IRS Database System Makes Tax Deadline, Finally
This article is total fiction and how Shulman thinks he can get away with this nonachievement statement is astounding. The Individual Master File (IMF) is still processing tax returns; it was converted to run on a daily basis instead of weekly (something that could have been done 20 years ago). There is no relational database that works as stated. There is a relational database that the IRS is "attempting" to load using Informatica--at the end of each daily processing cycle using IMF data. However, this RDBMS is not used for anything--because they can't even load it correctly.

Take a real bow, Shulman!!!