Ads by TechWords

See your link here
Receive the latest technology news and information.
Computerworld Daily News (First Look and Wrap-Up)
Computerworld Blogs Newsletter
The Weekly Top 10
Cloud Computing
View all newsletters




Privacy Policy
 

Been audited lately? Blame the IRS's massive, superfast data warehouse

The Compliance Database Warehouse is churning for you

March 22, 2008 12:00 PM ET

Active Comments
Anonymous says: Columnar databases, especially industry leading product such as Sybase IQ, seem to be very well suited for storing vast amounts...
Billy Bathgates says: billy bathgates sure has penetrated the government in a most flagrant and indecent manner. Why should our tax dollars be...


Computerworld - As April 15 looms and tax season starts entering its frenzied home stretch, individuals and corporate accountants contemplating their tax returns may find themselves wondering just how much they can, let us say, fudge the numbers.

Don't be tempted. In recent years, the Internal Revenue Service has become markedly better at spotting potential tax fraud and more aggressive in pursuing it.

In 2006, the agency collected a record $59.2 billion via 1.4 million audits. Revenue has grown during the past seven years and is up 75% since 2000.

In particular, the IRS has increased its targeting of tax cheats in the middle-class (between $25,000 and $100,000 in annual income) and super-rich (greater than $1 million a year) brackets. It audited 436,000 middle-class households and individuals in 2006, three times as many as in 2000. The chances of being audited rose commensurately, to 1 in 140 in 2006 from 1 in 377 six years earlier, according to The New York Times.

Meanwhile, the above-$1 million club faced a 1-in-11 chance of an audit in 2006, up from 1 in 20 three years earlier (download PDF).

At the same time, the IRS has been able to steadily reduce the number of audits of honest taxpayers, according to Jeff Butler, director of research databases at the IRS.

Butler oversees the agency's research data warehouse, key to the agency's recent improvements.

"What we do is, in some respects, cutting-edge," said Butler, with the restrained pride of a longtime bureaucrat (he has been at the IRS for a total of 15 years, with an additional five-year stint at the U.S. Department of Transportation). "We're getting to be as sophisticated as the largest credit card companies or banks."

A world-class data setup

Butler may be too modest. Unifying all tax returns and related information from the past 10 years, the 150TB Compliance Data Warehouse (CDW) is comparable in size to the largest known databases in the world, such as those run by YouTube, AT&T and the CIA.

It's not just the CDW's size that impresses, but also its capabilities.

IRS researchers can use it to "search and analyze hundreds of millions or even billions of records at a time, so we can answer questions, look at trends, do simulations and optimization modeling," Butler said. And those analyses, rather than taking weeks or months as they would have in the past, now take hours or days, he said.

Using the CDW, the IRS was able to discover areas where tax cheating had become rampant, such as the Earned Income Tax Credit or small-business tax shelters, Butler said.

Researchers have also used the CDW to discover or confirm who is at risk for falling behind on their tax payments. For instance, young graduates laden with college debt are particularly susceptible.



Jump to comments

april 15th

Additional Resources

Xerox
By using solid ink technology only from Xerox, you could save up to 65% by printing color for the cost of black and white. Enter for a chance to WIN a PhaserTM 8860 network color printer!
Microsoft
Save time and mitigate security risk. Deploy it now.
Sybase
In this white paper, IDC analyzes the role of next-generation mobile enterprise platforms as organizations seek a more strategic deployment of mobile solutions.

Learn the important issues you must consider before starting your next mobility initiative. Get your mobility white paper from IDC now, compliments of Sybase.

What People Are Saying

Featured Zone
The SAS Business Analytics Zone
Is your enterprise constantly challenged by the need to manage huge data volumes in near-real time to make fast, accurate decisions? If so, get into the zone — and learn more about how SAS® Data Integration and SAS® Data Quality solutions - powered by DataFlux - can help you access, validate, cleanse, enhance and distribute trustworthy information. SAS provides the software solutions to address a volatile economy, increased regulations, talent shortages and global competition. Our unique framework of Business Analytics offerings enables organizations to solve complex problems, manage for performance, drive sustainable growth and anticipate change.
Enter the SAS Business Analytics Zone now
See All Zones

 

SAS Information Management Kit

SAS is the leader in business intelligence and analytical software and services. Only SAS offers leading data integration, storage, analytics and business intelligence applications within a comprehensive enterprise intelligence platform. SAS gives 97 of the top 100 companies in the 2007 Fortune 500 THE POWER TO KNOW®.

Webcast: The Information Management Roadmap
Imagine high-quality data, cleansed, analyzed and delivered throughout your organization. Join Computerworld, IT visionary Thornton May and a panel of experts to learn how SAS® can help you make it happen.

View this webcast 
Research Report: Information Management Initiatives at Midsize and Large Organizations
See the top-line results of this Computerworld sponsored survey to see how IT and business leaders are handling information management implementation.

Download this report 
White Paper: Information Management: Better Information for Winning Decisions.
This white paper explains how the SAS Information Evolution Model aids companies in assessing how they use this information to make strategic decisions and drive business.

Download this white paper