
Infectious disease surveillance 2.0: Crawling the Net to detect outbreaks

Think of it as an early-warning system for health officials

July 8, 2008 12:00 PM ET

Computerworld - While recent outbreaks of salmonella in the U.S. have made headlines, an automated real-time system that scours the Web for information about disease outbreaks spotted early reports in New Mexico of suspicious gastrointestinal illnesses days before the U.S. Centers for Disease Control and Prevention (CDC) issued an official report on the problem.

The system, called HealthMap, is a free data-mining tool that extracts, categorizes, filters and links 20,000 Web-based data sources such as news sites, blogs, e-mail lists and chat rooms to monitor emerging public health issues. HealthMap, which is profiled in the July issue of the journal Public Library of Science Medicine and is open to anyone, was developed in late 2006 by John Brownstein and Clark Freifeld. Both men work in the informatics program at Children's Hospital Boston.

The system's goal is to detect disease outbreaks before they are spotted by traditional surveillance sources or the international public health community, Brownstein said. Reports of new occurrences of infectious disease often surface on the Web well before they reach the existing public health reporting infrastructure, he said.

"There were media reports and chat room discussions -- things going on in the informal settings -- saying things about the SARS outbreak [in China] before traditional official channels were reporting on the outbreak," he noted.

As of July 8, the tool was tracking 226 reports of salmonella, 110 reports of avian influenza, 49 reports of dengue fever, 28 reports of anthrax, 24 reports of West Nile virus and 21 cases of hemorrhagic fever worldwide.

The tool is especially vital to developing countries or other areas of the world that might not have traditional disease surveillance mechanisms in place, Brownstein said. "There was no real source that brought all information about outbreaks together," he said. "This is a way to bring all this information together in a very organized and synthesized way while filtering a lot of noise that might otherwise exist on the Web."

HealthMap scours Web sources every hour, seven days a week, Freifeld added. The pair use RSS feeds to access the data when they are available and screen-scraping techniques for Web sources that do not offer feeds. The data is then categorized by the type of disease being described and the location of the outbreak, he said.
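The categorization step described above can be illustrated with a minimal Python sketch. It is not HealthMap's code: the keyword lists, the function names (`first_match`, `categorize_feed`) and the sample feed are all assumptions made for illustration, and simple keyword matching stands in for whatever classification the real system uses.

```python
import xml.etree.ElementTree as ET

# Hypothetical keyword lists; the article does not describe
# the dictionaries HealthMap actually uses.
DISEASES = ["salmonella", "avian influenza", "dengue", "anthrax", "west nile"]
LOCATIONS = ["new mexico", "china", "boston"]


def first_match(text, terms):
    """Return the first term found in text (case-insensitive), or None."""
    lowered = text.lower()
    for term in terms:
        if term in lowered:
            return term
    return None


def categorize_feed(rss_xml):
    """Parse an RSS 2.0 document and tag each item with a disease and a location."""
    root = ET.fromstring(rss_xml)
    reports = []
    for item in root.iter("item"):
        title = item.findtext("title", default="")
        description = item.findtext("description", default="")
        text = title + " " + description
        reports.append({
            "title": title,
            "disease": first_match(text, DISEASES),
            "location": first_match(text, LOCATIONS),
        })
    return reports


# Made-up sample feed, parsed from a string so the sketch needs no network access.
SAMPLE_FEED = """<rss version="2.0"><channel>
<item><title>Salmonella cases reported in New Mexico</title>
<description>Health officials investigate gastrointestinal illnesses.</description></item>
<item><title>Local sports roundup</title>
<description>Scores and highlights.</description></item>
</channel></rss>"""

reports = categorize_feed(SAMPLE_FEED)
for report in reports:
    print(report)
```

Items that match no disease or location keyword come back with `None` in that field, which a real pipeline would presumably filter out as noise.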

Articles are then analyzed for duplication and content. Duplicate articles are removed, and those that include new information about a related topic are added to a Google Map. On the map, alerts are color-coded according to how much recent news about each outbreak has been noted. HealthMap also includes pop-up windows on highlighted cities or states that provide links to all the news reports on an outbreak for that location.
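A rough Python sketch of the duplicate removal and color-coding described above follows. The article does not say how HealthMap detects duplicates or which thresholds and colors it uses, so everything here is assumed: duplicates are approximated by comparing normalized titles, and the thresholds, color names and function names are hypothetical.

```python
import re
from collections import Counter


def normalize(title):
    """Lowercase a title and drop punctuation so near-identical headlines compare equal."""
    return " ".join(re.findall(r"[a-z0-9]+", title.lower()))


def deduplicate(reports):
    """Keep the first report for each (normalized title, disease, location) key."""
    seen = set()
    unique = []
    for report in reports:
        key = (normalize(report["title"]), report["disease"], report["location"])
        if key not in seen:
            seen.add(key)
            unique.append(report)
    return unique


def alert_colors(reports, thresholds=((5, "red"), (2, "orange"))):
    """Map each (disease, location) pair to a color based on its report count.

    The thresholds and color names are assumptions, not HealthMap's values.
    """
    counts = Counter((r["disease"], r["location"]) for r in reports)
    colors = {}
    for key, count in counts.items():
        colors[key] = next(
            (color for minimum, color in thresholds if count >= minimum),
            "yellow",
        )
    return colors


# Made-up sample reports for illustration.
sample_reports = [
    {"title": "Salmonella outbreak in New Mexico", "disease": "salmonella", "location": "new mexico"},
    {"title": "SALMONELLA outbreak in New Mexico!", "disease": "salmonella", "location": "new mexico"},
    {"title": "More salmonella cases confirmed", "disease": "salmonella", "location": "new mexico"},
    {"title": "Anthrax scare", "disease": "anthrax", "location": "china"},
]

unique_reports = deduplicate(sample_reports)
colors = alert_colors(unique_reports)
print(len(unique_reports), colors)
```

Matching on normalized titles only catches near-verbatim repeats; recognizing two differently worded articles about the same event would need a text-similarity measure, which this sketch does not attempt.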


