Semantic Web helps protect public health
New ways of analyzing complex data sets from multiple sources helps guard against threats
Computerworld - After 9/11, health officials and the U.S. government became very concerned about the possibility of bioterror and interested in ways of identifying possible bioterror incidents early to react and contain them, says Parsa Mirhaji. "People were talking about all kinds of wild things like tracking fruit juice and handkerchief sales to identify a flu-like outbreak that can indicate a potential anthrax attack," says the assistant professor of medicine and the director of the Center for Biosecurity and Public Health Information Research at the University of Texas in Houston.
The problem was not a lack of data but almost too much. "We have information from hospital emergency departments, community clinics, pharmacy prescription drug sales, clinical laboratories, environmental safety commissions, pollution exposure in air and water." This is all complex data, and to do any good it needs to be analyzed quickly, first for normal patterns to provide a basis for comparison and then for deviations from those patterns that might indicate a natural disease outbreak or a bioterror incident.
This data comes from multiple sources and systems that use different, often incompatible schema. It comes in fast, in near real time, and to be of use, it needs to be analyzed very quickly. Conventional analysis methods simply are not as effective -- in an outbreak, officials depending on them would constantly be behind the spread of an infectious disease.
To solve that problem, Mirhaji's team turned to Semantic Web technology, a term that is beginning to appear in conversations among those working with very large amounts of data involved in scientific and medical research.
Web of meaning and connectivity
Semantic Web refers to the web of meaning and connectivity in large and complex data sets accessible from a distributed network such as World Wide Web, a network of collaborators and trustees or within boundaries of an organization. It's a way to organize complex data in meaningful ways by assigning a formal meaning to each element of data. This makes all data explicit, unambiguous and its interpretation identical for both machines and humans.
A simple example might be cell phone numbers. Presume, for instance, that a company issues cell phones with a consecutive set of numbers to all its salespeople. If you know this, then whenever you see one of the numbers in that set, you know that the person holding it is a sales professional who works for a specific company.
Suppose you are at a conference in Japan and someone gives you a business card written in Japanese, which you cannot read. But you see that the cell number is part of the series issued to this company's salespeople. Immediately, you know something about the person who just gave you his card, even if you cannot talk to him.


- Excel 2010 Cheat Sheet
- Register for this Computerworld Insider Cheat Sheet and gain access to hundreds of premium content articles, guides, product reviews and more.
- Finding the right cloud solutions for your organization
- HP is driving the evolution of what we call the Instant-On Enterprise. It is an enterprise that embeds technology into everything it does...
- Converged Infrastructure for Dummies
- As you know, everything is mobile, connected, interactive, and immediate. This is exactly why organizations need a highly agile IT infrastructure in order...
- Seven Priorities for Integrated Network Management - How HP Intelligent Management Center Delivers an Enterprise-class Solution
- This white paper describes the major requirements for network management solutions to help the organizations become more profitable, efficient and reliable.
Intel and the... - Building Cloud-Optimized Data Center Networks white paper
- Enterprises are turning to the Cloud to improve business agility, reduce expenses and accelerate business innovation. Cloud computing redefines the way IT assets...
- Gartner on the Network Infrastructure Market
- The network infrastructure market has evolved rapidly, from one in which most organizations adhered to a single-vendor architecture to a more business-driven network... All Networking White Papers
- The Higher-Bandwidth, Lower-Cost Connection of Choice: 10GBASE-T LAN on Motherboard
- Learn how Expedient, a cloud provider, is using 10 Gigabit Ethernet to boost its services and rein in costs.
- Distributed Database Security with Real-time Monitoring
- View this demo and learn how IBM InfoSphere Guardium database activity monitoring can help protect your sensitive data in distributed DBMS environments with...
- InfoSphere Warehouse Packs Demo
- These flash modules make warehousing more tangible and relevant to business users through detailed explanations of the InfoSphere Warehouse Packs.
- Delivery Management -- Extending Lifecycle Management
- Date: Wednesday, June 20, 2012, 1:00 PM EDT
Siloed organizations continue doing the wrong things and doing things wrong, leading to increased costs,... - Leverage automation today to reduce IT complexity
- Date: Tuesday, June 5, 2012, 2:00 PM EDT
Whether your B2B complexity is caused by multiple technologies due to M&A, business or application specific...
All Networking Webcasts