Language analysis software aids U.S. Web search for terrorist activity
Computerworld - It's one thing to track and monitor terrorists the U.S. government already knows about. It's far tougher to uncover the ones who are unknown.
To help in that effort, a Cambridge, Mass.-based globalization software company, Basis Technology Inc., has created the Rosette Arabic Language Analyzer. The tool can plug into data mining applications used by U.S. defense and security agencies that scour the Internet for Web sites written in Arabic. Automating the search lets investigators quickly glean information that can help them find new potential suspects in the fight against terrorism, according to the company.
Carl Hoffman, CEO of Basis Technology, said the analyzer, which is currently in beta testing, plugs into content management and knowledge management applications used by the U.S. government and defense contractors, including Convera's RetrievalWare and Fast Search & Transfer ASA's Data Search software. It's also available for Microsoft SQL Server and Oracle Text/interMedia. The tool allows the government to focus on obscure Web sites whose URLs change regularly, so the sites can be monitored for terrorist activity.
When interesting or worrisome information is found, it can be turned over to intelligence organizations for direct investigation by personnel who specialize in such probes, Hoffman said.
The analyzer is one of 37 Rosette foreign language analyzers offered by Basis Technology. The analyzers identify the language of the content and then convert the text into standardized Unicode, the international character set that provides a unique number for every character in any language. This is the first commercially available Arabic language analyzer created in the U.S., Hoffman said.
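As a rough illustration of what "converting text into Unicode" involves, the Python sketch below decodes bytes from a legacy Arabic encoding into Unicode code points and makes a crude guess at the script. It is not Basis Technology's method: the function names `to_unicode` and `dominant_script` are hypothetical, the source encoding is assumed to be known in advance, and the check identifies the script (Arabic vs. Latin), not the language, which a real analyzer would also have to determine.

```python
import unicodedata
from collections import Counter


def to_unicode(raw: bytes, encoding: str) -> str:
    """Decode bytes from a known legacy encoding and normalize to NFC."""
    return unicodedata.normalize("NFC", raw.decode(encoding))


def dominant_script(text: str) -> str:
    """Crude script guess: the most common first word of the letters' Unicode names."""
    counts = Counter(
        unicodedata.name(ch, "UNKNOWN").split()[0]
        for ch in text
        if ch.isalpha()
    )
    return counts.most_common(1)[0][0] if counts else "NONE"


# Assumption for the example: a page stored in Windows-1256, a legacy Arabic code page.
raw = "سلام".encode("cp1256")
text = to_unicode(raw, "cp1256")

# Each character now has one unique number (code point), whatever the original encoding.
code_points = [f"U+{ord(ch):04X}" for ch in text]
print(code_points)
print(dominant_script(text))
```

Once text is in Unicode, the same search and indexing code can handle Arabic and Latin-script content alike, which is the property the article attributes to the analyzers.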
Basis Technology began working on the Arabic Language Analyzer shortly after the Sept. 11 terrorist attacks in the U.S., Hoffman said. "A number of government agencies in the intelligence community strongly encouraged us to move in this direction," he said.
Everette Jordan, director of the National Virtual Translation Center, an organization jointly sponsored by the FBI and CIA under the USA Patriot Act, said in a statement that "linguistics technology is beginning to play an increasingly important role when it comes to ensuring national security."
"Because of the enormous volume of multilingual intelligence information that must be analyzed with limited human resources, technologies that can assist in sifting, sorting and finding critical information are essential in ensuring that threats are detected as quickly as possible," Jordan said.
Glenn Nordin, assistant director of language intelligence policy at the U.S. Department of Defense, said in a statement that analyzers such as this one help because "U.S. government computer systems are largely designed to work with the Latin alphabet and U.S. character sets, [and] processing information in Arabic is a difficult undertaking.
"In the absence of universal transliteration standards, human transcription of foreign text into the Latin alphabet can result in significant corruption of the data and mismatches in searches," Nordin said. "Finding solutions that enable intelligence analysts to extract and disseminate information in the original language and script could be of critical importance."
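The search mismatch Nordin describes can be sketched in a few lines of Python. The records below are invented for illustration and are not from the article; they rely only on the well-known fact that one Arabic name can be romanized several ways. The helper `search` is a hypothetical exact-match lookup, not any agency's or vendor's system.

```python
# Illustrative records only: one Arabic spelling, three common Latin romanizations.
records = [
    {"original": "محمد", "romanized": "Mohammed"},
    {"original": "محمد", "romanized": "Muhammad"},
    {"original": "محمد", "romanized": "Mohamed"},
]


def search(rows, field, query):
    """Return the rows whose given field exactly equals the query string."""
    return [row for row in rows if row[field] == query]


# Searching the transliterated field finds only the variant that happens to match.
latin_hits = search(records, "romanized", "Muhammad")

# Searching the original script finds every record for the same name.
script_hits = search(records, "original", "محمد")

print(len(latin_hits), len(script_hits))
```

With exact matching, the Latin-alphabet query misses two of the three records, while the original-script query finds all of them.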
Nordin and Jordan could not be reached for additional comment today.