Language analysis software aids U.S. Web search for terrorist activity
Computerworld - It's one thing to track and monitor terrorists the U.S. government already knows about. But it's even tougher uncovering the ones who are unknown.
To help in that effort, a Cambridge, Mass.-based globalization software company, Basis Technology Inc., has created the Rosette Arabic Language Analyzer. The tool can plug into data mining applications used by U.S. defense and security agencies that are involved in scouring the Internet for Web sites written in Arabic. By automating the search, information that can help investigators find new potential suspects in the fight against terrorism can be gleaned quickly, according to the company.
Carl Hoffman, CEO of Basis Technology, said the analyzer, which is currently in beta testing, plugs into content management and knowledge management applications used by the U.S. government and defense contractors, including Convera's RetrievalWare and Fast Search & Transfer ASA's Data Search software. It's also available for Microsoft SQL Server and Oracle Text/interMedia. The tool allows the government to focus on obscure Web sites whose URLs change regularly, so the sites can be monitored for terrorist activity.
When interesting or worrisome information is found, it can be turned over to intelligence organizations for direct investigation by personnel who specialize in such probes, Hoffman said.
The analyzer is one of 37 Rosette foreign language analyzers offered by Basic Technology. The analyzers identify the language of the content and then convert the text into standardized Unicode, the international character set that provides a unique number for every character in any language. This is the first commercially available Arabic language analyzer created in the U.S., Hoffman said.
Basic Technology began working on the Arabic Language Analyzer shortly after the Sept. 11 terrorist attacks in the U.S., Hoffman said. "A number of government agencies in the intelligence community strongly encouraged us to move in this direction," he said.
Everette Jordan, director of the National Virtual Translation Center, an organization jointly sponsored by the FBI and CIA under the USA Patriot Act, said in a statement that "linguistics technology is beginning to play an increasingly important role when it comes to ensuring national security."
"Because of the enormous volume of multilingual intelligence information that must be analyzed with limited human resources, technologies that can assist in sifting, sorting and finding critical information are essential in ensuring that threats are detected as quickly as possible," Jordan said.
Glenn Nordin, assistant director of language intelligence policy at the U.S. Department of Defense, said in a statement that analyzers such as this one help because "U.S. government computer systemsare largely designed to work with the Latin alphabet and U.S. character sets, [and] processing information in Arabic is a difficult undertaking.
"In the absence of universal transliteration standards, human transcript of foreign text into the Latin alphabet can result in significant corruption of the data and mismatches in searches," Nordin said. "Finding solutions that enable intelligence analysts to extract and disseminate information in the original language and script could be of critical importance."
Nordin and Jordan could not be reached for additional comment today.
Read more about BI and Analytics in Computerworld's BI and Analytics Topic Center.


- Excel 2010 Cheat Sheet
- Register for this Computerworld Insider Cheat Sheet and gain access to hundreds of premium content articles, guides, product reviews and more.
- Capture-Enabled Business Process Management
- Organizations today must deal with a vast amount of incoming information from many different sources. Efficient, automated business processes are critical to managing...
- Using Case Management to Empower Employees and transform Customer Service
- This Kofax paper shows how successful customer service organizations have transformed customer service by empowering their employees. We will see how Dynamic Case...
- Case Study: Audi-Volkswagen Improves Procurement Control
- Audi-Volkswagen required a user-friendly, easy-to-use Business Process Management system that did not require programming skills or high levels of technical expertise in-house. This...
- AIIM Market Intelligence: The paper-free office, dream or reality?
- In this Aiim Market Intelligence report, produced in association with Kofax, we look at the success of paper-elimination projects, where and why paper...
- Information Governance: Turning Data Into Business
- This whitepaper explores current information governance practices, challenges, and ROI among US, UK, and German firms.
- Live Webcast
How to Reduce Complexity and Automate Your Partners for Efficient E-Business: - Date: Tuesday, June 5, 2012, 2:00 PM EDT
Whether your B2B complexity is caused by multiple technologies due to M&A, business or application specific... - Live Webcast
Data Privacy and Protection in Production Environments: New Research from Ponemon Institute - Date: Wednesday, June 13, 2012, 1:00 PM EDT / 10:00 AM PDT
In a recent study conducted by Ponemon Institute, fifty-five percent of respondents... - Live Webcast
Today's NAS: A Solution Beyond Old Limits - Date: Tuesday, July 17, 2012 2:00 PM EDT
Traditional NAS systems don't scale beyond fixed limits. Proliferation of NAS systems leads to management... - BMC Control-M - Single Point of Control Demo
- With BMC Control-M, you schedule and manage everything - down to the very last platform and application - from one simple interface. It's...
- Sun Chemical Customer Success Story
- Sun Chemical, the world's largest producer of printing inks and pigments, quadrupled its complex batch environment with zero extra headcount using BMC Control-M's...
- Service-Enabling CICS Applications: Best Practices
- This informative webcast provides an informed, thorough look into CICS service-enablement options and how they can affect your environment. You'll learn how to...
- Teaching Legacy Application Elephants How to Dance
- This four-minute video podcast shows how you can create services to continuously reuse enterprise applications, however and whenever needed, while leaving legacy logic...
- Verastream Host Integrator
- This six-minute product demo shows how you can use Verastream Host Integrator to modernize and service-enable legacy assets for use across your enterprise....