U.S. history moves online, with offshore help
The Smithsonian Institution found that putting old records online meant sending work overseas
Computerworld - Companies still have choices when it comes to moving work such as application development offshore. But in one niche field -- the creation of electronic documents from old records -- IT managers may have little choice but to send the work overseas.
That's the experience of the Smithsonian Institution, which is preparing next week to put online the records of the U.S. Exploring Expedition of 1838-1842, a vast but largely forgotten worldwide research endeavor that has been compared with the Apollo space missions in its scope and ambition. The U.S.-funded research expedition involved more than 300 men, six ships and experts in a wide range of areas: geology, botany, anthropology, art and others. On the expedition, they gathered a wide variety of materials, such as 50,000 dried plant specimens.
Major parts of the online effort, including XML encoding of 2,900 pages of records that will give the Smithsonian the ability to create a rich set of searchable and linked documents, was completed in the Philippines and other countries by Innodata Isogen Inc. of Hackensack, N.J.
"In terms of the marketplace, there aren't onshore options," said Martin Kalfatovic, head of the Smithsonian Institution Libraries' New Media Office. That view is shared by other experts in the area.
The work is labor-intensive. Imaging is typically done in the U.S., but any extensive keying work is usually completed overseas. "[In-country] capability has essentially disappeared," said David Bearman, president of Archives & Museum Informatics, a Toronto-based consulting firm.
The Smithsonian will launch its Web site on Wednesday, providing images of written documents as well as some of the artwork. But Smithsonian employees and volunteers will be working through much of the year to complete the XML work.
The Smithsonian project, which will include 15,000 images, has cost about $50,000, not including staff time or volunteer efforts. Text images were scanned using optical character-recognition technology, followed by proofreading and keystroking, if needed, and encoding. The accuracy level was specified at 99.997%, or about one error per page of manuscript. Prices rise exponentially for any level above that, said Kalfatovic.
Major users of the sort of service being provided to the Smithsonian include universities and law firms. And in libraries, bringing such material online and infusing it with some intelligence through XML is opening new avenues for knowledge exchange.
The goal "is to make sure content you have works well with content you don't produce," said David Seaman, director of the Digital Library Federation in Washington.
Read more about BI and Analytics in Computerworld's BI and Analytics Topic Center.



- Excel 2010 Cheat Sheet
- Register for this Computerworld Insider Cheat Sheet and gain access to hundreds of premium content articles, guides, product reviews and more.
- X-Ray of the PCI Process-4 Proactive Steps
- This white paper from Forrester Research Inc., helps break PCI into understandable components. Security and risk professionals will gain knowledge and insight into...
- Forrester: Economic Impact of Switching to Google Apps
- Content provided by Google
Read this Forrester report on the "total economic impact" of Google Apps, and learn how switching to Google Apps creates... - Intelligent Systems: Unlocking Hidden Business Value with Data
- An intelligent system enables data to flow across an enterprise infrastructure, spanning the devices where valuable data is gathered from employees and customers,...
- Concepts of NonStop SQL/MX
- For DBAs and developers who are familiar with Oracle solutions and want to learn about NonStop SQL/MX, this whitepaper provides an overview of...
- HP Advanced Information Services for SAP In-Memory Appliance (SAP HANA)
- Organizations are eager to connect the vast amounts of data available within and outside their businesses to compete more effectively and make better... All BI and Analytics White Papers
- Quantifying the Business Value of VMware View - Webcast
- Many enterprises have discovered that the use of virtualization to support desktop workloads creates a range of significant benefits. These benefits include price...
- Good to Great - How to Take Business Analytics to the Next Level
- By attending this webcast you will learn how you can implement an effective BA strategy that will deliver maximum strategic value to your...
- Supporting Mobile Productivity With A Limited IT Budget
- Join us and hear from Kaseya mobile IT management experts as we discuss core strategies for supporting the mobile revolution on a shoestring...
- User Experience Monitoring
- In this webinar, you will learn hints & tips for improving end-user response times from Forrester Research analyst, Jean-Pierre Garbani.
- Hints & Tips Cisco
- Overwhelmed by tracking your Vblock, Flexpod or Cisco UCS performance? Spend one hour with Nimsoft to learn how you can eliminate the overhead... All BI and Analytics Webcasts