
Internet Archive expands book digitizing effort

Internet Archive has so far scanned in 100,000 books

By Juan Carlos Perez
December 20, 2006 12:00 PM ET

IDG News Service - The Internet Archive has received a grant from the Alfred P. Sloan Foundation to expand its book-digitizing efforts, which so far have resulted in the scanning of about 100,000 books now available on the group's Web site.

The grant will also benefit the Open Content Alliance, an initiative launched in October 2005 and backed by the Internet Archive, Yahoo Inc. and others to digitize books and multimedia material and make them available online, the Internet Archive announced Wednesday.

The scanned works hosted by the Internet Archive are also available for indexing by any search engine that adheres to the OCA's open-access terms for the content. These principles include providing "the greatest possible degree of access to and reuse of collections in the archive, while respecting the rights of content owners and contributors," according to the OCA Web site.

The Sloan Foundation awarded the grant to support the digitization of historical collections from five major libraries by the Internet Archive, a nonprofit organization building an online library of texts, audio, video, software and Web pages.

The $1 million grant will be used in part to scan the complete personal library of founding father and U.S. President John Adams, housed at the Boston Public Library. Meanwhile, the Getty Research Institute in Los Angeles is making available art, architecture and performing arts books.

The archive of publications issued by New York City's Metropolitan Museum of Art will also be digitized, as well as California Gold Rush primary texts from the University of California at Berkeley's Bancroft Library. Finally, the Internet Archive will also scan the James Birney Collection of Anti-Slavery materials from Johns Hopkins University libraries in Baltimore.

Scanning books to make them available online has become a controversial practice, primarily because of Google Inc.'s approach. The search engine giant is digitizing library collections that include copyrighted books without always asking for permission from the copyright owners. It indexes the full text of these works and makes them searchable through its Book Search service.

Google faces lawsuits alleging that this practice violates copyright law. Google claims it is protected by the fair use principle because it displays only snippets of text from copyrighted works.

The Internet Archive has refrained from digitizing copyrighted books, although it is interested in seeing copyright issues worked out, because its ultimate goal is to provide access to as many works as possible for the benefit of people worldwide, said Brewster Kahle, Internet Archive founder.

For example, Kahle is interested in sorting out the issue of books whose copyright owners can't be found, often called "orphan works," as well as the issue of copyrighted works that are out of print. In these two cases, Kahle believes that libraries should take a leading role in finding "the right path through it." In the case of in-print copyrighted books, a collaboration between libraries and publishers could generate significant progress, he said.

While others are criticizing Google for its wholesale scanning of copyrighted works, Kahle finds fault with the agreements the company is hammering out with its partner libraries. In his opinion, the contracts put too many restrictions on how libraries and people may use and share digital copies of public-domain works. "Google has bound the libraries pretty tightly," he said. "Public domain works should stay in the public domain."

Google didn't immediately respond to a request for comment.

In addition to Yahoo and the aforementioned libraries, participants in the OCA include Microsoft Corp., Adobe Systems Inc., Columbia University, Hewlett-Packard Co., the University of Toronto, Xerox Corp. and the University of North Carolina at Chapel Hill.

Reprinted with permission from IDG.net. Story copyright 2010 International Data Group. All rights reserved.