Ads by TechWords

See your link here
Receive the latest technology news and information.
Security
Computerworld Daily News (First Look and Wrap-Up)
Computerworld Blogs Newsletter
The Weekly Top 10
Cloud Computing
View all newsletters




Privacy Policy
 

Google used rival's database 'inadvertently'

Not saying how it happened, only that it did

April 10, 2007 12:00 PM ET

IDG News Service - After evading the question for four days, Google Inc. folded on Monday and admitted in a blog post that developers at Google China had copied part of a software tool from rival Chinese Internet company Sohu.com Inc. for one of its own products. But Google later said the database was used unintentionally.

Google has yet to explain exactly how portions of a dictionary of Chinese words and names developed by Sohu -- which had not been made public or licensed for use outside Sohu -- ended up inside its Google Pinyin Input Method Editor (IME), saying only that it was an accident and the Sohu database was used to develop Google's product.

"Shortly after the product was released, we learned that content from a non-Google database had been inadvertently integrated into our dictionary," Google said Monday in an e-mail response to questions. The statement offered no further details of how the dictionary became integrated with Google's software.

On the surface, using part of a rival's copyrighted software in this way appears to violate Google's Code of Conduct.

"We respect our competitors and, above all else, believe in fair play in all circumstances; we would no sooner use a competitor's confidential information to our advantage than we would wish them to use ours," the Code says. "If an opportunity arises to take advantage of competitors' confidential information, remember: don't be evil. We compete, but we don't cheat."

While Google employees found to violate the Code "will be subject to disciplinary action, up to and including termination of employment," the company has not said whether such steps have been taken in this case.

On Sunday, Google released an updated version of its Pinyin IME with a new dictionary. That revision and an apology issued on Monday may have headed off a legal showdown with Sohu, but the damage to Google's reputation among Chinese Internet users was already done.

"Their image of innovation and 'don't be evil' was almost destroyed," said Jason Yin, managing director of market research firm In-Stat China, calling the events that unfolded over the weekend a "PR disaster" for Google China.

Pinyin IMEs are widely used in China as a way to type Chinese characters using their Pinyin romanization equivalents. Each IME draws on a built-in dictionary of Chinese words and names to suggest possible matches for users as they type Pinyin. These dictionaries take time and effort to compile, and ultimately determine the difference between a good IME and a bad one.

In the case of Sohu, two engineers spent more than a year compiling its dictionary, drawing on a database of popular search queries from the company's Sogou search engine.

Juan Carlos Perez, in Miami, contributed to this report.


Reprinted with permission from

IDG.net
Story copyright 2009 International Data Group. All rights reserved.

Jump to comments

Google china copied part of a software tool

Additional Resources

EFD vs. HDD - What You Need to Know
WHITE PAPER
Enterprise flash drives provide a new Tier 0 storage layer capable of delivering high I/O performance at a very low latency. Proper use of EFDs in an Oracle environment can deliver increased performance compared to fibre channel drives. Read the recommendations for identification of the best DB components for EFDs.
Gartner Research Report: Magic Quadrant for Application Delivery Controllers, 2009
WHITE PAPER
The market for products to improve the delivery of application software over networks remains dynamic and innovative. Vendors focused on solving enterprises' most-pressing application problems have become the top players.
Eight Criteria for Server Load Balancing
WHITE PAPER
Server load balancers are a simple yet highly effective means to scale an application environment while ensuring its availability. Today's solutions should also address application performance and security. Read about the top eight criteria you should consider when choosing a server load balancer and how Citrix NetScaler meets those requirements.

What People Are Saying

White Papers & Webcasts

Death to PST Files
Download Now  

Web 2.0, Social Media and the Dark Web - A Web Criminals Paradise?
In this discussion, learn about the challenges of protecting your users from the potentially unsafe content hidden in the "Dark Web".

eGuide: Enterprise Security
Smart Security Strategies for 2010. Read now!  

Disaster Recovery 2008: Reduced Costs and Improved Performance
How long can your Enterprise afford to be without your data? With an accelerated disaster recovery program, you never have to answer this...


IT Jobs