Facebook rolls out storage system to wrangle massive photo stores
Homegrown system, Facebook Haystack, built to handle multiplying files of user photos
Computerworld - Needing to better deal with 50 billion files worth of photos, engineers at Facebook are installing a new photo storage system they say is 50% faster than traditional systems.
The storage system, dubbed Haystack, has been under development in-house for the past couple of years, and Facebook has been rolling it out in limited test versions to parts of the network for the past few months. The company expects to use Haystack to store all Facebook photos by next week, according to Bobby Johnson, director of engineering at Facebook.
And Jonathan Heiliger, vice president of technical operations, told Computerworld today that based on tests, Haystack is more than 50% faster than traditional photo storage systems.
"In terms of cost, if it's twice as efficient, we can have 50% less hardware," said Johnson. "With 50 billion files on disk, the cost adds up. It's essentially giving us some [financial] headroom."
Johnson and Heiliger said they began building the new storage system to better handle the growing number of photos Facebook has to store. Many of its 175 million to 200 million users share photos of everything from their pets to vacations, weddings and days at the beach. That means users are posting and calling up their own photos, as well as their friends' and family members' photos. Keeping the system running efficiently was a growing challenge.
Johnson noted that Facebook stores 15 billion photos, not counting replicas. User data grows by 500GB per day, and Facebook's back-end servers handle 50 million requests per second.
A spokesman for Facebook said more specifics about the new system will be released in a few weeks.
Johnson, though, said the system is so much faster than the previous one because of changes made to its setup. Haystack is tailored for a large number of small files that don't change very often, instead of for a small number of large files that are changing all the time. Traditional file systems also locate files by name through directories, and a lot of resources go to just finding the files. The new system uses ID numbers instead of names; that mapping is very small and doesn't involve directory structures or file names.
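Facebook has not yet published Haystack's design, so the following is only a rough sketch of the general idea Johnson describes: photos are looked up by ID number through a small mapping rather than by file name through directories. The class name `TinyHaystack`, the single append-only store file and the in-memory dictionary are all assumptions made for illustration, not details of Facebook's system.

```python
import os
import tempfile


class TinyHaystack:
    """Toy append-only photo store keyed by integer ID (illustrative only)."""

    def __init__(self, path):
        self.path = path
        # The entire "directory": photo_id -> (offset, size) in one big file.
        self.index = {}
        # Create the store file if it does not exist yet.
        open(self.path, "ab").close()

    def put(self, photo_id, data):
        # New photos are appended; existing bytes are never rewritten.
        offset = os.path.getsize(self.path)
        with open(self.path, "ab") as f:
            f.write(data)
        self.index[photo_id] = (offset, len(data))

    def get(self, photo_id):
        # One dictionary lookup, one seek, one read; no file names involved.
        offset, size = self.index[photo_id]
        with open(self.path, "rb") as f:
            f.seek(offset)
            return f.read(size)


if __name__ == "__main__":
    store = TinyHaystack(os.path.join(tempfile.mkdtemp(), "photos.dat"))
    store.put(1001, b"pet photo bytes")
    store.put(1002, b"beach photo bytes")
    print(store.get(1002))
```

In this sketch each photo costs one small index entry, which is why the mapping can stay compact even when the number of stored files is very large.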
Johnson said that so far, the rollout of the new system has gone very smoothly.
Five-year-old Facebook's user base passed one-time leader MySpace last year, according to a recent report.
Facebook, once regarded as the up-and-coming social network, had almost 222 million unique visitors last month, while MySpace came in at 125 million, according to online researcher comScore Inc. That's a dramatic change, since the Facebook-MySpace race for unique visitors was a near dead heat in April 2008.
The company is closing in on a big milestone -- 200 million users, executives said today.