#1 | 24-11-2008, 06:49 PM | glenc (Glen)
48,000 hard drives

http://googleblog.blogspot.com/2008/...mapreduce.html

"We are excited to announce we were able to sort 1TB (stored on the Google File System as 10 billion 100-byte records in uncompressed text files) on 1,000 computers in 68 seconds. By comparison, the previous 1TB sorting record is 209 seconds on 910 computers.

Sometimes you need to sort more than a terabyte, so we were curious to find out what happens when you sort more and gave one petabyte (PB) a try. One petabyte is a thousand terabytes, or, to put this amount in perspective, it is 12 times the amount of archived web data in the U.S. Library of Congress as of May 2008. In comparison, consider that the aggregate size of data processed by all instances of MapReduce at Google was on average 20PB per day in January 2008.

It took six hours and two minutes to sort 1PB (10 trillion 100-byte records) on 4,000 computers. We're not aware of any other sorting experiment at this scale and are obviously very excited to be able to process so much data so quickly.

An interesting question came up while running experiments at such a scale: Where do you put 1PB of sorted data? We were writing it to 48,000 hard drives (we did not use the full capacity of these disks, though), and every time we ran our sort, at least one of our disks managed to break (this is not surprising at all given the duration of the test, the number of disks involved, and the expected lifetime of hard disks). To make sure we kept our sorted petabyte safe, we asked the Google File System to write three copies of each file to three different disks..."
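For anyone who wants to check the arithmetic in the quote, here is a rough Python sketch. The record sizes, machine counts, run times and disk count come from the quoted post; the 3% annual disk failure rate is purely my assumption for illustration (the post gives no figure), and the helper names are made up for this example.

```python
import math

TB = 10**12            # decimal terabyte, consistent with "10 billion 100-byte records"
PB = 10**15            # one petabyte = a thousand terabytes
RECORD_BYTES = 100
SECONDS_PER_YEAR = 365 * 24 * 3600


def records(total_bytes):
    """Number of 100-byte records in a data set of the given size."""
    return total_bytes // RECORD_BYTES


def throughput(total_bytes, seconds, machines):
    """Return (aggregate bytes/s, bytes/s per machine) for one sort run."""
    aggregate = total_bytes / seconds
    return aggregate, aggregate / machines


def expected_failures(disks, seconds, annual_failure_rate):
    """Expected disk failures during a run, assuming a constant failure rate."""
    return disks * annual_failure_rate * seconds / SECONDS_PER_YEAR


PB_SORT_SECONDS = 6 * 3600 + 2 * 60   # six hours and two minutes
ASSUMED_AFR = 0.03                    # hypothetical 3% per year, NOT from the post

for label, size, secs, machines in [
    ("1TB on 1,000 computers", TB, 68, 1000),
    ("1PB on 4,000 computers", PB, PB_SORT_SECONDS, 4000),
]:
    total, per_machine = throughput(size, secs, machines)
    print(f"{label}: {records(size):,} records, "
          f"{total / 1e9:.1f} GB/s total, {per_machine / 1e6:.1f} MB/s per computer")

n = expected_failures(48_000, PB_SORT_SECONDS, ASSUMED_AFR)
# Poisson approximation for the chance of at least one failure in a run
print(f"Expected disk failures per 1PB run: {n:.2f}, "
      f"P(at least one) = {1 - math.exp(-n):.2f}")
```

The per-computer sort rate works out to roughly 14.7 MB/s for the 1TB run and 11.5 MB/s for the 1PB run. Under the assumed 3% rate, 48,000 disks over six hours gives about one expected failure per run, which fits the "at least one of our disks managed to break" remark, but change ASSUMED_AFR and the answer moves with it.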
#2 | 24-11-2008, 08:14 PM | erick (Eric)
The sort of thing people do when they are bored and waiting for clouds to clear?

This would do a super quick job for Santa - he's gotta sort six billion people into "naughty" and "nice"! Sounds like a 0.0001 second job for these guys!

Last edited by erick; 24-11-2008 at 09:14 PM.
#3 | 24-11-2008, 08:53 PM | Kal (Andrew)
When your company has around 700,000 servers, you can afford to waste some CPU cycles on small, trivial things like this.