The project comes out of DiBona's efforts last fall to put together an informal system in which Google acts as both a repository and courier for large data sets between teams of scientists. Now, he leads a team that sets up small form-factor PCs, hooked up to drive arrays that can store up to 3 terabytes of data.
The process lightens the load, but it isn't simple: DiBona ships both the PC and array to teams of scientists at various research institutions, who then connect their local servers to the array via an eSATA connection. Once the data transfer is complete, the drives get sent straight back to Mountain View, where DiBona and others copy the data to Google's servers for archival purposes. The idea then is that if other scientists around the world needed access to such a large quantity of data, Google would simply reverse the process.
"Right now, we're just acting as a conduit," DiBona says. "We make a copy of it, and then we can use the hard drives for something else. They'll get banged around a little bit too much (to store the data directly on the drives). They're not intended to be a long-term storage medium -- they're like envelopes to us."
posted by magicbus at 5:24 PM on May 1, 2007