Oct
15

Foursquare’s MongoDB Outage

Posted on October 15, 2010 by Ron Bodkin

Foursquare recently suffered a total site outage for eleven hours. The outage was caused by unexpected uneven growth in their MongoDB database that their monitoring didn’t detect. The system outage was prolonged when an attempt to add a partition didn’t work due to fragmentation, and required taking the database offline to compact it. This article…

0 Comments
Aug
4

LinkedIn’s Data Infrastructure

Posted on August 4, 2010 by Ron Bodkin

Jay Kreps of LinkedIn presented some informative details of how they process data at the recent Hadoop Summit. Kreps described how LinkedIn crunches 120 billion relationships per day and blends large scale data computation with high volume, low latency site serving. Much of LinkedIn’s important data is offline – it moves fairly slowly. So they…

0 Comments
Jul
14

Facebook on Hadoop, Hive, HBase, and A/B Testing

Posted on July 14, 2010 by Ron Bodkin

The Hadoop Summit of 2010 included presentations from a number of large scale users of Hadoop and related technologies. Notably, Facebook presented a keynote and details information about their use of Hive for analytics. Mike Schroepfer, Facebook’s VP of Engineering delivered a keynote describing the scale of their data processing with Hadoop. Schroepfer gave an…

1 Comments
Page 8 of 9