Aug
4

LinkedIn’s Data Infrastructure

Posted on August 4, 2010 by Ron Bodkin

Jay Kreps of LinkedIn presented some informative details of how they process data at the recent Hadoop Summit. Kreps described how LinkedIn crunches 120 billion relationships per day and blends large scale data computation with high volume, low latency site serving. Much of LinkedIn’s important data is offline – it moves fairly slowly. So they…

0 Comments
Jul
14

Facebook on Hadoop, Hive, HBase, and A/B Testing

Posted on July 14, 2010 by Ron Bodkin

The Hadoop Summit of 2010 included presentations from a number of large scale users of Hadoop and related technologies. Notably, Facebook presented a keynote and details information about their use of Hive for analytics. Mike Schroepfer, Facebook’s VP of Engineering delivered a keynote describing the scale of their data processing with Hadoop. Schroepfer gave an…

1 Comments
Jul
12

Yahoo! Updates from Hadoop Summit 2010

Posted on July 12, 2010 by Ron Bodkin

The Hadoop Summit of 2010 started off with a vuvuzela blast from Blake Irving, Chief Product Officer for Yahoo. Yahoo delivered keynote addresses that outlined the scale of their use, technical directions for their contributions, and architectural patterns in how they apply the technology. The increasing interest in Hadoop was evident: this year’s conference had…

1 Comments
Page 3 of 4