MongoSV Conference

It was great attending the MongoSV conference last week. The conference was packed with attendees from a mix of backgrounds, showing lots of interest in scaling with MongoDB. There were some good technical talks from 10gen about their database. I…

Read More

Foursquare’s MongoDB Outage

Foursquare recently suffered a total site outage for eleven hours. The outage was caused by unexpected uneven growth in their MongoDB database that their monitoring didn't detect. The system outage was prolonged when an attempt to add a partition didn't…

Read More

LinkedIn’s Data Infrastructure

Jay Kreps of LinkedIn presented some informative details of how they process data at the recent Hadoop Summit. Kreps described how LinkedIn crunches 120 billion relationships per day and blends large scale data computation with high volume, low latency site…

Read More

Yahoo! Updates from Hadoop Summit 2010

The Hadoop Summit of 2010 started off with a vuvuzela blast from Blake Irving, Chief Product Officer for Yahoo. Yahoo delivered keynote addresses that outlined the scale of their use, technical directions for their contributions, and architectural patterns in how…

Read More

GigaOm Structure Highlights

The GigaOM Stucture conference a couple of weeks ago addressed many areas of cloud computing. One of the key themes of the event was the emergence of new data architectures. Throughout the panels, interviews, and presentations many speakers identified significant…

Read More

SQL Window Functions in Hive

One of the powerful capabilities of SQL 2003 is the ability to use window functions, e.g., this query computes the ratio of a score to the average score of lower scores, for each product select score/(sum(score)/row_number()), product over (partition by product…

Read More
  • 1
  • 2