Home > Big Data > Hadoop at Twitter – Dmitriy Ryaboy from April 2010

Hadoop at Twitter – Dmitriy Ryaboy from April 2010


Another in a series of posts on Big Data, NoSQL, and Hadoop. Previous recent related posts include:

In the video below, Dmitriy Ryaboy of Twitter provides an overview of the Hadoop stack at Twitter (from April 2010):

In a nutshell, here’s Twitter’s Big Data/Hadoop technology stack:

Also some nice discussion on the use of Pig for large-scale data analysis in Hadoop (without having to write sequences of data manipulation in MapReduce). The following slide summarize the benefits of using Pig:

And here’s a summary slide highlighting why Twitter uses Pig over SQL for expressing data query and transformation syntax:

glenn

Advertisements
Categories: Big Data Tags: , ,
  1. No comments yet.
  1. No trackbacks yet.

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out / Change )

Twitter picture

You are commenting using your Twitter account. Log Out / Change )

Facebook photo

You are commenting using your Facebook account. Log Out / Change )

Google+ photo

You are commenting using your Google+ account. Log Out / Change )

Connecting to %s

%d bloggers like this: