
MapReduce

Mark Mzyk | January 22, 2008

If, like me, you’ve been seeing various blogs talk about MapReduce, you might be wondering exactly what MapReduce is.

You can always turn to the Wikipedia entry. But sometimes you’d like to dig a little deeper, and a good whitepaper might hit the spot. So here it is: MapReduce: Simplified Data Processing on Large Clusters.

Thanks go out to Eric for pointing me to that whitepaper.

Filed in: Programming.

4 Comments

  1. Comment by Dan Creswell:

    And for those with some extra time on their hands, there’s a bunch of videos on the subject, e.g. from the scalability conference of last year:

    http://video.google.com/videosearch?q=seattle+scalability+conference

    January 22, 2008 @ 03:39
  2. Comment by Kevin Smith:

    And for those with a lot of extra time and the yen to read working code, the Hadoop project is an open-source implementation of MapReduce in Java:

    http://lucene.apache.org/hadoop

    January 22, 2008 @ 23:08
  3. Pingback from The Key To World Domination: Good Writing | Programmer’s Paradox:

    […] posted on MapReduce before, but I want to revisit the topic here.  Not MapReduce itself, but instead the white […]

    February 5, 2008 @ 20:18
  4. Pingback from Further MapReduce Observations and Ramblings | Programmer’s Paradox:

    […] posted several times about MapReduce. I’ll admit, I’m fascinated by it. Here are some observations I […]

    February 12, 2008 @ 22:57