ElasticVapor :: Life in the Cloud
Contact CloudCamp SpotCloud Enomaly About

Tuesday, August 26, 2008

Is MapReduce going mainstream?

It's been an interesting summer for Google's MapReduce software paradigm. I'm not going to get into the finer details of MapReduce, the general idea is its Google's magic sauce, basically it's what lets them run their massively distributed data sets. So any company that wants to be like Google or needs to compete with Google should pay attention to MapReduce.

Last month Intel, HP and yahoo announced a joint research program to examine it's usage and now today Greenplum, a provider of database software for the what they describe as "next generation of data warehousing and analytics", announced support for MapReduce within its massively parallel database engine.

Greenplums announcement to integrate MapReduce functionality into its enterprise focused database is an important step toward taking MapReduce out of academic research labs and moving it to lucrative corporate users.

To give you some background, currently the two most popular implemtations of MapReduce are the open source Apache Hadoop project and unfortunately named Pig project. For those of you who don't know about about Hadoop, it is an open source platform for distributing, managing and then collecting computing work throughout a large computing cloud using MapReduce. Pig, a Yahoo Research project currently being incubated at Apache, is a language designed to make using the Hadoop infrastructure effectively. It has been described as SQL for MapReduce, allowing queries to be written and then parallelised and run on the Hadoop platform.

I found this quote interesting, it was mentioned in Greemplums press release.

"Greenplum has seamlessly integrated MapReduce into its database, making it possible for us to access our massive dataset with standard SQL queries in combination with MapReduce programs," said Roger Magoulas, Research Director, O'Reilly Media. "We are finding this to be incredibly efficient because complex SQL queries can be expressed in a few lines of Perl or Python code.

Also interesting to note that earlier this year IBM released an Eclipse plug-in that simplifies the creation and deployment of MapReduce programs. This plug-in was developed by the team at IBM Software Group's High Performance On Demand Solutions Unit at the IBM Silicon Valley Laboratory. So it may be a matter of time before we see MapReduce commercially offered by IBM.

So what's next? Will we see a Microsoft implementation or an Oracle MapReduce? For now, MapReduce appears to be the new "coolness" and with all the industry attention it seems to be getting I think we may be on the verge of finally seeing MapReduce enter the mainstream consciousness.

As a side note, my favorite MapReduce implementation is called Skynet. The name says it all.

Labels: Cloud Computing, google, greenplum, mapreduce

posted by @ruv at 6:20 PM

1 Comments :

Blogger Alex said...

Microsoft's foray into this space is called Dryad. The basic idea is the same, but Dryad constructs more general data-flow graphs and trades off some ease of use for more flexibility.

http://research.microsoft.com/research/sv/Dryad/

August 26, 2008 7:24 PM  

Post a Comment

Subscribe to Post Comments [Atom]

Links to this post :

  <$BlogBacklinkTitle$>  
<$BlogBacklinkSnippet$>
<$I18NPostedByBacklinkAuthor$> @ <$BlogBacklinkDateTime$>

Create a Link

<< Home

About Me

My Photo
Name: @ruv
Location: Toronto, Canada

Reuven Cohen is Founder & CTO for Toronto based Enomaly Inc. Founded in 2004 Enomaly is the leading developer of Cloud Computing products and solutions focused on Cloud Service providers. Enomaly's products include Enomaly ECP, a complete revenue generating cloud platform, enabling telcos and hosting providers to deliver revenue-generating Infrastructure-on-demand (IaaS) cloud computing services to their customers, quickly and easily, with a compelling and highly differentiated feature set. Reuven is also the founder of  CloudCamp (50+ Cities around the Globe) and Cloud Interoperability Forum and has consulted with the US, UK, Canadian and Japanese governments on their cloud strategies. 

View my complete profile

Reuven is also founder of several technology organizations;
> Enomaly.com
> Cloud Camp
> the Unified Cloud Interface Project
> Cloud Interoperability Forum
> Cloud Interop Magazine
> Contact Reuven

(twitter @ruv : Linkedin : RSS Feed)

Subscribe by Email

Enter your email address:

Previous Posts

  • Ruv's Law: Less is Less, More is More
  • Dell's CloudBox (Containerized Data Centers)
  • MetaCDN, Cloud based content delivery networks
  • IBM says don't use tape, use the Cloud
  • Enomalism Google Group / Discussion List
  • Enomalism's RESTful API
  • CloudVirt
  • The Standardized Cloud
  • Microsoft's Cloud Thickens
  • New Xen Features Outlined

Search Site



follow me on Twitter

Twitter Updates

    Subscribe to
    Posts [Atom]

    > Disclosure Policy

     

    public cloud hosting platform, private cloud hosting platform, turnkey cloud hosting platform, Eucalyptus cloud, 3tera, vmops, AWS, Amazon Web Services, EC2, Elastic compute Cloud, Azure, Microsoft, Xen, Vmare, KVM, Virtualization, parallels, S3, Grid Computing, The Cloud, Elastic Computing, CA, Dell, HP, Intel, EMC, , google cloud, cloud computing google , cloud computing amazon, amazon cloud, wiki cloud computing, cloud computing microsoft, cloud computing companies, grid computing, cloud computing software, google web hosting, hosting, yahoo web hosting, best web hosting, business web hosting, web hosting reviews, website hosting, web hosting sites, web hosting services, web hosting review