Tag Archives: amazon

An example of using F# and C# (.net/mono) with Amazon’s Elastic Mapreduce (Hadoop)

Posted on February 7, 2011 by Amund Tveit

This posting gives an an example how F# and C# can scale potentially to up to thousands of machines with Mapreduce in order to efficiently process TeraByte (TB) and PetaByte (PB) data amounts. It shows a C# (c sharp) mapper … Continue reading →

Posted in cloud computing | Tagged amazon, aws, c++, f#, hadoop, mapreduce | 8 Comments

Programmatic Deployment to Elastic Mapreduce with Boto and Bootstrap Action

Posted on October 1, 2010 by Amund Tveit

A while back I wrote about How to combine Elastic Mapreduce/Hadoop with other Amazon Web Services. This posting is a small update to that, showing how to deploy extra packages with Boto for Python. Note that Boto can deploy mappers … Continue reading →

Posted in cloud computing, Hadoop and Mapreduce | Tagged amazon, automation, aws, deployment, elastic mapreduce, hadoop, mapreduce | 4 Comments

Towards Cloud Supercomputing

Posted on May 24, 2010 by Amund Tveit

Atbrox is startup company providing technology and services for Search and Mapreduce/Hadoop. Our background is from Google, IBM and research. Update 2010-Nov-15: Amazon cluster compute instances enters 231th place on top 500 supercomputing list. Update 2010-Jul-13: Can remove towards from … Continue reading →

Posted in cloud computing | Tagged amazon, animoto, cray, dell, ec2, eli lilly, genentech, hadoop, ibm, johnson&johnson, justin.tv, mapreduce, microsoft, mpi, oracle, rackspace, sun, supercomputing, zynga | Leave a comment

Atbrox Customer Case Study – Scalable Language Processing with Elastic Mapreduce (Hadoop)

Posted on November 14, 2009 by Amund Tveit

We developed a tool for scalable language processing for our customer Lingit using Amazon’s Elastic Mapreduce. More details: http://aws.amazon.com/solutions/case-studies/atbrox/ Contact us if you need help with Hadoop/Elastic Mapreduce.

Posted in cloud computing | Tagged amazon, aws, data processing, elastic mapreduce, hadoop, language processing, nlp | 2 Comments

How to combine Elastic Mapreduce/Hadoop with other Amazon Web Services

Posted on November 11, 2009 by Amund Tveit

Elastic Mapreduce default behavior is to read from and store to S3. When you need to access other AWS services, e.g. SQS queues or database services SimpleDB and RDS (MySQL) the best approach from Python is to use Boto. To … Continue reading →

Posted in cloud computing, Hadoop and Mapreduce, infrastructure | Tagged amazon, aws, hadoop, mapreduce, python, simpledb, sqs | 4 Comments