Tag Archives: hadoop

So, what is Hadoop?

Atbrox is startup company providing technology and services for Search and Mapreduce/Hadoop. Our background is from Google, IBM and Research. Hadoop is a set of open source technologies that supports reliable and cost-efficient ways of dealing with large amounts of … Continue reading

Posted in cloud computing | Tagged , , , , , , , , , | Leave a comment

Mapreduce & Hadoop Algorithms in Academic Papers (updated)

The newest and most up-to-date version (May 2010) this blog post is available at http://mapreducebook.org Atbrox is startup company providing technology and services for Search and Mapreduce/Hadoop. Our background is from from Google, IBM and Research. This posting is an … Continue reading

Posted in cloud computing, Hadoop and Mapreduce | Tagged , , , , | 7 Comments

Parallel Machine Learning for Hadoop/Mapreduce – A Python Example

Atbrox is startup providing technology and services for Search and Mapreduce/Hadoop. Our background is from from Google, IBM and Research. Update 2010-June-17 Code for this posting is now on github –http://github.com/atbrox/Snabler This posting gives an example of how to use … Continue reading

Posted in cloud computing, Hadoop and Mapreduce, infrastructure | Tagged , , , , , , , , | 14 Comments

Atbrox Customer Case Study – Scalable Language Processing with Elastic Mapreduce (Hadoop)

We developed a tool for scalable language processing for our customer Lingit using Amazon’s Elastic Mapreduce. More details: http://aws.amazon.com/solutions/case-studies/atbrox/ Contact us if you need help with Hadoop/Elastic Mapreduce.

Posted in cloud computing | Tagged , , , , , , | 2 Comments

How to combine Elastic Mapreduce/Hadoop with other Amazon Web Services

Elastic Mapreduce default behavior is to read from and store to S3. When you need to access other AWS services, e.g. SQS queues or database services SimpleDB and RDS (MySQL) the best approach from Python is to use Boto. To … Continue reading

Posted in cloud computing, Hadoop and Mapreduce, infrastructure | Tagged , , , , , , | 4 Comments