Monthly Archives: February 2010

Initial Thoughts on Yahoo’s Ranking Challenge

Atbrox is startup company providing technology and services for Search and Mapreduce/Hadoop. Our background is from Google, IBM and Research. Yahoo recently announced the Learning to Rank Challenge – a pretty interesting web search challenge (as the somewhat similar Netflix … Continue reading

Posted in cloud computing | Tagged , , , , , , , | Leave a comment

So, what is Hadoop?

Atbrox is startup company providing technology and services for Search and Mapreduce/Hadoop. Our background is from Google, IBM and Research. Hadoop is a set of open source technologies that supports reliable and cost-efficient ways of dealing with large amounts of … Continue reading

Posted in cloud computing | Tagged , , , , , , , , , | Leave a comment

Mapreduce & Hadoop Algorithms in Academic Papers (updated)

The newest and most up-to-date version (May 2010) this blog post is available at http://mapreducebook.org Atbrox is startup company providing technology and services for Search and Mapreduce/Hadoop. Our background is from from Google, IBM and Research. This posting is an … Continue reading

Posted in cloud computing, Hadoop and Mapreduce | Tagged , , , , | 7 Comments

Parallel Machine Learning for Hadoop/Mapreduce – A Python Example

Atbrox is startup providing technology and services for Search and Mapreduce/Hadoop. Our background is from from Google, IBM and Research. Update 2010-June-17 Code for this posting is now on github –http://github.com/atbrox/Snabler This posting gives an example of how to use … Continue reading

Posted in cloud computing, Hadoop and Mapreduce, infrastructure | Tagged , , , , , , , , | 14 Comments