Monthly Archives: February 2010

Initial Thoughts on Yahoo’s Ranking Challenge

Posted on February 28, 2010 by Amund Tveit

Atbrox is startup company providing technology and services for Search and Mapreduce/Hadoop. Our background is from Google, IBM and Research. Yahoo recently announced the Learning to Rank Challenge – a pretty interesting web search challenge (as the somewhat similar Netflix … Continue reading →

Posted in cloud computing | Tagged classification, machine learning, netflix, ranking, regression, relevance, search, yahoo | Leave a comment

So, what is Hadoop?

Posted on February 17, 2010 by Amund Tveit

Atbrox is startup company providing technology and services for Search and Mapreduce/Hadoop. Our background is from Google, IBM and Research. Hadoop is a set of open source technologies that supports reliable and cost-efficient ways of dealing with large amounts of … Continue reading →

Posted in cloud computing | Tagged bigtable, facebook, gfs, google, hadoop, hbase, mapreduce, thrift, yahoo, zookeeper | Leave a comment

Mapreduce & Hadoop Algorithms in Academic Papers (updated)

Posted on February 12, 2010 by Amund Tveit

The newest and most up-to-date version (May 2010) this blog post is available at http://mapreducebook.org Atbrox is startup company providing technology and services for Search and Mapreduce/Hadoop. Our background is from from Google, IBM and Research. This posting is an … Continue reading →

Posted in cloud computing, Hadoop and Mapreduce | Tagged algorithms, hadoop, machinelearning, mapreduce, search | 7 Comments

Parallel Machine Learning for Hadoop/Mapreduce – A Python Example

Posted on February 8, 2010 by Amund Tveit

Atbrox is startup providing technology and services for Search and Mapreduce/Hadoop. Our background is from from Google, IBM and Research. Update 2010-June-17 Code for this posting is now on github –http://github.com/atbrox/Snabler This posting gives an example of how to use … Continue reading →

Posted in cloud computing, Hadoop and Mapreduce, infrastructure | Tagged github, hadoop, machine learning, machinelearning, mapreduce, open source, python, ridge regression, svm | 14 Comments

Monthly Archives: February 2010

Initial Thoughts on Yahoo’s Ranking Challenge

So, what is Hadoop?

Mapreduce & Hadoop Algorithms in Academic Papers (updated)

Parallel Machine Learning for Hadoop/Mapreduce – A Python Example

Archives

Meta