Category Archives: Hadoop and Mapreduce
Word Count with MapReduce on a GPU – A Python Example
Atbrox is a startup company providing technology and services for Search and Mapreduce/Hadoop. Our background is from Google, IBM and research. A GPU (Graphics Processing Unit), such as the NVIDIA Tesla, is fascinating hardware, in particular regarding extreme parallelism (hundreds of … Continue reading
Mapreduce & Hadoop Algorithms in Academic Papers (3rd update)
Atbrox is a startup company providing technology and services for Search and Mapreduce/Hadoop. Our background is from Google, IBM and research. Contact us if you need help with algorithms for mapreduce. This posting is the May 2010 update to the similar … Continue reading
Posted in cloud computing, Hadoop and Mapreduce
Tagged google, hadoop, machinelearning, mapreduce, yahoo
22 Comments
Mapreduce & Hadoop Algorithms in Academic Papers (updated)
The newest and most up-to-date version (May 2010) of this blog post is available at http://mapreducebook.org. Atbrox is a startup company providing technology and services for Search and Mapreduce/Hadoop. Our background is from Google, IBM and Research. This posting is an … Continue reading
Posted in cloud computing, Hadoop and Mapreduce
Tagged algorithms, hadoop, machinelearning, mapreduce, search
7 Comments
Parallel Machine Learning for Hadoop/Mapreduce – A Python Example
Atbrox is a startup providing technology and services for Search and Mapreduce/Hadoop. Our background is from Google, IBM and Research. Update 2010-June-17: Code for this posting is now on github (http://github.com/atbrox/Snabler). This posting gives an example of how to use … Continue reading
Posted in cloud computing, Hadoop and Mapreduce, infrastructure
Tagged github, hadoop, machine learning, machinelearning, mapreduce, open source, python, ridge regression, svm
14 Comments
How to combine Elastic Mapreduce/Hadoop with other Amazon Web Services
Elastic Mapreduce's default behavior is to read from and store to S3. When you need to access other AWS services, e.g. SQS queues or the database services SimpleDB and RDS (MySQL), the best approach from Python is to use Boto. To … Continue reading
Posted in cloud computing, Hadoop and Mapreduce, infrastructure
Tagged amazon, aws, hadoop, mapreduce, python, simpledb, sqs
4 Comments