Tag Archives: mapreduce

Combining Hadoop/Elastic Mapreduce with AWS Redshift Data Warehouse

There are currently interesting developments in scalable (up to petabytes), low-latency and affordable data warehouse solutions, e.g. AWS Redshift (cloud-based) [1], Cloudera’s Impala (open source) [2,3] and Apache Drill (open source) [4]. This posting shows how one of them – AWS … Continue reading

Posted in analytics, cloud computing, Hadoop and Mapreduce | 5 Comments

atbr – supports websocket-based sharding

atbr (a large-scale, low-latency in-memory key-value pair store) now supports websocket-based sharding for parallel deployments. Websocket Sharding Example: check out and install atbr; start 3 servers loaded with data; start a shard server talking to the shards; connect to the shard server and lookup … Continue reading
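The idea of a shard server sitting in front of several key-value servers can be sketched roughly as below. This is only an in-process illustration in Python, not atbr's actual code or API: the `ShardRouter` class, the `build_demo_router` helper, and the choice of CRC32-modulo routing are all assumptions made for the example, and plain dictionaries stand in for the three websocket-connected servers.

```python
import zlib


class ShardRouter:
    """Hypothetical stand-in for a shard server: forwards each request to one shard."""

    def __init__(self, shards):
        # Each shard is a dict here; in a real deployment it would be a remote server.
        self.shards = shards

    def shard_index(self, key):
        # Assumed routing rule: CRC32 of the key modulo the number of shards.
        # CRC32 is stable across runs, unlike Python's built-in hash() for strings.
        return zlib.crc32(key.encode("utf-8")) % len(self.shards)

    def put(self, key, value):
        self.shards[self.shard_index(key)][key] = value

    def get(self, key, default=None):
        return self.shards[self.shard_index(key)].get(key, default)


def build_demo_router(num_shards=3):
    """Create a router over num_shards empty shards and load 100 sample pairs."""
    router = ShardRouter([{} for _ in range(num_shards)])
    for i in range(100):
        router.put(f"key{i}", f"value{i}")
    return router


if __name__ == "__main__":
    router = build_demo_router()
    print(router.get("key42"))                   # lookup goes to exactly one shard
    print([len(shard) for shard in router.shards])  # how the keys were spread
```

A client only talks to the router, so the number of shards behind it can differ between deployments as long as the same routing rule is used for both loading and lookup.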

Posted in cloud computing | 1 Comment

Mapreduce in Search

I wrote about mapreduce in search in a presentation for next week: Mapreduce in Search (more up-to-date PDF version of the presentation). Best regards, Amund Atbrox

Posted in Atbrox, Hadoop and Mapreduce, infrastructure, search | 2 Comments

An example of using F# and C# (.net/mono) with Amazon’s Elastic Mapreduce (Hadoop)

This posting gives an example of how F# and C# can potentially scale up to thousands of machines with Mapreduce in order to efficiently process terabyte (TB) and petabyte (PB) amounts of data. It shows a C# (C Sharp) mapper … Continue reading

Posted in cloud computing | 8 Comments

2nd International Workshop on Mapreduce and its applications

If you are interested in Mapreduce (/Hadoop), I would like to recommend participating in or submitting your paper to the 2nd International Workshop on Mapreduce and its applications (MAPREDUCE’11). The mapreduce workshop is held in conjunction with the 20th International … Continue reading

Posted in cloud computing | Leave a comment