Tag Archives: mapreduce

Preliminary Experiences Crawling with 80legs

80legs is a company specializing in the crawling and preprocessing part of search, where you can upload your seed urls (where to start crawling), configure your crawl job (depth, domain restrictions etc.) and also run existing or custom analysis code … Continue reading

Posted in cloud computing | Tagged , , , , , | 3 Comments

How to use C++ Compiled Python for Amazon’s Elastic Mapreduce (Hadoop)

Sometimes it can be useful to compile Python code for Amazon’s Elastic Mapreduce into C++ and then into a binary. The motivation for that could be to integrate with (existing) C or C++ code, or increase performance for CPU-intensive mapper or … Continue reading

Posted in cloud computing | Tagged , , , , , , , | 7 Comments

Hadoop World 2009 – some notes from application session

Other recommended writeups : Hadoop World NYC (Hilary Mason) The View from HadoopWorld (Stephen O’Grady) Post Hadoop World Thoughts (Deepak Singh) Hadoop World, NYC 2009 (Dan Milstein) Hadoop World Impressions (Steve Laniel) — Location: Roosevelt Hotel, NYC 1235 Joe Cunningham … Continue reading

Posted in cloud computing | Tagged , , , , | 1 Comment

atbrox ready for business

We are here to help you: Understand if and how the cloud can be cost-efficient in your setting Efficiently analyze large data sets using the cloud Architect, develop and deploy scalable and reliable software for the cloud Adapt and migrate … Continue reading

Posted in Atbrox | Tagged , , , , , , , , , | Leave a comment