May 08

Atbrox is a startup company providing technology and services for Search and Mapreduce/Hadoop. Our background is from Google, IBM and research. Contact us if you need help with algorithms for mapreduce.

This posting is the May 2010 update to a similar posting from February 2010. It adds 30 papers to the prior posting; new ones are marked with *.

Motivation
Learn from academic literature how the mapreduce parallel model and the hadoop implementation are used to solve algorithmic problems.

Which areas do the papers cover?

Who wrote the above papers?
Companies: China Mobile, eBay, Google, Hewlett Packard and Intel, Microsoft, Wikipedia, Yahoo and Yandex.
Government Institutions and Universities: US National Security Agency (NSA), Carnegie Mellon University, TU Dresden, University of Pennsylvania, University of Central Florida, National University of Ireland, University of Missouri, University of Arizona, University of Glasgow, Berkeley University and National Tsing Hua University, University of California, Poznan University, Florida International University, Zhejiang University, Texas A&M University, University of California at Irvine, University of Illinois, Chinese Academy of Sciences, Vrije Universiteit, Engenharia University, State University of New York, Palacky University, University of Texas at Dallas.

Atbrox on LinkedIn

Best regards,
Amund Tveit (Atbrox co-founder)


22 Responses to “Mapreduce & Hadoop Algorithms in Academic Papers (3rd update)”

  1. Neil Conway Says:

    You might consider adding: “Zhao et al. Botgraph: Large scale spamming botnet detection.” from NSDI’09. They describe an approach to using Dryad/MapReduce for detecting spamming botnets at webmail providers like Hotmail.

  2. helwr Says:

    MPI-MapReduce (released Mar 2010)
    http://www.sandia.gov/~sjplimp/mapreduce.html

  3. Alex Says:

    great job!
    I’ve compiled this Mapreduce bibliography which you may find useful: http://www.columbia.edu/~ak2834/mapreduce.html

  4. Matteo Says:

    A Model of Computation for MapReduce (SODA 2010): http://research.yahoo.com/pub/2945

  5. MapReduce/Hadoop Algorithms | Matteo's Wasps' Nest Says:

    [...] I would like to do is to start from scientific articles published in conferences/journal (here is a good bibliography) and describe the techniques they used to design (and when possible, analyze) algorithms on MR/H. [...]

  6. pinboard May 21, 2010 — arghh.net Says:

    [...] Mapreduce & Hadoop Algorithms in Academic Papers (3rd update) [...]

  7. Mark Kerzner Says:

    Hi, very useful, thank you. This one is a link, not a download

    Large-Scale Behavioral Targeting (2009)

  8. Resultado de ALT.NET Hispano VAN sobre NoSQL - Angel "Java" Lopez Says:

    [...] Scalability of the Hadoop distributed file system MapReduce Hadoop algorithms in academic papers [...]

  9. NoSQL Resources « Angel “Java” Lopez on Blog Says:

    [...] Scalability of the Hadoop distributed file system MapReduce Hadoop algorithms in academic papers [...]

  10. Mapreduce and Hadoop Algorithms in Bioinformatics Papers | Abhishek Tiwari Says:

    [...] 2010 August 10 tags: Algorithm by abhishektiwari Purely inspired by Atbrox’s list of academic papers for Mapreduce & Hadoop Algorithms. Unlike computer science where applications of Mapreduce/Hadoop are very much diversified, most of [...]

  11. Quora Says:

    Are there any data mining options for NoSQL databases?…

    Have a look at: Pig http://pig.apache.org/ Hive http://hive.apache.org/ Cascading http://www.cascading.org/ Cascalog https://github.com/nathanmarz/cascalog Mahout http://mahout.apache.org/ Mapreduce & Hadoop Algorithms in Academic Papers (3rd update) h…

  12. Nona Mills Says:

    [...] I would like to do is to start from scientific articles published in conferences/journal (here is a good bibliography) and describe the techniques they used to design (and when possible, analyze) algorithms on MR/H. [...]

  13. Hadoop Use Cases | Data Musings Says:

    [...] 2. Hadoop, in spite of its popularity, has not threatened the Relational Databases, and rightly so. Hadoop and Relational Databases complement each other in many ways. The listing above confirms this fact. Hadoop has been mainly used as a tool to store large amounts of raw data and execute increasingly complicated algorithms to achieve effective business decisions like recommendations, search indices and the like. I came across the list of some very interesting problems here: Map-Reduce and Hadoop Algorithms in Academic Papers [...]

  14. Mapreduce and Hadoop Algorithms in Bioinformatics Papers | Abhishek Tiwari Says:

    [...] MapReduce, Popular // abhishektiwariView Comments // Purely inspired by Atbrox’s list of academic papers for Mapreduce & Hadoop Algorithms. Unlike computer science where applications of Mapreduce/Hadoop are very much diversified, most of [...]

  15. Jeff Hammerbacher Says:

    Hey Amund,

    I’ve started collecting the subset of these papers which are about applications of MapReduce over on Mendeley in a public group: http://www.mendeley.com/groups/1058401/mapreduce-applications/. Feel free to join and add papers!

    Later,
    Jeff

  16. Twitted by vambati Says:

    [...] This post was Twitted by vambati [...]

  17. mapreduce Says:

    I’ve found another one from the MapReduce-MPI site.

    Parallelizing BLAST and SOM algorithms with MapReduce-MPI library, S.-J. Sul and A. Tovchigrechko, IEEE International Parallel & Distributed Processing HICOMB Symposium, (2011).

  18. 知行和一 » mapreduce框架的算法 Says:

    [...] 2、http://atbrox.com/2010/05/08/mapreduce-hadoop-algorithms-in-academic-papers-may-2010-update/ [...]

  19. Four short links: 17 May 2010 - O'Reilly Radar Says:

    [...] MapReduce and Hadoop Algorithms in Academic Papers — a collection of such papers, interesting for those who wrangle big data. (via tlockney on delicious) [...]

  20. Xử lý dữ liệu phân tán bằng Hadoop » Góc IT Says:

    [...] list of Hadoop and MapReduce algorithms in academic papers. This website provides an interesting perspective on how Hadoop is used for [...]

  21. Distributed data processing with Hadoop » Góc IT Says:

    [...] out this list of Mapreduce and Hadoop algorithms in academic papers. This site provides an interesting perspective on how Hadoop is used for a variety of applications [...]

  22. Mapreduce & Hadoop Algorithms in Academic Papers (3rd update) | Luciuz's Homepage Says:

    […] Mapreduce & Hadoop Algorithms in Academic Papers (3rd update) […]
