DISTRIBUTED MODELS, MAPREDUCE AND LARGE SCALE ALGORITHMS- November 2022

Length: 10 hours - 2 cfu

 

Abstract

As a fundamental tool in modeling and analyzing real world data, large-scale algorithms are a central part of any tool set for big data analysis. Processing datasets with hundreds of billions of entries is only possible via developing distributed algorithms under distributed frameworks such as MapReduce, Pregel, Gigraph, and alike. For these distributed algorithms to work well in practice, we need to take into account several metrics such as the number of rounds of computation and the communication complexity of each round. Given the popularity and ease-of-use of MapReduce framework, developing practical algorithms with good theoretical guarantees for basic algorithmic primitives is a problem of great importance. In this course, we discuss how to design and implement algorithms based on traditional MapReduce architecture. In this regard, we discuss various basic algorithmic problems such as computing connected components, maximum matching, MST, counting triangle, clustering, diversity maximization and so on so for. In particular, we discuss a computation model for MapReduce and describe various techniques to develop efficient algorithms in this framework.

Dates & Venue

Giorni Aula Orario
21/11/2022 Room Meeting floor - Via Celoria 18 - 20133 Milan

14:00-16:30

22/11/2022 Room Meeting floor - Via Celoria 18 - 20133 Milan

09:30-11:00

23/11/2022 Room Meeting  8° floor - Via Celoria 18 - 20133 Milan

09:30-11:00

24/11/2022 Room Meeting floor - Via Celoria 18 - 20133 Milan

09:30-11:00

25/11/2022 Room Meeting floor - Via Celoria 18 - 20133 Milan

09:30-11:30

 

Lecturer:

Dr. Silvio Lattanzi  - Google Research 

Prof. Nicolò Cesa Bianchi - Dipartimento di Informatica

 

 

Assessor:

Dr. Silvio Lattanzi  - Google Research 

Prof. Nicolò Cesa Bianchi - Dipartimento di Informatica