Length: 10 hours - 2 cfu
Abstract
As a fundamental tool in modeling and analyzing real world data, large-scale algorithms are a central part of any tool set for big data analysis. Processing datasets with hundreds of billions of entries is only possible via developing distributed algorithms under distributed frameworks such as MapReduce, Pregel, Gigraph, and alike. For these distributed algorithms to work well in practice, we need to take into account several metrics such as the number of rounds of computation and the communication complexity of each round. Given the popularity and ease-of-use of MapReduce framework, developing practical algorithms with good theoretical guarantees for basic algorithmic primitives is a problem of great importance. In this course, we discuss how to design and implement algorithms based on traditional MapReduce architecture. In this regard, we discuss various basic algorithmic problems such as computing connected components, maximum matching, MST, counting triangle, clustering, diversity maximization and so on so for. In particular, we discuss a computation model for MapReduce and describe various techniques to develop efficient algorithms in this framework.
Dates & Venue
Giorni | Aula | Orario |
21/11/2022 | Room Meeting 8° floor - Via Celoria 18 - 20133 Milan |
14:00-16:30 |
22/11/2022 | Room Meeting 8° floor - Via Celoria 18 - 20133 Milan |
09:30-11:00 |
23/11/2022 | Room Meeting 8° floor - Via Celoria 18 - 20133 Milan |
09:30-11:00 |
24/11/2022 | Room Meeting 8° floor - Via Celoria 18 - 20133 Milan |
09:30-11:00 |
25/11/2022 | Room Meeting 8° floor - Via Celoria 18 - 20133 Milan |
09:30-11:30 |
Lecturer:
Dr. Silvio Lattanzi - Google Research
Prof. Nicolò Cesa Bianchi - Dipartimento di Informatica
Assessor:
Dr. Silvio Lattanzi - Google Research