Computer Science 532 - Systems for Data Science

Fall
2019
02
3.00
Marco Serafini
TU TH 2:30PM 3:45PM
UMass Amherst
25860
Morrill 2 Room 222
mserafini@umass.edu
25859
In this course, students will learn the fundamentals behind large-scale systems in the context of data science. We will cover the issues involved in scaling up (to many processors) and out (to many nodes) parallelism in order to perform fast analyses on large datasets. These include locality and data representation, concurrency, distributed databases and systems, performance analysis and understanding. We will explore the details of existing and emerging data science platforms, including MapReduce-Hadoop, Spark, and more.
Open to MS-CMPSCI students. LECT 01 FOR UNDERGRADS; LECT 02 FOR GRADS. SEATS HELD IN LECT 02 FOR INCOMING GRAD STUDENT REGISTRATION. WAS COMPSCI 590S. STUDENTS NEEDING SPECIAL PERMISSION MUST REQUEST OVERRIDES VIA THE ON-LINE FORM: https://www.cics.umass.edu/overrides.
Permission is required for interchange registration during all registration periods.