Computer Science 532 - Systems for Data Science
Spring
2021
01
3.00
Hui Guan
M W 4:00PM 5:15PM
UMass Amherst
84340
Fully Remote Class
huiguan@umass.edu
84341
In this course, students will learn the fundamentals behind large-scale systems in the context of data science. We will cover the issues involved in scaling up (to many processors) and out (to many nodes) parallelism in order to perform fast analyses on large datasets. These include locality and data representation, concurrency, distributed databases and systems, performance analysis and understanding. We will explore the details of existing and emerging data science platforms, including MapReduce-Hadoop, Spark, and more.
Open to junior and senior Computer Science majors who have completed COMPSCI 311, 345, and 377 with a grade of B or better. LECT 01 FOR UNDERGRADS; LECT 02 FOR GRADS. SEATS HELD IN LECT 02 FOR INCOMING STUDENT REGISTRATION. STUDENTS NEEDING SPECIAL PERMISSION MUST REQUEST OVERRIDES VIA THE ON-LINE FORM: https://www.cics.umass.edu/overrides.