Computer Science 532 - Systems for Data Science
Fall
2021
02
3.00
Hui Guan
M W 4:00PM - 5:15PM
UMass Amherst
12538
Hasbrouck Laboratory Addition, Room 126
huiguan@umass.edu
12537
In this course, students will learn the fundamentals behind large-scale systems in the context of data science. We will cover the issues involved in scaling parallelism up (to many processors) and out (to many nodes) in order to perform fast analyses on large datasets. These include locality and data representation, concurrency, distributed databases and systems, and performance analysis and understanding. We will explore the details of existing and emerging data science platforms, including MapReduce-Hadoop, Spark, and more.
Open to MS-CMPSCI students. LECT 01 is for undergraduates; LECT 02 is for graduate students. Seats are held in LECT 02 for incoming student registration. Starting in Fall 2022, the prerequisite will change, and undergraduates will need to have completed COMPSCI 377 and COMPSCI 445 with a C or better. Students needing special permission must request overrides via the online form: https://www.cics.umass.edu/overrides