Computer Science 532 - Systems for Data Science
Spring
2021
02
3.00
Hui Guan
M W 4:00PM - 5:15PM
UMass Amherst
84341
Fully Remote Class
huiguan@umass.edu
84340
In this course, students will learn the fundamentals behind large-scale systems in the context of data science. We will cover the issues involved in scaling parallelism up (to many processors) and out (to many nodes) in order to perform fast analyses on large datasets. These include locality and data representation, concurrency, distributed databases and systems, and the analysis and understanding of performance. We will explore the details of existing and emerging data science platforms, including MapReduce/Hadoop, Spark, and more.
Open to MS-CMPSCI students. Lecture 01 is for undergraduates; Lecture 02 is for graduate students. Seats are held in Lecture 02 for incoming student registration. Students needing special permission must request overrides via the online form: https://www.cics.umass.edu/overrides.