Computer Science 590S - Systems for Data Science
Fall
2016
02
3.00
Emery Berger
TU TH 2:30PM 3:45PM
UMass Amherst
80344
80342
In this course, students will learn the fundamentals behind large-scale systems in the context of data science. We will cover the issues involved in scaling up (to many processors) and out (to many nodes) parallelism in order to perform fast analyses on large datasets. These include locality and data representation, concurrency, distributed databases and systems, performance analysis and understanding. We will explore the details of existing and emerging data science platforms, including map-reduce and graph analytics systems like Hadoop and Apache Spark.
Open to MS-CMPSCI students. LECT 01 FOR UNDERGRADS; LECT 02 FOR GRADS. 8 SEATS HELD IN LECT 02 FOR INCOMING STUDENT REGISTRATION. STUDENTS NEEDING SPECIAL PERMISSION MUST REQUEST OVERRIDES VIA THE ON-LINE FORM: https://www.cics.umass.edu/overrides.