Data Analytics and Computation 758 - Text as Data

Fall
2023
01
3.00
Min Pang

M W 4:00PM 5:15PM

UMass Amherst
83084
Machmer Hall room W-13
mrpang@umass.edu
With the recent explosion in availability of digitized text, social scientists increasingly are turning to computational tools for the analysis of text as data. In this course, students will first learn how to convert text to formats suitable for analysis. From there, the course will introduce and proceed through tutorials on a variety of natural language processing approaches to the treatment of text-as-data. This will include relatively simple dictionary approaches for measurement, supervised learning approaches for document classification, vector representations, contextualized embeddings, and more.

Open to DACSS master's students. DACSS 601 Open to DACSS students; others by instructor permission.

This course fills a technical elective requirement for the DACSS Masters degree and the DACSS Advanced Certificate.

This course assumes a working knowledge of R. If you do not have a strong background in R (e.g., have not already taken and passed DACSS 601 Data Science Fundamentals or an equivalent), please contact the instructor to discuss strategies for preparing for this course.

Permission is required for interchange registration during the add/drop period only.