Week 02
From Text to Matrices: Representing Text as Data
Topics
- How to represent text as data?
- What is a Bag of Words?
- What are tokens?
- Why should we care about tokens?
Readings
Required Readings
Grimmer, Justin, Margaret E. Roberts, and Brandon M. Stewart. Text as data: A new framework for machine learning and the social sciences. Princeton University Press, 2022 - Chapters 3-5
Applied Papers:
Coding Materials