Text as Data

Dates: Sep 2025 - Dec 2025

• Processed and analyzed the UN General Debate Corpus using text as data modeling in R.

• Trained and tested a machine learning model using Lasso regression to extract text predictors of critical speeches.

Full report:

    / [pdf]

Project link: https://github.com/ebartt/textasdata