MacroLab, UCSD: Data Pipelines and Visualizations

Dates: Sep 2025 - Jun 2026

• Extracted and organized data from public sources using the Python packages Pandas, Scrapy, and BeautifulSoup.

• Engineered data pipelines and cleaning processes.

• Synthesized data using Python and Microsoft Power BI for a study of the privatization of the media industry.

• Code for gathering data from public sources is available on my GitHub, including repositories for gathering the Pulitzer Prize list, Booker Prize winners, and the NYT and Amazon bestseller lists.

Note: The final live versions of these visualizations will be published on the MacroLab website in June 2026.

• Pulitzer Prize list project: https://github.com/ebartt/pulitzer_soup

• NYT project: https://github.com/ebartt/nytscraping_capstone

• National Book Award project: https://github.com/ebartt/natlbookawardReport

• Amazon bestsellers project: https://github.com/ebartt/amazonbookscrape
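For a sense of the extraction and cleaning steps mentioned above, here is a minimal sketch using BeautifulSoup and Pandas. It is an illustration only and is not taken from the repositories listed: the sample HTML, the table id "winners", the column names, and the function name parse_winners are all hypothetical, and the real pages and cleaning rules differ.

```python
from bs4 import BeautifulSoup
import pandas as pd

# Hypothetical sample markup standing in for a scraped awards page.
SAMPLE_HTML = """
<table id="winners">
  <tr><th>Year</th><th>Title</th><th>Author</th></tr>
  <tr><td>2001</td><td> Example Title A </td><td>Author One</td></tr>
  <tr><td>2002</td><td>Example Title B</td><td> Author Two</td></tr>
  <tr><td>2002</td><td>Example Title B</td><td> Author Two</td></tr>
</table>
"""


def parse_winners(html: str) -> pd.DataFrame:
    """Parse the (assumed) winners table into a cleaned DataFrame."""
    soup = BeautifulSoup(html, "html.parser")
    table = soup.find("table", id="winners")
    rows = table.find_all("tr")

    # The first row holds the column headers; the rest hold the data.
    header = [th.get_text(strip=True) for th in rows[0].find_all("th")]
    records = [
        [td.get_text(strip=True) for td in row.find_all("td")]
        for row in rows[1:]
    ]

    df = pd.DataFrame(records, columns=header)
    # Basic cleaning: numeric years and no repeated rows.
    df["Year"] = df["Year"].astype(int)
    return df.drop_duplicates().reset_index(drop=True)


winners = parse_winners(SAMPLE_HTML)
print(winners)
```

In this sketch, whitespace is stripped while parsing and duplicates are dropped afterwards, so the resulting table could be exported directly for use in a dashboard tool.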

Preliminary dashboards:

    [pdf]

Project link: https://macrolab.vercel.app/