Back to projects

Election Data

Shipped · 2024

Political Finance Analytics Platform

A PySpark analytics pipeline for U.S. election donations at scale

Role · Architect

A scalable analytics platform for U.S. political-finance data. It architects a PySpark-based ETL pipeline over election-donation records, optimizing throughput and benchmarking distributed systems to surface statewide donor insights.

Donor insight at scale

A data-engineering-forward project: the interesting work is in making the pipeline fast and the distributed system well-characterized, so statewide donor patterns fall out cleanly.