Back to projects
Election Data
Shipped · 2024Political Finance Analytics Platform
A PySpark analytics pipeline for U.S. election donations at scale
Role · Architect
A scalable analytics platform for U.S. political-finance data. It architects a PySpark-based ETL pipeline over election-donation records, optimizing throughput and benchmarking distributed systems to surface statewide donor insights.
Donor insight at scale
A data-engineering-forward project: the interesting work is in making the pipeline fast and the distributed system well-characterized, so statewide donor patterns fall out cleanly.