Resume - Tony Zhang
911tonyzhang@gmail.com
911tonyzhang@gmail.com
TL on the data and metric governance team, driving end to end data governance and monitoring systems.
Bootstrap the compliance stack with GRC UI and backend automation for compliance program.
Consolidate internal stacks for data quality monitoring and integration with Datahub for cataloging.
Build Foursquare's data stack in EMR Serverless and saved multi-million of compute cost per year.
Drive delta format adoption at Foursquare and adoption of optimization techniques like bloomfilter and liquid clustering.
Adoption of streamlit within the data org.
Build foursquare's data governance stack that handles PB of data governance in s3, with proper data skipping for efficiency gain.
Rebuild a data product for offline conversion in 3 month that contributes to 10MM of revenue for Foursquare. Lead cross-function collaboration from mvp to launch.
Maintain AWS and Databricks vendor relations, drive efficiency through long-term saving plans.
Manage a team of 10+ data engineers, dozens of spark jobs, hundreds of dags, and petabyte scale data pipelines for adtech applications.
Foster strong engineering culture on delivery, productivity (CI/CD), best practices (readability, test-driven) and support (pair-programming, design review)
Design cross-cloud architecture and manage complex pipeline migration from Hadoop stack.
Lead infrastructure design and tooling, including EMR, BigQuery, Airflow/Composer.
M.S. - Engineering, University of Virginia, Aug 2018
B.S. - Engineering, University of Virginia, May 2016