> Curriculum Vitae

v1.2.5 - Q4 2025 | Download PDF


Education

Doctor of Philosophy in Computer Science
Aug 2017 | University of Colorado, Boulder, CO
  • Dissertation Title: Hardware Awareness for the Selection of Optimal Iterative Linear Solvers
Master of Science in Computer Science
May 2013 | University of Colorado, Boulder, CO
Bachelor of Science in Computer Science, minor in Mathematics
May 2011 | University of Arkansas, Fayetteville, AR

Work & Research Experience

AI Performance Engineer, TPU Inference @ Google
May 2023 – Present | Seattle, WA
  • Architected and implemented Paged Attention for MaxText from design to merge, resolving complex concurrency bugs and achieving a 47% throughput increase in microbenchmarks.
  • Developed custom Pallas kernels for 2D block-wise and sub-channel quantization, enabling advanced precision support within the TPU-Inference framework.
  • Built model accuracy benchmarking framework for tracking long-term improvements and regressions of TPU-Inference models.
Machine Learning Engineer, Web-based QA @ Amazon (Alexa AI)
Oct 2021 – May 2023 | Seattle, WA
  • Reduced overall latency of our model by 6x compared to the model’s baseline on GPUs.
  • Responsible for performance analysis and optimization of a transformer-based research DL model.
  • Created codebase compiling our model to ONNX, TensorRT, and Inferentia formats and ran associated benchmarking.
Research Engineer @ Amazon Web Services (AWS HPC)
Mar 2020 – May 2021 | Seattle, WA
  • Created automated performance regression testing system for AWS HPC infrastructure.
  • Benchmarked prototype EC2 instances to identify optimal hardware configurations for future HPC endpoints.
Research Engineer, ASR @ Amazon Alexa
Mar 2018 – Mar 2020 | Seattle, WA
  • Created a new team to maintain an in-house deep learning framework and address organizational needs.
  • Optimized distributed programming workflows using C++, Python, CUDA, TensorFlow, and MxNet.
Software Development Engineer, Mobile Hub @ Amazon Web Services
Aug 2017 – Mar 2018 | Seattle, WA
  • Built Java-based backend services and data collection pipelines for customer usage analytics.
Doctoral Research Assistant @ University of Colorado, Lighthouse Project
Aug 2014 – Aug 2017 | Boulder, CO
  • Utilized runtime performance data from supercomputers to train ML algorithms that predict optimal iterative linear solvers (C++, Trilinos, Python).
Computation Intern @ Lawrence Livermore National Laboratory
Summers 2014, 2015 | Livermore, CA
  • Optimized BLAST hydrodynamics code for future architectures and developed benchmarking suites for high-performance linear algebra libraries using HPCToolkit.
Software Developer/Researcher @ TerraSpark Geosciences
Aug 2011 – Jan 2014 | Boulder, CO
  • Implemented GPU-based seismic interpretation solutions using OpenCL, reducing processing time from hours to seconds compared to original CPU code.

Skills

Patents & Selected Publications