Research Assistant — SIUE
Feb 2026 – PresentEdwardsville, IL, USA
- Researched and applied uniform and quadratic quantization techniques to compress large language models and diffusion models, reducing model size while preserving generation quality.
- Implemented model retraining and fine-tuning pipelines using PyTorch and Bash to optimize quantized models, ensuring high performance and accuracy across various downstream benchmarks.