I am a PhD student in ECE at the University of Illinois Urbana-Champaign, advised by Prof. Jian Huang in Illinois PlatformX. My research explores hardware/software co-design techniques for building efficient ecosystems for Neural Processing Units (NPUs), such as TPUs. Although the NPU hardware architecture provides massive computational power for machine learning (ML) workloads, their real-world performance and efficiency depend critically on the surrounding ecosystem. My work aims to define the hardware/software interfaces and abstractions that best balance complexity, expressiveness, performance, and efficiency by determining which optimizations belong in hardware, which belong in software, and which require coordinated support across both. I build the hardware mechanisms and software systems to make NPUs faster, more cost-effective, and more energy-efficient.
Beyond NPUs, I am also interested in other ML accelerators such as GPUs and inter-core connected AI chips (i.e., Graphcore IPUs, Tenstorrent). I am also exploring how to leverage AI/LLM agents to develop the NPU ecosystem.
* Co-primary authors.