Jae-Won Chung

Ph.D. Candidate @ UMich CSE

Summary

I'm a fifth year PhD candidate in CSE at the University of Michigan. I build efficient software systems for deep learning, with a recent focus on the efficient management of not only time, but also energy.

I view power and energy as fundamental systems resources that are worth carefully optimizing and allocating, not only in hardware, but also from software. Doing so provides automatic downstream benefits, such as reducing operational expenses, alleviating power delivery pressure for datacenters, and allowing the hardware to truly max out on performance.

I lead the ML.ENERGY initiative as part of my research and open-source efforts. I am fortunate to be advised by Professor Mosharaf Chowdhury and be part of SymbioticLab.

Selected Publications

The ML.ENERGY Benchmark: Toward Automated Inference Energy Measurement and Optimization

NeurIPS D&B spotlight, 2025 (spotlight acceptance rate = 2.81%)

Jae-Won Chung, Jeff J. Ma, Ruofan Wu, Jiachen Liu, Oh Jun Kweon, Yuxuan Xia, Zhiyu Wu, Mosharaf Chowdhury

Perseus: Reducing Energy Bloat in Large Model Training

SOSP, 2024 (Acceptance rate = 17.34%)

Jae-Won Chung, Yile Gu, Insu Jang, Luoxi Meng, Nikhil Bansal, Mosharaf Chowdhury

Toward Cross-Layer Energy Optimizations in AI Systems

DOE ASCR Energy-Efficient Computing for Science Workshop, 2024

Jae-Won Chung, Nishil Talati, and Mosharaf Chowdhury

Andes: Defining and Enhancing Quality-of-Experience in LLM-Based Text Streaming Services

Preprint, 2024

Jiachen Liu, Jae-Won Chung, Zhiyu Wu, Fan Lai, Myungjin Lee, Mosharaf Chowdhury

Zeus: Understanding and Optimizing GPU Energy Consumption of DNN Training

USENIX NSDI, 2023 (Acceptance rate = 18.38%)

Jie You*, Jae-Won Chung*, Mosharaf Chowdhury (* Equal Contribution)

Experience

Graduate Student Research Assistant

Sep 2021 - May 2027 (expected)

Advisor: Prof. Mosharaf Chowdhury

Building software systems for machine learning that treat power and energy as first-class systems resources. I created Zeus, the first energy optimization system for DNN training on GPUs. Zeus is a PyTorch ecosystem project and serves as the bedrock for the ML.ENERGY Benchmark & Leaderboard, the first energy benchmark for LLM inference, and Perseus, a large model training energy optimizer that reduces energy consumption by up to 30% without training slowdown.

Keywords:

  • MLSys
  • Energy
  • LLM
  • Training
  • Inference
  • Open-Source

Research Scientist Intern

May 2025 - Aug 2025

MoE (Mixture-of-Experts) training support on MTIA platforms. Fixed issues and closed gaps across the entire stack, including MTIA kernels, PyTorch MTIA backend, collective communication, and Meta's internal large model training framework.

Keywords:

  • MLSys
  • LLM
  • Training

Research Intern

Mar 2020 - Jun 2021

Advisor: Prof. Byung-Gon Chun

Developed Crane, a GPU cluster manager for elastic AutoML jobs. Wrote components for automatic cluster bootstrapping on Docker Swarm and enabled full operation on top of Kubernetes. Worked on efficient AutoML scheduling policies on GPU clusters.

Keywords:

  • MLSys
  • AutoML
  • Training
  • Cluster Management
  • Scheduling
  • Open-Source
Dec 2019 - Jun 2020

Advisor: Prof. Soo-Mook Moon

Created ShadowTutor, a server-client collaborative DNN inference system that distills knowledge from a server-side large DNN to a small DNN on the client in an online fashion.

Keywords:

  • MLSys
  • Inference
  • Knowledge Distillation

Research Intern

Jun 2019 - Dec 2019

Advisor: Prof. Kyoung Mu Lee

Worked on finding better meta-initialization points for Model-Agnostic Meta-Learning (MAML) using LSTM-based neural memory modules. Also worked on embedding images of the same class into a single class embedding vector and augmenting MAML with self-attention scores derived from class embeddings.

Keywords:

  • ML
  • Computer Vision
  • Meta-Learning
  • Few-Shot Classification
  • Optimization
Jun 2019 - Aug 2019

Advisor: Prof. Jongho Lee

Designed and implemented CAD-QSMNet, a full deep learning pipeline for Quantitative Susceptibility Mapping (QSM) for brain MRI images, including a new U-Net variant model.

Keywords:

  • ML
  • Computer Vision
  • Medical Imaging
  • Data Engineering

Open-Source Projects

Number of stars and forks are as of October 4th, 2025.
BERT4Rec-VAE-Pytorch (395 94)

Implementation of BERT4Rec and Netflix VAE recommendation models.

  • Python
  • PyTorch
  • RecSys
  • |
  • GitHub
Reason (194 4)

A shell for research papers. Supports UNIX-like commands that instead work on a set of research papers.

Pegasus (31 3)

An SSH command runner with a focus on simplicity. Useful when you have a bunch of commands to run and a bunch of SSH nodes available.

Selected Talks

Energy and Power as First-Class ML Design Metrics

NeurIPS tutorial | December 2025

  • Energy
  • MLSys
Energy and Power as First-Class ML Design Metrics

UW-Madison madSystems seminar | October 2025

  • Energy
  • MLSys
How to Create Mediocre Open-Source Repositories

University of Michigan (SymbioticLab lunch seminar) | August 2024

Energy-Efficient Software Systems for Machine Learning

Seoul National University | October 2023

  • Energy
  • MLSys
Energy-Efficient Deep Learning with PyTorch and Zeus

PyTorch Conference | October 2023

Energy-Efficient Deep Learning with Zeus

Massachusetts Institute of Technology | September 2023

  • Energy
  • MLSys
Memory Plus Meta-Learning

Deepest | August 2019

Education

  • PhD, Computer Science and Engineering
    (In progress)
    University of Michigan
    Sep 2021 - May 2027 (expected)
  • MS, Computer Science and Engineering
    University of Michigan
    Sep 2021 - Apr 2023
  • BS, Electrical and Computer Engineering
    Summa cum laude
    Seoul National University, South Korea
    Mar 2015 - Aug 2021

Technical Proficiency

Languages

  • Python
  • Rust
  • Go, C++, CUDA, Verilog
  • Zig, JavaScript

Tools and Frameworks

  • FastAPI, Mkdocs, Pandas, NumPy
  • PyTorch, Kubernetes, LaTeX

Others

  • Commandline
  • Neovim
  • GitHub
  • Open-Source
  • Documentation

Honors & Awards

  • Second Best Solution in Carbon Hack '22
    $25,000 prize with Chase.
  • Kwanjeong Overseas Scholarship
    $25,000 awarded.
  • Best Tutor Award
    SNU computer architecture, Fall 2020.
  • Kwanjeong Undergraduate Scholarship
    $20,000 awarded over two years.

Teaching

  • Graduate Student Instructor. Three lectures on GenAI and Systems for GenAI fundamentals.
    Fall 2025
  • Undergrad Operating Systems
    Lead TA. Linux kernel lectures, four Linux-based term projects, and team design reviews.
    Spring 2021
  • Undergrad Computer Architecture
    Peer tutor. Gave 30 hours of online lecture. Best tutor award!
    Fall 2020

Community Service

English Proficiency

Interests

  • Software Systems
  • Deep Learning
  • Fingerstyle Guitar