Donghyun Son

I’m an undergraduate student majoring Computer Science and Engineering at Seoul National University, and currently an exchange student at UT Austin. Here, I’m fortunate to work with Saurabh Agarwal for an efficient agentic system, advised by Aditya Akella. In SNU, I worked on quantization algorithms for KV cache, supervised by Sungjoo Yoo. Prior to that, I worked as a machine learning engineer at Hyperconnect (acquired by Match Group), where I built an ML-based content moderation system for Match Group brands.
I’m broadly interested in efficient algorithms for model training and inference. My previous works focus on data efficient methods including multiple subtasks approach, domain generalization, and few-shot personalization.
Recently, I’m interested in building an efficient LLM inference system. Particularly, I’m interested in the following topics.
- KV cache compression: quantization, eviction
- agentic systems: optimizing multi-agent systems
- compressing the reasoning path
Aside from research, I enjoy algorithmic problem solving and have competed in competitive programming contests such as ICPC, Google Hashcode, and SCPC. You can find me on codeforces and BOJ.
I’m currently looking for job or Ph.D. opportunities that are aligned with my interests. Feel free to reach me!
Links: Github / CV / Google Scholar / X
selected publications
- PreprintNSNQuant: A Double Normalization Approach for Calibration-Free Low-Bit Vector Quantization of KV Cache2025
- WSDM23 (Oral)Reliable decision from multiple subtasks through threshold optimization: Content moderation in the wildIn Proceedings of the Sixteenth ACM International Conference on Web Search and Data Mining, 2023
- OOD-CV@ICCV23Gradient estimation for unseen domain risk minimization with pre-trained modelsIn Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023