“Standing on the shoulders of giants.” — Isaac Newton
Research Interest
Building efficient, high-performing Vision-Language Models (VLMs), with focus on:
- Distillation / RL
- Agent
- Pruning
- Compression
- Data / Training / Inference Efficiency
Collaboration Requests & Job Applications
If your research interests align with mine and you already have a draft idea to discuss and develop together, feel free to reach out for collaboration. For university collaboration, alignment is the primary criterion. We are also actively looking for talented internship and full-time candidates. Preferred qualifications are strong first or co-first author publications at top-tier main-track conferences (not workshops) and deep expertise in one area. In my view, a strong profile means five-to-ten first or co-first author papers at top-tier venues (CVPR, ICCV, ECCV, NeurIPS, ICLR, ICML, ACL, EMNLP). I wouldn't consider other conferences and journals since I am not familiar with the others. If you meet these criteria, click "Open Gmail" to open a pre-filled draft, attach your resume, and send. Please keep it as a brief cold email.
NVIDIA Research Intern Application
NVIDIA Research Full-time Application
Work Experience
-
LEAD
AXPO: Agent eXplorative Policy Optimization
[Completed]
—
Agentic RL training that recovers tool usage through tool-call resampling, improving multimodal reasoning performance against larger baselines.
Under Review U.S. Patent Application 2026 ArXiv Release Internal Release
-
LEAD
Masking Teacher and Reinforcing Student
[Completed]
—
Mask-progressive RL distillation that gradually unmasks teacher weights and uses offline RL with accuracy & distillation rewards.
CVPR 2026 Accept U.S. Non-Provisional Patent Filed 2026 ArXiv Release Internal Release
-
LEAD
GenRecal
[Completed]
—
Cross-architecture VLM distillation via a Recalibrator that aligns heterogeneous token representations regardless of vocabulary, token splits, or index ordering.
Under Review U.S. Patent Upgraded to Non-Provisional 2025 Internal Tech Transfer
-
LEAD
Unified RL & Imitation Learning for VLMs
[Completed]
—
Combines RL with adversarial imitation, using an LLM-based discriminator and multi-teacher guidance to build lightweight yet powerful VLMs.
NeurIPS 2025 Accept U.S. Patent Upgraded to Non-Provisional 2025 ArXiv Release Internal Release
-
LEAD
VLsI: Verbalized Layers-to-Interactions
[Completed]
—
Layer-wise distillation using intermediate verbalizers, enabling small VLMs (2B/7B) to align with large VLMs' reasoning progression and outperform GPT-4V.
CVPR 2025 Accept U.S. Non-Provisional Patent Filed 2025 ArXiv Release Internal Release Internal Tech Transfer
-
LEAD
GenRecal
[Initiated]
—
Initial design of the Recalibrator framework for cross-tokenizer VLM distillation; first proof-of-concept and internal demo.
U.S. Provisional Patent Filed 2025 ArXiv Release Internal Release
-
LEAD
Unified RL & Imitation Learning for VLMs
[Initiated]
—
First formulation of the RL + adversarial imitation training pipeline; initial experiments and team setup.
U.S. Provisional Patent Filed 2025
Education
Publications
| Overall | 16 Accepts · 4 Pending · 2 Tech Reports |
| Computer Vision | 5 CVPR · 2 ICCV · 1 ECCV · 1 ICIP |
| Machine Learning | 3 NeurIPS · 1 ICLR |
| NLP | 1 ACL · 1 EMNLP |
| Journal | 1 Pattern Recognition |
- First & Lead Author 18 / 22 = 82%
- Co-Author 4 / 22 = 18%
-
"Agent Explorative Policy Optimization for Multimodal Agentic Reasoning"Under Review [Paper (Coming Soon)] [Project]U.S. Patent Application, NVIDIA Research, 2026 -

-

-

-
"Masking Teacher and Reinforcing Student for Distilling Vision-Language Models"Computer Vision and Pattern Recognition (CVPR), 2026 [Paper]U.S. Patent Application Filed (Non-Provisional), NVIDIA Research, 2026 -

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

Reviewer Experience
Journal
- IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)
- IEEE Transactions on Neural Networks and Learning Systems (TNNLS)
- IEEE Transactions on Circuits and Systems for Video Technology (TCSVT)
Conference
- Computer Vision and Pattern Recognition (CVPR)
- International Conference on Computer Vision (ICCV)
- European Conference on Computer Vision (ECCV)
- International Conference on Learning Representations (ICLR)
- Neural Information Processing Systems (NeurIPS)
- International Conference on Machine Learning (ICML)
- AAAI Conference on Artificial Intelligence (AAAI)
- Association for Computational Linguistic (ACL)
- Empirical Methods in Natural Language Processing (EMNLP)
Invited Talks & Awards
- Silver Reviewer Award, International Conference on Machine Learning (ICML)2026
- Best Runner-Up Award (Oral, Top 1%), Multi-Turn Interactions in LLMs Workshop @ NeurIPS2025
- Invited Talk at Kongju University2025
- Invited Talk at Hanyang University ERICA2025
- Invited Talk at Kookmin University2025
- Invited Talk at Dongseo University2025
- 31st Samsung HumanTech Paper Awards in Computer Science & Engineering2025
- KCC XAI Workshop, Best Paper Awards2024
- Invited Talk at Korea Institute of Science and Technology Information (KISTI)2024
- Invited Talk at NAVER HyperCLOVA X2024
- Invited Talk at Kakao Brain2024
- 1st Award for Research Presentation, Center for Applied Research in AI (CARAI)2023
- GIRE Research Mentor, Seoul International School — "MUsE", Journal of Student Research [Publication]2023
- KAIST SW IT Academy, AI Project Director of 1st Award Topic2023
- Invitation to Presenter in Korean Conference on Computer Vision (KCCV)2023
- KAIST SW IT Academy, Lecturer of Data Structure & Algorithm, Deep Learning, Computer Vision2023
- Invitation to Presenter in Korean Conference on Computer Vision (KCCV)2022
- 2nd Award for Research Presentation, Center for Applied Research in AI (CARAI)2022
- KAIST FellowshipMar. 2020 — Feb. 2024
- National Government FellowshipMar. 2018 — Feb. 2020
- 1st President's Award for International Autonomous Vehicle Competition, Team Leader [YouTube][Talk]2018