BK Lee

“Standing on the shoulders of giants.” — Isaac Newton

Research Interest

Building efficient, high-performing Vision-Language Models (VLMs), with focus on:

Distillation
RL
Agent
Pruning

Collaboration Requests & Job Applications

If your research interests align with mine and you already have a draft idea to discuss and develop together, feel free to reach out for collaboration. For university collaboration, alignment is the primary criterion. We are also actively looking for talented internship and full-time candidates. Preferred qualifications are strong first author publications at top-tier main-track conferences (not workshops) and deep expertise in one area; co-first author papers are discouraged for now. In my view, a strong profile means five-to-ten first author papers at top-tier venues (CVPR, ICCV, ECCV, NeurIPS, ICLR, ICML, ACL, EMNLP) in focused one area from deep expertise. I wouldn't consider other conferences and journals since I am not familiar with the others. If you meet these criteria, click "Open Gmail" to open a pre-filled draft, attach your resume, and send. Please keep it as a brief cold email.

University Collaboration with NVIDIA

NVIDIA Research Intern Application

NVIDIA Research Full-time Application

Work Experience

NVIDIA Research Scientist Oct. 2025 — Current

LEAD ZPPO: Zone of Proximal Policy Optimization [Completed] — RL post-training that keeps the teacher inside the prompt rather than the policy gradient, using BCQ/NCQ question reformulations and a prompt replay buffer to lift small students on hard questions without policy drift.
Under Review U.S. Provisional Patent Filed 2026
LEAD AXPO: Agent eXplorative Policy Optimization [Completed] — Agentic RL training that recovers tool usage through tool-call resampling, improving multimodal reasoning performance against larger baselines.
Under Review U.S. Provisional Patent Filed 2026 Internal Tech Transfer
LEAD Masking Teacher and Reinforcing Student [Completed] — Mask-progressive RL distillation that gradually unmasks teacher weights and uses offline RL with accuracy & distillation rewards.
CVPR 2026 Accept U.S. Non-Provisional Patent Filed 2026
LEAD Unified RL & Imitation Learning for VLMs [Completed] — Combines RL with adversarial imitation, using an LLM-based discriminator and multi-teacher guidance to build lightweight yet powerful VLMs.
NeurIPS 2025 Accept U.S. Patent Upgraded to Non-Provisional 2025
LEAD GenRecal [Completed] — Cross-architecture VLM distillation via a Recalibrator that aligns heterogeneous token representations regardless of vocabulary, token splits, or index ordering.
ECCV 2026 Accept U.S. Patent Upgraded to Non-Provisional 2025 Internal Tech Transfer
LEAD VLsI: Verbalized Layers-to-Interactions [Completed] — Hardened the verbalizer-based layer-wise distillation into a production-ready recipe and led its internal tech transfer into NVIDIA's VLM development pipeline.
CVPR 2025 Accept U.S. Patent Upgraded to Non-Provisional 2025 Internal Tech Transfer

NVIDIA Research Intern Oct. 2024 — Oct. 2025

LEAD Unified RL & Imitation Learning for VLMs [Initiated] — First formulation of the RL + adversarial imitation training pipeline; initial experiments and team setup.
U.S. Provisional Patent Filed 2025
LEAD GenRecal [Initiated] — Initial design of the Recalibrator framework for cross-tokenizer VLM distillation; first proof-of-concept and internal demo.
U.S. Provisional Patent Filed 2025
LEAD VLsI: Verbalized Layers-to-Interactions [Initiated] — Layer-wise distillation using intermediate verbalizers, enabling small VLMs (2B/7B) to align with large VLMs' reasoning progression and outperform GPT-4V.
CVPR 2025 Accept U.S. Provisional Patent Filed 2025

Education

KAIST Mar. 2020 — Aug. 2025

Ph.D., School of Electrical Engineering GPA 3.77 / 4.3

Dissertation: Building High-performing, Efficient-size Vision Language Models: Merge, Modify, and Distill [Link] [Degree Certificate]

KAIST Mar. 2018 — Feb. 2020

M.S., The Cho Chun Sik Graduate School of Green Transportation GPA 3.72 / 4.3

Thesis: Training Encoder-Attention through Fully-Connected CRFs for Efficient End-to-End Lane Detection Model [Link] [Degree Certificate]

Hanyang University Mar. 2014 — Feb. 2018

B.S., Mathematics and Electronic Engineering GPA 3.86 / 4.5

Thesis: Learning Rank of Evaluated Songs for Classifying Unknown Music by Convolutional Neural Network [Link] [Degree Certificate]

Publications

Overall	18 Accepts · 4 Pending · 2 Tech Reports
CV	5 CVPR · 2 ICCV · 3 ECCV · 1 ICIP
ML	3 NeurIPS · 1 ICLR
NLP	1 ACL · 1 EMNLP
Journal	1 Pattern Recognition

Publication Profile

79% 21%

First & Lead Author 19 / 24 = 79%
Co-Author 5 / 24 = 21%

Zone of Proximal Policy Optimization: Teacher in Prompts, Not Gradients

Byung-Kwan Lee, Ximing Lu, Shizhe Diao, Minki Kang, Saurav Muralidharan, Karan Sapra, Andrew Tao, Pavlo Molchanov, Yejin Choi, Yu-Chiang Frank Wang, Ryo Hachiuma

Under Review [Paper][Project]

U.S. Patent Application Filed (Provisional), NVIDIA Research, 2026
SpatialClaw: Rethinking Action Interface for Agentic Spatial Reasoning

Seokju Cho, Ryo Hachiuma, Abhishek Badki, Hang Su, Byung-Kwan Lee, Chan Hee Song, Sifei Liu, Subhashree Radhakrishnan, Seungryong Kim, Yu-Chiang Frank Wang, Min-Hung Chen

Under Review [Paper][Project][Code]

U.S. Patent Application Filed (Provisional), NVIDIA Research, 2026
Agent Explorative Policy Optimization for Multimodal Agentic Reasoning

Minki Kang, Shizhe Diao, Ryo Hachiuma, Sung Ju Hwang, Pavlo Molchanov, Yu-Chiang Frank Wang, Byung-Kwan Lee

Under Review [Paper][Project]

U.S. Patent Application Filed (Provisional), NVIDIA Research, 2026
Hide to See: Reasoning-prefix Masking for Visual-anchored Thinking in VLM Distillation

Seonghoon Yu, Dongjun Nam, Byung-Kwan Lee†, Jeany Son†

Under Review [Paper][Code]
Why and When Visual Token Pruning Fails? A Study on Relevant Visual Information Shift in MLLMs Decoding

Jiwan Kim, Kibum Kim, Wonjoong Kim, Byung-Kwan Lee, Chanyoung Park

European Conference on Computer Vision (ECCV), 2026 [Paper][Project]
GenRecal: Generation after Recalibration from Large to Small Vision Language Models

Byung-Kwan Lee, Ryo Hachiuma, Yong Man Ro, Yu-Chiang Frank Wang, Yueh-Hua Wu

European Conference on Computer Vision (ECCV), 2026 [Paper][Project]

U.S. Patent Application Filed (Non-Provisional), NVIDIA Research, 2025
Masking Teacher and Reinforcing Student for Distilling Vision-Language Models

Byung-Kwan Lee, Yu-Chiang Frank Wang, Ryo Hachiuma

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2026 [Paper]

U.S. Patent Application Filed (Non-Provisional), NVIDIA Research, 2026
Recursive Think-Answer Process for LLMs and VLMs

Byung-Kwan Lee*, Youngchae Chee*, Yong Man Ro

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Findings, 2026 [Paper][Project]
RefineBench: Evaluating Refinement Capability in Language Models

Young-Jun Lee*, Seungone Kim*, Byung-Kwan Lee, Minkyeong Moon, Yechan Hwang, Jong Myoung Kim, Graham Neubig, Sean Welleck, Ho-Jin Choi

International Conference on Learning Representations (ICLR), 2026 [Paper][Project]

Best Runner-Up Award (Oral, Top 1%), Multi-Turn Interactions in LLMs Workshop @ NeurIPS 2025 [Link]
Unified Reinforcement and Imitation Learning for Vision-Language Models

Byung-Kwan Lee, Ryo Hachiuma, Yong Man Ro, Yu-Chiang Frank Wang, Yueh-Hua Wu

Neural Information Processing Systems (NeurIPS), 2025 [Paper][Project]

U.S. Patent Application Filed (Non-Provisional), NVIDIA Research, 2025
MultiVerse: A Multi-Turn Conversation Benchmark for Evaluating Large Vision and Language Models

Young-Jun Lee, Byung-Kwan Lee, Jianshu Zhang, Yechan Hwang, Byungsoo Ko, Han-Gyu Kim, Dongyu Yao, Xuankun Rong, Eojin Joo, Seung-Ho Han, Bowon Ko, Ho-Jin Choi

IEEE/CVF International Conference on Computer Vision (ICCV), 2025 [Paper][Project]

Workshop for Knowledge-Intensive Multimodal Reasoning, ICCV 2025 [Link]
VLsI: Verbalized Layers-to-Interactions from Large to Small Vision Language Models

Byung-Kwan Lee, Ryo Hachiuma, Yu-Chiang Frank Wang, Yong Man Ro†, Yueh-Hua Wu†

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2025 [Paper][Project]

U.S. Patent Application Filed (Non-Provisional), NVIDIA Research, 2025
Phantom of Latent for Large Language and Vision Models

Byung-Kwan Lee, Sangyun Chung, Chae Won Kim, Beomchan Park, Yong Man Ro

Technical Report [Paper][Code][HF Model]
TroL: Traversal of Layers for Large Language and Vision Models

Byung-Kwan Lee, Sangyun Chung, Chae Won Kim, Beomchan Park, Yong Man Ro

Empirical Methods in Natural Language Processing (EMNLP), 2024 [Paper][Code][HF Model]
Meteor: Mamba-based Traversal of Rationale for Large Language and Vision Models

Byung-Kwan Lee, Chae Won Kim, Beomchan Park, Yong Man Ro

Neural Information Processing Systems (NeurIPS), 2024 [Paper][Code][HF Model]

31st Samsung HumanTech Paper Awards in Computer Science & Engineering
MoAI: Mixture of All Intelligence for Large Language and Vision Models

Byung-Kwan Lee, Beomchan Park, Chae Won Kim, Yong Man Ro

European Conference on Computer Vision (ECCV), 2024 [Paper][Code][HF Model]
CoLLaVO: Crayon Large Language and Vision mOdel

Byung-Kwan Lee, Beomchan Park, Chae Won Kim, Yong Man Ro

Findings of the Association for Computational Linguistics (ACL), 2024 [Paper][Code][HF Model]

2024 KCC XAI Workshop, Best Paper Awards
Causal Unsupervised Semantic Segmentation

Junho Kim*, Byung-Kwan Lee*, Yong Man Ro

Journal of Pattern Recognition [Paper][Code]
Mitigating Adversarial Vulnerability through Causal Parameter Estimation by Adversarial Double Machine Learning

Byung-Kwan Lee*, Junho Kim*, Yong Man Ro

IEEE/CVF International Conference on Computer Vision (ICCV), 2023 [Paper][Code]
Mitigating Dataset Bias in Image Captioning through CLIP Confounder-free Captioning Network

YeonJu Kim, Junho Kim, Byung-Kwan Lee, Sebin Shin, Yong Man Ro

IEEE International Conference on Image Processing (ICIP), 2023 [Paper][Code]
Demystifying Causal Features on Adversarial Examples and Causal Inoculation for Robust Network by Adversarial Instrumental Variable Regression

Junho Kim*, Byung-Kwan Lee*, Yong Man Ro

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2023 [Paper][Code]
Masking Adversarial Damage: Finding Adversarial Saliency for Robust and Sparse Network

Byung-Kwan Lee*, Junho Kim*, Yong Man Ro

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022 [Paper][Code]
Distilling Robust and Non-Robust Features in Adversarial Examples by Information Bottleneck

Junho Kim*, Byung-Kwan Lee*, Yong Man Ro

Neural Information Processing Systems (NeurIPS), 2021 [Paper][Code]
Towards Adversarial Robustness of Bayesian Neural Network through Hierarchical Variational Inference

Byung-Kwan Lee, Youngjoon Yu, Yong Man Ro

Technical Report [Paper][Code]

Reviewer Experience

Journal

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)
IEEE Transactions on Neural Networks and Learning Systems (TNNLS)
IEEE Transactions on Circuits and Systems for Video Technology (TCSVT)

Conference

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
IEEE/CVF International Conference on Computer Vision (ICCV)
European Conference on Computer Vision (ECCV)
International Conference on Learning Representations (ICLR)
Neural Information Processing Systems (NeurIPS)
International Conference on Machine Learning (ICML)
AAAI Conference on Artificial Intelligence (AAAI)
Association for Computational Linguistic (ACL)
Empirical Methods in Natural Language Processing (EMNLP)

Invited Talks & Awards

Silver Reviewer Award, International Conference on Machine Learning (ICML)2026
Best Runner-Up Award (Oral, Top 1%), Multi-Turn Interactions in LLMs Workshop @ NeurIPS2025
Invited Talk at Kongju University2025
Invited Talk at Hanyang University ERICA2025
Invited Talk at Kookmin University2025
Invited Talk at Dongseo University2025
31st Samsung HumanTech Paper Awards in Computer Science & Engineering2025
KCC XAI Workshop, Best Paper Awards2024
Invited Talk at Korea Institute of Science and Technology Information (KISTI)2024
Invited Talk at NAVER HyperCLOVA X2024
Invited Talk at Kakao Brain2024
1st Award for Research Presentation, Center for Applied Research in AI (CARAI)2023
GIRE Research Mentor, Seoul International School — "MUsE", Journal of Student Research [Publication]2023
KAIST SW IT Academy, AI Project Director of 1st Award Topic2023
Invitation to Presenter in Korean Conference on Computer Vision (KCCV)2023
KAIST SW IT Academy, Lecturer of Data Structure & Algorithm, Deep Learning, Computer Vision2023
Invitation to Presenter in Korean Conference on Computer Vision (KCCV)2022
2nd Award for Research Presentation, Center for Applied Research in AI (CARAI)2022
KAIST FellowshipMar. 2020 — Feb. 2024
National Government FellowshipMar. 2018 — Feb. 2020
1st President's Award for International Autonomous Vehicle Competition, Team Leader [YouTube][Talk]2018

NVIDIA Stock

Current Price & Today Change TradingView