I am a Ph.D. candidate in Artificial Intelligence at KAIST, advised by Prof. Se-Young Yun. My research focuses on large language model agents, with a particular interest in social intelligence, multi-agent interaction, and post-training methods for making LLM agents more robust, reliable, and socially capable.
More broadly, I am interested in understanding how LLM agents acquire social behaviors through interaction, and how training methods such as reinforcement learning and verifiable rewards can improve their reasoning, alignment, and role-adaptive behavior.
MERIT Feedback Elicits Better Bargaining in LLM Negotiators
Jihwan Oh, Murad Aghazada, Yooju Shin, Se-Young Yun†, Taehyeon Kim†
ACL 2026
From Belief Entrenchment to Robust Reasoning in LLM Agents
Jihwan Oh*, Minchan Jeong*, Jongwoo Ko†, Se-Young Yun†
TACL 2026 (To be presented at ACL 2026)
Dual Preference Learning for Multi-Agent Reinforcement Learning
Sehyeok Kang, Minu Kim, Jihwan Oh, Se-Young Yun†
IEEE Access 2025
Preference Alignment with Flow Matching
Minu Kim*, Yongsik Lee*, Sehyeok Kang, Jihwan Oh, Song Chong, Se-Young Yun†
Neurips 2024
Toward Risk-based Optimistic Exploration for Cooperative Multi-Agent Reinforcement Learning
Jihwan Oh*, Joonkee Kim*, Minchan Jeong, Se-Young Yun†
AAMAS 2023
The StarCraft Multi-Agent Exploration Challenges: Learning Multi-Stage Tasks and Environmental Factors without Precise Reward Functions
Mingyu Kim*, Jihwan Oh*, Yongsik Lee, Joonkee Kim, Seonghwan Kim, Song Chong, Se-Young Yun†
IEEE Access 2023
MERIT Feedback Elicits Better Bargaining in LLM Negotiators
Jihwan Oh, Murad Aghazada, Yooju Shin, Se-Young Yun†, Taehyeon Kim†
ACL 2026
PerMix-RLVR: Preserving Persona Expressivity under Verifiable-Reward Alignment
Jihwan Oh*, Soowon Oh*, Murad Aghazada, Minchan Jeong, MyeongSeok Kang, Sungnyun Kim†, Se-Young Yun†
Preprint / AAAI 2026 DAI Workshop
Bridging the Gap between Theory of Mind and Action in LLMs
Sehyeok Kang, Jihwan Oh, Se-Young Yun†
ICLR 2026 AIWILD Workshop
From Belief Entrenchment to Robust Reasoning in LLM Agents
Jihwan Oh*, Minchan Jeong*, Jongwoo Ko†, Se-Young Yun†
TACL 2026 (To be presented at ACL 2026)
Dual Preference Learning for Multi-Agent Reinforcement Learning
Sehyeok Kang, Minu Kim, Jihwan Oh, Se-Young Yun†
IEEE Access 2025
Preference Alignment with Flow Matching
Minu Kim*, Yongsik Lee*, Sehyeok Kang, Jihwan Oh, Song Chong, Se-Young Yun†
Neurips 2024
Diffusion-based Episodes Augmentation for Offline Multi-Agent Reinforcement Learning
Jihwan Oh, Sungnyun Kim, Gahee Kim, Sunghwan Kim, Se-Young Yun†
ICML 2024 SPIGM Workshop
Toward Risk-based Optimistic Exploration for Cooperative Multi-Agent Reinforcement Learning
Jihwan Oh*, Joonkee Kim*, Minchan Jeong, Se-Young Yun†
AAMAS 2023
The StarCraft Multi-Agent Exploration Challenges: Learning Multi-Stage Tasks and Environmental Factors without Precise Reward Functions
Mingyu Kim*, Jihwan Oh*, Yongsik Lee, Joonkee Kim, Seonghwan Kim, Song Chong, Se-Young Yun†
IEEE Access 2023
Korea Advanced Institute of Science and Technology (KAIST) Mar. 2024 - Present
Ph.D. Candidate in Artificial Intelligence
Korea Advanced Institute of Science and Technology (KAIST) Mar. 2021 - Feb. 2023
M.S. in Artificial Intelligence
Neurips Benchmark & Dataset Track 2022, ICML 2025, EACL 2026, ACL 2026, COLM 2026 Reviewer,