Jihwan Oh

Researcher, Ph.D. Student in KAIST AI

ericoh92920 [AT] gmail.com

About

I am a Ph.D. candidate in Artificial Intelligence at KAIST, advised by Prof. Se-Young Yun. My research focuses on large language model agents, with a particular interest in social intelligence, multi-agent interaction, and post-training methods for making LLM agents more robust, reliable, and socially capable.

More broadly, I am interested in understanding how LLM agents acquire social behaviors through interaction, and how training methods such as reinforcement learning and verifiable rewards can improve their reasoning, alignment, and role-adaptive behavior.

Publications * Equal contribution, † Corresponding author

MERIT Feedback Elicits Better Bargaining in LLM Negotiators

Jihwan Oh, Murad Aghazada, Yooju Shin, Se-Young Yun, Taehyeon Kim

ACL 2026

From Belief Entrenchment to Robust Reasoning in LLM Agents

Jihwan Oh*, Minchan Jeong*, Jongwoo Ko, Se-Young Yun

TACL 2026 (To be presented at ACL 2026)

Dual Preference Learning for Multi-Agent Reinforcement Learning

Sehyeok Kang, Minu Kim, Jihwan Oh, Se-Young Yun

IEEE Access 2025

Preference Alignment with Flow Matching

Minu Kim*, Yongsik Lee*, Sehyeok Kang, Jihwan Oh, Song Chong, Se-Young Yun

Neurips 2024

Toward Risk-based Optimistic Exploration for Cooperative Multi-Agent Reinforcement Learning

Jihwan Oh*, Joonkee Kim*, Minchan Jeong, Se-Young Yun

AAMAS 2023

The StarCraft Multi-Agent Exploration Challenges: Learning Multi-Stage Tasks and Environmental Factors without Precise Reward Functions

Mingyu Kim*, Jihwan Oh*, Yongsik Lee, Joonkee Kim, Seonghwan Kim, Song Chong, Se-Young Yun

IEEE Access 2023

MERIT Feedback Elicits Better Bargaining in LLM Negotiators

Jihwan Oh, Murad Aghazada, Yooju Shin, Se-Young Yun, Taehyeon Kim

ACL 2026

PerMix-RLVR: Preserving Persona Expressivity under Verifiable-Reward Alignment

Jihwan Oh*, Soowon Oh*, Murad Aghazada, Minchan Jeong, MyeongSeok Kang, Sungnyun Kim, Se-Young Yun

Preprint / AAAI 2026 DAI Workshop

Bridging the Gap between Theory of Mind and Action in LLMs

Sehyeok Kang, Jihwan Oh, Se-Young Yun

ICLR 2026 AIWILD Workshop

From Belief Entrenchment to Robust Reasoning in LLM Agents

Jihwan Oh*, Minchan Jeong*, Jongwoo Ko, Se-Young Yun

TACL 2026 (To be presented at ACL 2026)

Dual Preference Learning for Multi-Agent Reinforcement Learning

Sehyeok Kang, Minu Kim, Jihwan Oh, Se-Young Yun

IEEE Access 2025

Preference Alignment with Flow Matching

Minu Kim*, Yongsik Lee*, Sehyeok Kang, Jihwan Oh, Song Chong, Se-Young Yun

Neurips 2024

Diffusion-based Episodes Augmentation for Offline Multi-Agent Reinforcement Learning

Jihwan Oh, Sungnyun Kim, Gahee Kim, Sunghwan Kim, Se-Young Yun

ICML 2024 SPIGM Workshop

Toward Risk-based Optimistic Exploration for Cooperative Multi-Agent Reinforcement Learning

Jihwan Oh*, Joonkee Kim*, Minchan Jeong, Se-Young Yun

AAMAS 2023

The StarCraft Multi-Agent Exploration Challenges: Learning Multi-Stage Tasks and Environmental Factors without Precise Reward Functions

Mingyu Kim*, Jihwan Oh*, Yongsik Lee, Joonkee Kim, Seonghwan Kim, Song Chong, Se-Young Yun

IEEE Access 2023

Education

Korea Advanced Institute of Science and Technology (KAIST) Mar. 2024 - Present

Ph.D. Candidate in Artificial Intelligence

Korea Advanced Institute of Science and Technology (KAIST) Mar. 2021 - Feb. 2023

M.S. in Artificial Intelligence

Academic Services

Neurips Benchmark & Dataset Track 2022, ICML 2025, EACL 2026, ACL 2026, COLM 2026 Reviewer,

Honors & Awards

-->