Home

Bio

I am a first year Ph.D student in the Department of Computer Science and Engineering at Seoul National University, advised by Jaesik Park. I am also an incoming research scientist intern at Adobe Research, hosted by Duygu Ceylan and Tuanfeng Y. Wang. Before that, I was a research scientist intern at SONY Research, hosted by Takashi Shibuya and Masato Ishii. My research interests lie at the intersection of computer vision and machine learning, with a focus on generative models and multimodal understanding.

Full bio

I received my M.S. degree from Seoul National University, advised by Jaesik Park, and was honored with the Best Thesis Award. During my master's, I worked as a research scientist intern at Microsoft Research Asia with Chong Luo and Qi Dai.

I earned my B.S. in Electrical Engineering and Computer Science from Gwangju Institute of Science and Technology (GIST). During my bachelor's, I was fortunate to be advised by Jonghyun Choi at Seoul National University and Jeany Son at POSTECH. I was also a recipient of the Presidential Science Scholarship.

Latest News

  • 2026.06 Two papers about image and video editing and generation have been accepted to ECCV 2026.
  • 2026.04 A paper about 4D reconstruction has been accepted to SIGGRAPH 2026.
  • 2026.04 I will join Adobe Research this summer as a research scientist intern.

Latest Paper Releases

Vitæ

Full CV in PDF.

Adobe Research Jul. 2026 - Expected
Research Scientist Intern (Mentors: Duygu Ceylan, Tuanfeng Y. Wang)
Visual computing and generative modeling
SONY AI Sep. 2025 - Mar. 2026
Research Scientist Intern (Mentors: Takashi Shibuya, Masato Ishii)
Visual generative modeling and multimodal understanding
Seoul National University Sep. 2025 - Present
Ph.D. Student (Advisor: Jaesik Park)
Computer vision, generative models, and multimodal understanding
Microsoft Research Asia Sep. 2024 - Mar. 2025
Research Scientist Intern (Mentors: Chong Luo, Qi Dai)
Subject-driven customization and open-domain customized video generation
Seoul National University Sep. 2023 - Aug. 2025
M.S. in Artificial Intelligence (Advisor: Jaesik Park)
Best Master's Thesis Award
Gwangju Institute of Science and Technology (GIST)GIST Mar. 2019 - Aug. 2023
B.S. in Electrical Engineering and Computer Science
Magna Cum Laude
Full experience

News

Experience

Research Internships

Adobe Research
Research Scientist Intern at Adobe Research (Jul. 2026 ~ Expected)
Hosted by: Duygu Ceylan, Tuanfeng Y. Wang
Details

Upcoming research internship at Adobe Research, focusing on visual computing and generative modeling for controllable visual content creation.

SONY
Research Scientist Intern at SONY AI (Sep. 2025 ~ Mar. 2026)
Hosted by: Takashi Shibuya, Masato Ishii
Details

Working on visual generative modeling and multimodal understanding, with an emphasis on controllable video generation and evaluation. The internship connects model behavior analysis with practical generation pipelines for user-facing visual media applications.

Microsoft
Research Scientist Intern at Visual Computing Group (Sep. 2024 ~ Mar. 2025)
Hosted by: Chong Luo, Qi Dai
Details

Investigated video generative models, subject-driven customization, and practical systems for open-domain customized video generation. This work led to an ICASSP 2026 paper and the ECCV 2026 paper, Learning Zero-Shot Subject-Driven Video Generation Using 1% Compute.

Undergraduate Research Experience

GIST
Undergraduate Researcher (Jul. 2020 ~ Aug. 2023)
Advisors: Jonghyun Choi, Jeany Son
Now SNU MPR Lab and POSTECH CV LAB
Details

Built an early research foundation in computer vision, visual recognition, video understanding, and generative modeling. Published multiple domestic conference and journal papers through KSC, KIMST, and the Journal of KIISE, along with international conference papers including ICCV 2025 and ICCV 2023.

I was grateful to collaborate closely with:

Education

Seoul National University
Ph.D. student (Sep. 2025 ~ Present), Advisor: Jaesik Park
M.S. (Sep. 2023 ~ Aug. 2025), Best Thesis Award
Details

Master's thesis: Advancing Image Editing through Layer-wise Memory in Diffusion. The thesis studied controllable image editing in diffusion models through layer-wise memory mechanisms and was recognized with the Best Master's Thesis Award. Thesis link will be added later.

University of California, Berkeley
Exchange Program (Jun. 2021 ~ Dec. 2021)
Details

Took upper-division and theory-oriented coursework including EECS 126, convex optimization, CS 70, and discrete mathematics, along with related courses in probability, algorithms, and computer science foundations.

GIST
B.S. in Electrical Engineering and Computer Science (Aug. 2023), Magna Cum Laude
Details

Graduated 2nd in the Department of Electrical Engineering and Computer Science and received Magna Cum Laude honors. Coursework and undergraduate research focused on machine learning, computer vision, and systems foundations.

Honors and Awards

Aug. 2025 Best Master's Thesis Award
Seoul National University, GSAI Thesis: "Advancing Image Editing through Layer-wise Memory in Diffusion"
Feb. 2025 Award for Outstanding Poster Presentation
IPIU 2025 (37th Workshop on Image Processing and Image Understanding) "층위별 메모리를 활용한 이미지 편집 개선 기법"
Aug. 2023 Magna Cum Laude
Gwangju Institute of Science and Technology (GIST)
Mar. 2021 - Aug. 2023 Presidential Science Scholarship
Full tuition and stipend of $5,000 per year ~150 students selected nationally
Jun. 2021 - Dec. 2021 Scholarship for Study Abroad Program
Full support of expenses + $24,000 stipend for UC Berkeley Exchange Program

Publications

* indicates equal contribution. † indicates corresponding authors.

Learning Zero-Shot Subject-Driven Video Generation Using 1% Compute
Daneul Kim, Jingxu Zhang, Wonjoon Jin, Sunghyun Cho, Qi Dai, Jaesik Park†, Chong Luo†
ECCV 2026 NEW!
Details

TL;DRGenerates videos of a user-specified subject with no per-subject tuning by separating identity injection (from subject images) and motion learning (from unrelated videos), matching prior zero-shot methods at roughly 1% of the compute.

RCEdit-500K teaser
RCEdit-500K: Reference Completion for Image-Conditioned Image Editing
Jingxu Zhang, Daneul Kim, Yueming Pan, Dong Chen, Kai Qiu, Yang Liu, Yifan Yang, Qi Dai, Xiaoyan Sun, and Chong Luo
ECCV 2026 NEW!
Details

TL;DRA large open dataset for reference-image-guided editing, built via "reference completion" — automatically synthesizing the missing reference for existing text-edit triplets to yield ~477K input–reference–instruction–target quadruplets across six edit categories.

RCEdit-500K teaser
4D Human-Scene Reconstruction from Low-Overlap Captures
Minhyuk Hwang*, Sangmin Kim*, Seunguk Do, Daneul Kim, and Jaesik Park
ACM SIGGRAPH 2026 NEW!
Details

TL;DRReconstructs dynamic 3D humans and their surrounding scene over time from only a few low-overlap cameras, using camera-controlled video diffusion to synthesize dense novel views and reconstructing background and people separately with cross-view tracking.

A Comprehensive Ecosystem for Open-Domain Customized Video Generation
Jingxu Zhang, Yuqian Hong, Daneul Kim, Kai Qiu, Qi Dai, Jianmin Bao, Yifan Yang, Xiaoyan Sun, Chong Luo
ICASSP 2026 NEW!
Details

TL;DRTackles open-domain customized video generation — synthesizing videos of user-specified subjects across arbitrary domains — with an end-to-end ecosystem spanning data, model, and evaluation.

Qualitative comparison · "a dog on top of a purple rug in a forest"
Reference
Reference image
VideoBooth
BLIP-Diff.
IP-Adapter
MS-Diff.
OminiCtrl
Ours
CustomDiT (rightmost) best preserves subject identity while generating natural, dynamic videos.
Online Generic Event Boundary Detection
Hyung Rok Jung*, Daneul Kim*, Seunggyun Lim, Jeany Son†, Jonghyun Choi†
ICCV 2025
Details

TL;DRIntroduces online event-boundary detection for streaming video (unlike prior methods that need the whole clip), with an Event-Segmentation-Theory–inspired model (ESTimator) that flags boundaries where its frame predictions diverge from what actually happens.

Exploring MMDiT
Exploring Multimodal Diffusion Transformers for Enhanced Prompt-based Image Editing
Joonghyuk Shin, Alchan Hwang, Yujin Kim, Daneul Kim, Jaesik Park
ICCV 2025
Details

TL;DREditing methods built for U-Net diffusion fail on multimodal diffusion transformers (MM-DiT); this work analyzes MM-DiT's text–image attention to build an editing method handling global-to-local edits across MM-DiT variants, including few-step models.

Exploring MMDiT
Improving Editability
Improving Editability in Image Generation with Layer-wise Memory
Daneul Kim, Jaeah Lee, Jaesik Park
CVPR 2025
Details

TL;DRSupports long, multi-step image edits by storing past edits in a layer-wise memory, with background-consistency guidance and multi-query disentanglement so objects can be added or removed with rough masks while keeping earlier edits intact.

Improving Editability
Pick-or-Mix
Pick-or-Mix: Dynamic Channel Sampling for ConvNets
Ashish Kumar, Daneul Kim, Jaesik Park, Laxmidhar Behera
CVPR 2024
Details

TL;DRReplaces costly 1×1 channel-squeeze layers in ConvNets by splitting channels into subsets and dynamically picking per pixel based on the input — about 25% faster at comparable accuracy, and reusable as a downscaler and dynamic pruner.

Pick-or-Mix
CMOTA
Story Visualization by Online Text Augmentation with Context Memory
Daechul Ahn, Daneul Kim, Gwangmo Song, Seung Hwan Kim, Honglak Lee, Dongyeop Kang, Jonghyun Choi
ICCV 2023
Details

TL;DRGenerates coherent image sequences from multi-sentence stories using a context-memory module that propagates information across frames and an online text-augmentation scheme that creates pseudo-descriptions during training, improving long-term consistency.

CMOTA

Academic Service

Teaching Assistant

Spring 2026 Introduction to Machine Learning
Seoul National University
Summer 2025 Creating Images Using Generative AI
Samsung Data Scientist Academy, Samsung Electronics
Summer 2025 Generative AI Capstone Project
HD Hyundai
Fall 2023 Generative Artificial Intelligence
Seoul National University
Spring 2021 Object Oriented Programming
GIST

Invited Talks

Aug. 2025 Recent Trends on Visual Generation: From Basics to Application
SNU Medical Vision Lab, Seoul National University Hospital (SNUH)
Apr. 2025 Improving Editability in Image Generation with Layer-wise Memory
SNU AI Institute Brown Bag Seminar

Reviewer Services

Reviewer CVPR 2026, ICLR 2026, ECCV 2026, NeurIPS 2026, TPAMI 2026, SIGGRAPH Asia 2026
Program Committee SBP-BRiMS 2026

Contact

Feel free to reach out for research discussions, collaboration ideas, or anything related to visual generation and multimodal learning.

Email carpedkm [at] snu [dot] ac [dot] kr
Lab Location 650 Ho, 303 Dong, 1 Gwanak-ro, Gwanak-gu, Seoul, Republic of Korea (Zip: 08826)
(in Korean) 대한민국 서울특별시 관악구 관악로 1, 303동 650호 AGI 컴퓨팅 파운데이션 클러스터

You can also reach out to my advisor Jaesik Park for any collaboration.

Posts

Published notes stay public. The private writing studio now uses the shared admin session and the unified Mac backend gateway.

Published

Writing Archive

Browse the current post archive and published long-form notes from the dedicated blog backend.

Open archive

Admin

Blog Studio

Sign in, draft posts, publish entries, and manage the local writing backend without touching other services.

Open private studio

Project

Project Access

Private projects are locked

Guest Mode

Leaderboard

Introduction to Machine Learning Leaderboard

2026.03 ~ Present

Submission and ranking dashboard for the 2026S Introduction to Machine Learning assignment.

Open leaderboard

Private Project

PocketChat

2026.04 ~ Present

Telegram-like web chat hosted on the Mac mini with a single shared admin account and one managed gateway entrypoint.

Open PocketChat

Private Project

Pocket Log

2026.04 ~ Present

Private group daily video check-ins with same-moment multi-view slots, invite-only sharing, and an end-of-day recap built for close circles.

Open Pocket Log

Private Project

Blog Studio

2026.04 ~ Present

Private writing studio backed by the Mac mini, now routed through the same managed gateway as the other private services.

Open Blog Studio