[Events]
Artificial Intelligence Graduate School Expert Invitation Seminar (Professor Lee Jun-seok @ Seoul National University, April 14th (Monday) 13:30)
소프트웨어융합대학
Hit625
2025-04-08
- Date & Time: April 14th (Monday) 13:30~ (Approximately 1 hour)
- Location: Corporate Collaboration Center, 7th Floor, Room 85718
- Speaker: Professor Lee Jun-seok, Graduate School of Data Science, Seoul National University
- Lecture Title: Multimodal Image and Video Understanding on Various Applications
- Lecture Summary: In this talk, we will first overview the definition of multimodal learning in modern AI, followed by several recent interesting applications. First, we will cover referring image segmentation, a task to predict a segmentation mask of the referred object given an image and a text. A simple data augmentation technique turns out to be powerful on this task with promising outcomes. Second, we will talk about video summarization, a task to select important frames or clips from a video, to fully summarize the entire content or to detect interesting parts of it. We present a recent large-scale summarization dataset and promising results when pre-trained on them. Third, we will talk about recent generation and editing models for images and videos, focusing on the characteristics of the latent space learned by diffusion models. We present a work to further improve the nature of the space using isometric regularization. If time permits, we will briefly discuss how these video understanding techniques can be applied to video recommendations.