Peter Wonka Research Group
P-Wonka
Mathematical Modeling and Differential Equations
Towards Scalable and Structured Understanding in Visual LLMs
Mohamed Elhoseiny, Associate Professor, Computer Science
Feb 23, 12:00-13:00
B9 L2 R2325
LLM
Visual Language Models
VLMs
visual computing
In this talk, we explore a suite of recent advances toward scalable, structured video comprehension using Large Vision Language Models (Video LLMs).