VLMs

Towards Scalable and Structured Understanding in Visual LLMs

Mohamed Elhoseiny, Associate Professor, Computer Science

Feb 23, 12:00 - 13:00

B9 L2 R2325

LLM Visual Language Models VLMs visual computing

In this talk, we explore a suite of recent advances toward scalable, structured video comprehension using Large Vision Language Models (Video LLMs).