I am a third-year Ph.D. student in the Visual Intelligence Lab, working on computer vision and multimodal large language models (MLLMs).
Currently, we focus on reasoning and agentic MLLMs to address frontier challenges in multimodal learning. Please feel free to reach out if you are interested in related topics.