-
multimodal systems. The emphasis is on agentic approaches, where an LLM interacts with visual tools, which may themselves be neural networks. Central challenges include enabling LLMs to reason about visual
-
Familiarity with large language models or multimodal systems
An interest in visual reasoning, educational technology, or human–AI interaction
Experience with neural networks for image or video understanding