Sort by
Refine Your Search
-
reinforcement learning for large language models (LLMs). Research directions include developing next-generation post-training algorithms, exploring diffusion-based approaches to reasoning with language models
Enter an email to receive alerts for algorithm-development-"Prof" positions