MIT ML Seminar: Swaroop Mishra
Host
Tommi Jaakkola
Title: Instruction Following and Reasoning in Large Language Models
Bio: Swaroop Mishra is a Senior Research Scientist at Google DeepMind (formerly Google Brain), where he works on Gemini reasoning. He and his team recently built a system that received a Silver Medal at the International Mathematical Olympiad (IMO) 2024. His main research contributions are on instruction-tuning and reasoning methods, including natural instructions, super-natural instructions, reframing, question-decomposition, math via programs, help me think, and instruction-bias (EACL 2023 Outstanding Paper Award). He is a co-organizer of the MATH-AI workshop at NeurIPS 2022 and NeurIPS 2024. His work on "Natural Instructions" has recently received the AI2 "Lasting Impact Paper Award."
Bio: Swaroop Mishra is a Senior Research Scientist at Google DeepMind (formerly Google Brain), where he works on Gemini reasoning. He and his team recently built a system that received a Silver Medal at the International Mathematical Olympiad (IMO) 2024. His main research contributions are on instruction-tuning and reasoning methods, including natural instructions, super-natural instructions, reframing, question-decomposition, math via programs, help me think, and instruction-bias (EACL 2023 Outstanding Paper Award). He is a co-organizer of the MATH-AI workshop at NeurIPS 2022 and NeurIPS 2024. His work on "Natural Instructions" has recently received the AI2 "Lasting Impact Paper Award."