I am a founding member at Periodic Labs. I am also an Adjunct Professor at McGill University. Briefly, I worked on reinforcement learning and reasoning at Meta. Before that, I was a staff research scientist in the Google DeepMind Team. I finished my PhD at Mila under the guidance of Aaron Courville and Marc Bellemare. Previously, I spent a year at Geoffrey Hinton's amazing team in Google Brain, Toronto. Earlier, I graduated in Computer Science and Engineering from IIT Bombay.

My current research revolves around RL and LLMs, and my prior work has received an outstanding paper award at NeurIPS. I also came up with On-policy Distillation for LLMs and was a core contributor for Gemma and Gemini models.

Talks & Tutorials

Past Interns & Student Researchers

News