I work on reinforcement learning and reasoning in the LLama team at Meta, based out of Montreal. I am also an Adjunct Professor at McGill University. Previously, I was a staff research scientist in the Google DeepMind Team . I finished my PhD at Mila under the guidance of Aaron Courville and Marc Bellemare. Previously, I spent a year at Geoffrey Hinton's amazing team in Google Brain, Toronto. Earlier, I graduated in Computer Science and Engineering from IIT Bombay.

My current research revolves around RL and LLMs, and my prior work has received an outstanding paper award at NeurIPS.

Current PhD Students

  • Morgane Moss (Co-supervised with Aaron Courville)

Past Mentees & Student Researchers

News