Sanjiban Choudhury

sanjibanc at cornell dot edu

prof_pic.jpg

Assistant Professor

Cornell Computer Science

I am an Assistant Professor at Cornell Computer Science where I lead the People and Robot Teaching and Learning (PoRTaL) group. I am also a Research Scientist at Aurora.

My research focuses on advancing imitation learning and decision making in robotics and AI systems. My group works on both theory and algorithms for efficiently training agents with applications to home robots, multi-agent games, language models and self-driving.

Prospective students and undergraduate researchers see this note.

News

Jul 15, 2024 New ICML 2024 paper on Hybrid Inverse Reinforcement Learning! We propose a new, practical framework for efficient inverse RL, based on a simple idea: “Don’t be optimal, just be better than the demonstrator”
Jun 5, 2024 We got a Google Research Award for Planning with LLMs! Stay tuned for some exciting work on distilling model-based planning into LLMs to solve decision making tasks.
Jun 1, 2024 Thank you OpenAI for the Superalignment Award! Stay tuned for some exciting work on task super alignment, where we use transfer learning to enable generalization from easy to hard tasks.
May 15, 2024 MOSAIC wins best paper award at Vision-Language Models for Navigation and Manipulation at ICRA 2024! And the best poster award at MOMA workshop at ICRA 2024!
Apr 3, 2024 Excited to finally share MOSAIC! We combine multiple foundation models to design a multi-robot collaborative cooking system.
Apr 1, 2024 Fun talk at Andrea’s class in CMU on To Rl or not to RL!
Mar 25, 2024 New NSF FRR grant on Inverse Task Planning with LLMs with Jeannette Bohg!

Selected Papers

2024_mosaic

2024_mosaic_skills

MOSAIC: A Modular System for Assistive and Interactive Cooking
Yuki Wang, Kushal Kedia, Juntao Ren, Rahma Abdullah, ..., Sanjiban Choudhury

paper / website

MOSAIC combines large pre-trained models for general tasks with task-specific modules to enable collaborative cooking.


2023_moment_matching

The Virtues of Laziness in Model-based RL: A Unified Objective and Algorithms
Anirudh Vemula, Yuda Song, Aarti Singh, J. Andrew Bagnell, Sanjiban Choudhury
International Conference on Machine Learning (ICML), 2023
paper / talk

We propose a completely new in Model-based Reinforcement Learning (MBRL) that solves two long-standing challenges: computational efficiency and mismatched objectives.


2023_demo2code

Demo2Code: From Summarizing Demonstrations to Synthesizing Code via Extended Chain-of-Thought
Yuki Wang, Gonzalo Gonzalez-Pumariega, Yash Sharma, Sanjiban Choudhury
Neural Information Processing Systems (NeurIPS), 2023
paper / website

Demo2Code leverages LLMs to convert demonstrations to robot task code. It recursively summarizes both down to a task specification, then recursively expands the specification to an executable task code.


2023_manicast

ManiCast: Collaborative Manipulation with Cost-Aware Human Forecasting
Kushal Kedia, Prithwish Dan, Atiksh Bhardwaj, Sanjiban Choudhury
Conference on Robot Learning (CoRL), 2023
paper / website

ManiCast trains transformer models to predict human motion using cost-aware losses. These forecasts are then used by MPC for collaborative manipulation tasks.