Preprints

CCIL: Continuity-based Data Augmentation for Corrective Imitation Learning
Liyiming Ke*, Yunchu Zhang*, Abhay Deshpande, Siddhartha Srinivasa, Abhishek Gupta
Universal Visual Decomposer: Long-Horizon Manipulation Made Easy
Zichen Zhang*, Yunshuang Li*, Osbert Bastani, Abhishek Gupta, Dinesh Jayaraman, Yecheng Jason Ma, Luca Weihs
Human-Assisted Continual Robot Learning with Foundation Models
Meenal Parakh, Alisha Fong, Anthony Simeonov, Abhishek Gupta, Tao Chen, Pulkit Agrawal
DexTransfer: Real World Multi-fingered Dexterous Grasping with Minimal Human Demonstrations
Zoey Qiuyu Chen, Karl Van Wyk, Yu-Wei Chao, Wei Yang, Arsalan Mousavian, Abhishek Gupta, Dieter Fox
Accelerating online reinforcement learning with offline datasets
Ashvin Nair*, Abhishek Gupta*, Murtaza Dalal, Sergey Levine
Ecological Reinforcement Learning
John D Co-Reyes, Suvansh Sanjeev, Glen Berseth, Abhishek Gupta, Sergey Levine
Unsupervised meta-learning for reinforcement learning
Abhishek Gupta*, Benjamin Eysenbach*, Chelsea Finn, Sergey Levine
Learning latent state representation for speeding up exploration
Giulia Vezzani, Abhishek Gupta, Lorenzo Natale, Pieter Abbeel
Soft actor-critic algorithms and applications
Tuomas Haarnoja, Aurick Zhou, Kristian Hartikainen, George Tucker, Sehoon Ha, Jie Tan, Vikash Kumar, Henry Zhu, Abhishek Gupta, Pieter Abbeel, Sergey Levine

2023

RoboHive: A Unified Framework for Robot Learning
Vikash Kumar, Rutav Shah, Gaoyue Zhou, Vincent Moens, Vittorio Caggiano, Jay Vakil, Abhishek Gupta, Aravind Rajeswaran
Neural Information Processing Systems (NeurIPS), Datasets and Benchmarks Track, 2023
Autonomous Robotic Reinforcement Learning with Asynchronous Human Feedback
Max Balsells I Pamies, Marcel Torne Villasevil, Zihan Wang, Samedh Desai, Pulkit Agrawal, Abhishek Gupta
Conference on Robot Learning (CoRL), 2023
REBOOT: Reuse Data for Bootstrapping Efficient Real-World Dexterous Manipulation
Zheyuan Hu, Aaron Rovinsky, Jianlan Luo, Vikash Kumar, Abhishek Gupta, Sergey Levine
Conference on Robot Learning (CoRL), 2023
Fighting Uncertainty with Gradients: Offline Reinforcement Learning via Diffusion Score Matching
H.J. Terry Suh, Glen Chou, Hongkai Dai, Lujie Yang, Abhishek Gupta, Russ Tedrake
Conference on Robot Learning (CoRL), 2023
Breadcrumbs to the Goal: Goal-Conditioned Exploration from Human-in-the-Loop Feedback
Marcel Torne, Max Balsells, Zihan Wang, Samedh Desai, Tao Chen, Pulkit Agrawal, Abhishek Gupta
Neural Information Processing Systems (NeurIPS), 2023
Beyond Uniform Sampling: Offline Reinforcement Learning with Imbalanced Datasets
Zhang-Wei Hong, Aviral Kumar, Sathwik Karnik, Abhishek Bhandwaldar, Akash Srivastava, Joni Pajarinen, Romain Laroche, Abhishek Gupta, Pulkit Agrawal
Neural Information Processing Systems (NeurIPS), 2023
Self-Supervised Reinforcement Learning that Transfers using Random Features
Boyuan Chen*, Chuning Zhu*, Pulkit Agrawal, Kaiqing Zhang+, Abhishek Gupta+
Neural Information Processing Systems (NeurIPS), 2023
RePo: Resilient Model-Based Reinforcement Learning by Regularizing Posterior Predictability
Chuning Zhu, Max Simchowitz, Siri Gadipudi, Abhishek Gupta
Neural Information Processing Systems (NeurIPS), 2023 (Spotlight)
Tackling combinatorial distribution shift: A matrix completion perspective
Max Simchowitz, Abhishek Gupta, Kaiqing Zhang
Conference on Learning Theory (COLT), 2023
GenAug: Retargeting behaviors to unseen situations via Generative Augmentation
Zoey Chen, Sho Kiami, Abhishek Gupta+, Vikash Kumar+
Robotics: Science and Systems (RSS), 2023, (Best Paper Finalist)
Cherry-Picking with Reinforcement Learning: Robust Dynamic Grasping in Unstable Conditions
Yunchu Zhang, Liyiming Ke, Abhay Deshpande, Abhishek Gupta, Siddhartha Srinivasa
Robotics: Science and Systems (RSS), 2023
Tackling Combinatorial Distribution Shift: A Matrix Completion Perspective
Max Simchowitz, Abhishek Gupta, Kaiqing Zhang
International Conference on Machine Learning (ICML), 2023
Guiding Pretraining in Reinforcement Learning with Large Language Models
Yuqing Du*, Olivia Watkins*, Zihan Wang, Cédric Colas, Trevor Darrell, Pieter Abbeel, Abhishek Gupta, Jacob Andreas
International Conference on Machine Learning (ICML), 2023
Learning to Extrapolate: A Transductive Approach
Aviv Netanyahu*, Abhishek Gupta*, Max Simchowitz, Kaiqing Zhang, Pulkit Agrawal
International Conference on Learning Representations (ICLR), 2023
TactoFind: A Tactile Only System for Object Retrieval
Sameer Pai*, Tao Chen*, Megha Tippur, Edward Adelson, Abhishek Gupta+, Pulkit Agrawal+
International Conference on Robotics and Automation (ICRA), 2023
Demonstration-Bootstrapped Autonomous Practicing via Multi-Task Reinforcement Learning
Abhishek Gupta, Corey Lynch, Brandon Kinman, Garrett Peake, Sergey Levine, Karol Hausman
International Conference on Robotics and Automation (ICRA), 2023
Dexterous Manipulation from Images: Autonomous Real-World RL via Substep Guidance
Kelvin Xu*, Zheyuan Hu*, Ria Doshi, Aaron Rovinsky, Vikash Kumar, Abhishek Gupta, Sergey Levine
International Conference on Robotics and Automation (ICRA), 2023

2022

Learning Robust Real-World Dexterous Grasping Policies via Implicit Shape Augmentation
Qiuyu Chen, Karl Van Wyk, Yu-Wei Chao, Wei Yang, Arsalan Mousavian, Abhishek Gupta, Dieter Fox
Conference on Robot Learning (CoRL), 2022
Unpacking Reward Shaping: Understanding the Benefits of Reward Engineering on Sample Complexity
Abhishek Gupta*, Aldo Pacchiano*, Simon Zhai, Sham Kakade, Sergey Levine
Neural Information Processing Systems (NeurIPS), 2022
Distributionally Adaptive Meta Reinforcement Learning
Anurag Ajay*, Abhishek Gupta*, Dibya Ghosh, Sergey Levine, Pulkit Agrawal
Neural Information Processing Systems (NeurIPS), 2022
Autonomous Reinforcement Learning: Formalism and Benchmarking
Archit Sharma*, Kelvin Xu*, Nikhil Sardana, Abhishek Gupta, Karol Hausman, Sergey Levine, Chelsea Finn
International Conference on Learning Representations (ICLR), 2022

2021

Teachable Reinforcement Learning via Advice Distillation
Olivia Watkins, Trevor Darrell, Pieter Abbeel, Jacob Andreas, Abhishek Gupta
Neural Information Processing Systems (NeurIPS), 2021
Autonomous reinforcement learning via subgoal curricula
Archit Sharma, Abhishek Gupta, Sergey Levine, Karol Hausman, Chelsea Finn
Neural Information Processing Systems (NeurIPS), 2021
Adaptive Risk Minimization: A Meta-Learning Approach for Tackling Group Shift
Marvin Mengxin Zhang, Henrik Marklund, Nikita Dhawan, Abhishek Gupta, Sergey Levine, Chelsea Finn
Neural Information Processing Systems (NeurIPS), 2021
Which Mutual-Information Representation Learning Objectives are Sufficient for Control?
Kate Rakelly, Abhishek Gupta, Carlos Florensa, Sergey Levine
Neural Information Processing Systems (NeurIPS), 2021
MURAL: Meta-learning uncertainty-aware rewards for outcome-driven reinforcement learning
Kevin Li*, Abhishek Gupta*, Ashwin Reddy, Vitchyr H Pong, Aurick Zhou, Justin Yu, Sergey Levine
International Conference on Machine Learning (ICML), 2021
Reset-free reinforcement learning via multi-task learning: Learning dexterous manipulation behaviors without human intervention
Abhishek Gupta*, Justin Yu*, Tony Z Zhao*, Vikash Kumar*, Aaron Rovinsky, Kelvin Xu, Thomas Devlin, Sergey Levine
International Conference on Robotics and Automation (ICRA), 2021
Learning to Reach Goals via Iterated Supervised Learning
Dibya Ghosh*, Abhishek Gupta*, Ashwin Reddy, Justin Fu, Coline Devin, Benjamin Eysenbach, Sergey Levine
International Conference on Learning Representations (ICLR), 2021

2020

The ingredients of real-world robotic reinforcement learning
Henry Zhu*, Justin Yu*, Abhishek Gupta*, Dhruv Shah, Kristian Hartikainen, Avi Singh, Vikash Kumar, Sergey Levine
International Conference on Learning Representations (ICLR), 2020
DisCor: Corrective Feedback in Reinforcement Learning via Distribution Correction
Aviral Kumar, Abhishek Gupta, Sergey Levine
Neural Information Processing Systems (NeurIPS), 2020
Gradient Surgery for Multi-Task Learning
Tianhe Yu, Saurabh Kumar, Abhishek Gupta, Sergey Levine, Karol Hausman, Chelsea Finn
Neural Information Processing Systems (NeurIPS), 2020

2019

Robel: Robotics benchmarks for learning with low-cost robots
Michael Ahn, Henry Zhu, Kristian Hartikainen, Hugo Ponte, Abhishek Gupta, Sergey Levine, Vikash Kumar
Conference on Robot Learning (CoRL), 2019
Relay Policy Learning: Solving Long-Horizon Tasks via Imitation and Reinforcement Learning
Abhishek Gupta, Vikash Kumar, Corey Lynch, Sergey Levine, Karol Hausman
Conference on Robot Learning (CoRL), 2019
Unsupervised curricula for visual meta-reinforcement learning
Allan Jabri, Kyle Hsu, Abhishek Gupta, Ben Eysenbach, Sergey Levine, Chelsea Finn
Neural Information Processing Systems (NeurIPS), 2019
Guided meta-policy search
Russell Mendonca, Abhishek Gupta, Rosen Kralev, Pieter Abbeel, Sergey Levine, Chelsea Finn
Neural Information Processing Systems (NeurIPS), 2019
Dexterous Manipulation with Deep Reinforcement Learning: Efficient, General, and Low-Cost
Henry Zhu*, Abhishek Gupta*, Aravind Rajeswaran, Sergey Levine, Vikash Kumar
International Conference on Robotics and Automation (ICRA), 2019
Guiding Policies with Language via Meta-Learning
John D. Co-Reyes, Abhishek Gupta, Suvansh Sanjeev, Nick Altieri, Jacob Andreas, John DeNero, Pieter Abbeel, Sergey Levine
International Conference on Learning Representations (ICLR), 2019
Learning actionable representations with goal-conditioned policies
Dibya Ghosh, Abhishek Gupta, Sergey Levine
International Conference on Learning Representations (ICLR), 2019
Automatically composing representation transformations as a means for generalization
Michael B Chang, Abhishek Gupta, Sergey Levine, Thomas L Griffiths
International Conference on Learning Representations (ICLR), 2019
Diversity is all you need: Learning skills without a reward function
Benjamin Eysenbach, Abhishek Gupta, Julian Ibarz, Sergey Levine
International Conference on Learning Representations (ICLR), 2019

2018

Self-Consistent Trajectory Autoencoder: Hierarchical Reinforcement Learning with Trajectory Embeddings
John Co-Reyes, YuXuan Liu, Abhishek Gupta, Benjamin Eysenbach, Pieter Abbeel, Sergey Levine
International Conference on Machine Learning (ICML), 2018
Imitation from observation: Learning to imitate behaviors from raw video via context translation
YuXuan Liu, Abhishek Gupta, Pieter Abbeel, Sergey Levine
International Conference on Robotics and Automation (ICRA), 2018
Meta-Reinforcement Learning of Structured Exploration Strategies
Abhishek Gupta, Russell Mendonca, YuXuan Liu, Pieter Abbeel, Sergey Levine
Neural Information Processing Systems (NeurIPS), 2018
Learning complex dexterous manipulation with deep reinforcement learning and demonstrations
Aravind Rajeswaran*, Vikash Kumar*, Abhishek Gupta, Giulia Vezzani, John Schulman, Emanuel Todorov, Sergey Levine
Robotics Science and Systems (RSS), 2018

2017

Learning Modular Neural Network Policies for Multi-Task and Multi-Robot Transfer
Coline Devin*, Abhishek Gupta*, Trevor Darrell, Pieter Abbeel, Sergey Levine
International Conference on Robotics and Automation (ICRA), 2017
Learning Invariant Feature Spaces to Transfer Skills with Reinforcement Learning
Abhishek Gupta*, Coline Devin*, YuXuan Liu, Pieter Abbeel, Sergey Levine
International Conference on Learning Representations (ICLR), 2017

2016

Learning Dexterous Manipulation for a Soft Robotic Hand from Human Demonstrations
Abhishek Gupta, Clemens Eppner, Sergey Levine, Pieter Abbeel
International Conference on Intelligent Robots and Systems (IROS), 2016
Guided search for task and motion plans using learned heuristics
Rohan Chitnis, Dylan Hadfield-Menell, Abhishek Gupta, Siddharth Srivastava, Edward Groshev, Christopher Lin, Pieter Abbeel
International Conference on Robotics and Automation (ICRA), 2016

2015

Learning from multiple demonstrations using trajectory-aware non-rigid registration with applications to deformable object manipulation
Alex X Lee, Abhishek Gupta, Henry Lu, Sergey Levine, Pieter Abbeel
International Conference on Intelligent Robots and Systems (IROS), 2015
Learning force-based manipulation of deformable objects from multiple demonstrations
Alex X Lee, Henry Lu, Abhishek Gupta, Sergey Levine, Pieter Abbeel
International Conference on Robotics and Automation (ICRA), 2015
Tractability of planning with loops
Siddharth Srivastava, Shlomo Zilberstein, Abhishek Gupta, Pieter Abbeel, Stuart Russell
AAAI Conference on Artificial Intelligence (AAAI), 2015