Publications

(2024). GHIL-Glue: Hierarchical Control with Filtered Subgoal Images. In arxiv:2410.20018.

PDF Cite Code Project

(2024). Bridging Language and Action: A Survey of Language-Conditioned Robot Manipulation. In arxiv:2312.10807.

PDF Cite

(2024). Scaling Robot Policy Learning via Zero-Shot Labeling with Foundation Models. In CoRL.

PDF Cite Project

(2024). Steering Your Generalists: Improving Robotic Foundation Models via Value Guidance. In CoRL.

PDF Cite Project

(2024). The Ingredients for Robotic Diffusion Transformers. In arxiv:2410.10088.

PDF Cite Code Dataset Project

(2024). LeLaN: Learning A Language-conditioned Navigation Policy from In-the-Wild Video. In CoRL.

PDF Cite Code Project

(2024). Scaling Cross-Embodied Learning: One Policy for Manipulation, Navigation, Locomotion and Aviation. In CoRL.

PDF Cite Code Project

(2024). Policy Adaptation via Language Optimization: Decomposing Tasks for Few-Shot Imitation. In CoRL.

PDF Cite Code Project

(2024). Autonomous Improvement of Instruction Following Skills via Foundation Models. In CoRL.

PDF Cite Code Dataset Project

(2024). Robotic Control via Embodied Chain-of-Thought Reasoning. In CoRL.

PDF Cite Code Project

(2024). Evaluating Real-World Robot Manipulation Policies in Simulation. In CoRL.

PDF Cite Code Project Video

(2024). Vision-Language Models Provide Promptable Representations for Reinforcement Learning. In arxiv:2402.02651.

PDF Cite Project

(2023). Octo: An Open-Source Generalist Robot Policy. In RSS.

PDF Cite Code Project

(2023). Audio Visual Language Maps for Robot Navigation. In ISER.

PDF Cite Code Project Video

(2022). Visual Language Maps for Robot Navigation. In ICRA.

PDF Cite Code Project Video Google AI Blog

(2022). Grounding Language with Visual Affordances over Unstructured Data. In ICRA.

PDF Cite Code Dataset Project

(2022). Latent Plans for Task Agnostic Offline Reinforcement Learning. In CoRL.

PDF Cite Code Dataset Project

(2022). Affordance Learning from Play for Sample-Efficient Policy Learning. In ICRA.

PDF Cite Code Dataset Project Video

(2020). Composing Pick-and-Place Tasks By Grounding Language. In ISER.

PDF Cite Project

(2020). Hindsight for Foresight: Unsupervised Structured Dynamics Models from Physical Interaction. In IROS.

PDF Cite Dataset Project Video Talk

(2019). Adversarial Skill Networks: Unsupervised Robot Skill Learning from Video. In ICRA.

PDF Cite Code Dataset Project Video Talk

(2019). Self-supervised 3D Shape and Viewpoint Estimation from Single Images for Robotics. In IROS.

PDF Cite Code Video

(2017). Perspectives on Deep Multimodel Robot Learning. In ISRR.

PDF Cite

(2017). Metric Learning for Generalizing Spatial Relations to New Objects. In IROS.

PDF Cite Code Dataset Project Video