From 0573b673849e1f2aa293edfe1be8c687bc25721b Mon Sep 17 00:00:00 2001 From: =?UTF-8?q?Ma=C3=ABlic=20Neau?= Date: Fri, 4 Aug 2023 11:55:39 +0200 Subject: [PATCH] RT-2 is out + VoxPoser --- README.md | 4 ++++ 1 file changed, 4 insertions(+) diff --git a/README.md b/README.md index a7202d3..d8e2c55 100755 --- a/README.md +++ b/README.md @@ -19,6 +19,8 @@ If you find this repository useful, please consider [citing](#citation) and STAR --- ## Reasoning +* **[RT-2]** "RT-2: Vision-Language-Action Models Transfer Web Knowledge to Robotic Control", *Blog, July 2023*. +[[Paper](https://robotics-transformer2.github.io/assets/rt2.pdf)] [[Website](https://robotics-transformer2.github.io/)] * **Instruct2Act**: "Mapping Multi-modality Instructions to Robotic Actions with Large Language Model", *arXiv, May 2023*. [[Paper](https://arxiv.org/pdf/2305.11176.pdf)] [[Pytorch Code](https://github.com/OpenGVLab/Instruct2Act)] * **TidyBot**: "Personalized Robot Assistance with Large Language Models", *arXiv, May 2023*. [[Paper](https://arxiv.org/abs/2305.05658)] [[Pytorch Code](https://github.com/jimmyyhwu/tidybot/tree/main/robot)] [[Website](https://tidybot.cs.princeton.edu/)] * **PaLM-E**: "PaLM-E: An Embodied Multimodal Language Model", *arXiv, Mar 2023*, [[Paper](https://arxiv.org/abs/2303.03378)] [[Webpage](https://palm-e.github.io/)] @@ -69,6 +71,8 @@ If you find this repository useful, please consider [citing](#citation) and STAR --- ## Manipulation +* **[VoxPoser]** "VoxPoser: Composable 3D Value Maps for Robotic Manipulation with Language Models", *arXiv, July 2023* +[[Paper](https://arxiv.org/pdf/2307.05973.pdf)] [[Website](https://voxposer.github.io/)] * **ProgramPort**:"Programmatically Grounded, Compositionally Generalizable Robotic Manipulation", *ICLR, Apr 2023*, [[Paper](https://arxiv.org/abs/2304.13826)] [[Website] (https://progport.github.io/)] * **CoTPC**:"Chain-of-Thought Predictive Control", *arXiv, Apr 2023*, [[Paper](https://arxiv.org/abs/2304.00776)] [[Code](https://github.com/SeanJia/CoTPC)] * **DIAL**:"Robotic Skill Acquistion via Instruction Augmentation with Vision-Language Models", *arXiv, Nov 2022*, [[Paper](https://arxiv.org/abs/2211.11736)] [[Website](https://instructionaugmentation.github.io/)]