rl_sar/README.md

# rl_sar

[中文文档](README_CN.md)

Simulation verification and physical deployment of robot reinforcement learning algorithms, suitable for quadruped robots, wheeled robots, and humanoid robots. "sar" stands for "simulation and real"

## Preparation

Clone the code

```bash
git clone https://github.com/fan-ziqi/rl_sar.git
```

## Dependency

This project relies on ROS Noetic (Ubuntu 20.04)

After installing ROS, install the dependency library

```bash
sudo apt install ros-noetic-teleop-twist-keyboard ros-noetic-controller-interface  ros-noetic-gazebo-ros-control ros-noetic-joint-state-controller ros-noetic-effort-controllers ros-noetic-joint-trajectory-controller
```

Download and deploy `libtorch` at any location

```bash
cd /path/to/your/torchlib
wget https://download.pytorch.org/libtorch/cpu/libtorch-cxx11-abi-shared-with-deps-2.0.1%2Bcpu.zip
unzip libtorch-cxx11-abi-shared-with-deps-2.0.1+cpu.zip -d ./
echo 'export Torch_DIR=/path/to/your/torchlib' >> ~/.bashrc
```

Install yaml-cpp

```bash
git clone https://github.com/jbeder/yaml-cpp.git
cd yaml-cpp && mkdir build && cd build
cmake -DYAML_BUILD_SHARED_LIBS=on .. && make
sudo make install
sudo ldconfig
```

Install lcm

```bash
git clone https://github.com/lcm-proj/lcm.git 
cd lcm && mkdir build && cd build
cmake .. && make
sudo make install
sudo ldconfig
```

## Compilation

Customize the following two functions in your code to adapt to different models:

```cpp
torch::Tensor forward() override;
torch::Tensor compute_observation() override;
```

Then compile in the root directory

```bash
cd ..
catkin build
```

## Running

Before running, copy the trained pt model file to `rl_sar/src/rl_sar/models/YOUR_ROBOT_NAME`, and configure the parameters in `config.yaml`.

### Simulation

Open a new terminal, launch the gazebo simulation environment

```bash
source devel/setup.bash
roslaunch rl_sar gazebo_<ROBOT>.launch
```

Where \<ROBOT\> can be `a1` or `gr1t1`.

Press **0** on the keyboard to switch the robot to the default standing position, press **P** to switch to RL control mode, and press **1** in any state to switch to the initial lying position. WS controls x-axis, AD controls yaw, and JL controls y-axis.

Press **R** to reset Gazebo environment.

### Physical Robots

#### Unitree A1

Unitree A1 can be connected using both wireless and wired methods:

* Wireless: Connect to the Unitree starting with WIFI broadcasted by the robot **(Note: Wireless connection may lead to packet loss, disconnection, or even loss of control, please ensure safety)**
* Wired: Use an Ethernet cable to connect any port on the computer and the robot, configure the computer IP as 192.168.123.162, and the gateway as 255.255.255.0

Open a new terminal and start the control program

```bash
source devel/setup.bash
rosrun rl_sar rl_real_a1
```

Press the **R2** button on the controller to switch the robot to the default standing position, press **R1** to switch to RL control mode, and press **L2** in any state to switch to the initial lying position. The left stick controls x-axis up and down, controls yaw left and right, and the right stick controls y-axis left and right.

OR Press **0** on the keyboard to switch the robot to the default standing position, press **P** to switch to RL control mode, and press **1** in any state to switch to the initial lying position. WS controls x-axis, AD controls yaw, and JL controls y-axis.

## Add Your Robot

In the following, let ROBOT represent the name of your robot.

1. Create a model package named ROBOT_description in the robots folder. Place the URDF model in the urdf path within the folder and name it ROBOT.urdf. Create a namespace named ROBOT_gazebo in the config folder within the model file for joint configuration.
2. Place the model file in models/ROBOT.
3. Add a new field in rl_sar/config.yaml named ROBOT and adjust the parameters, such as changing the model_name to the model file name from the previous step.
4. Add a new launch file in the rl_sar/launch folder. Refer to other launch files for guidance on modification.
5. Change ROBOT_NAME to ROBOT in rl_xxx.cpp.
6. Compile and run.

## Reference

[unitree_ros](https://github.com/unitreerobotics/unitree_ros)

## Citation

Please cite the following if you use this code or parts of it:

```
@software{fan-ziqi2024rl_sar,
  author = {fan-ziqi},
  title = {{rl_sar: Simulation Verification and Physical Deployment of Robot Reinforcement Learning Algorithm.}},
  url = {https://github.com/fan-ziqi/rl_sar},
  year = {2024}
}
```
docs: del auto checkout 2024-03-14 12:42:00 +08:00			`# rl_sar`
doc: add README 2024-03-06 17:32:15 +08:00
docs: fix CN dir 2024-03-18 14:54:46 +08:00			`[中文文档](README_CN.md)`
docs: del auto checkout 2024-03-14 12:42:00 +08:00
docs: update readme 2024-05-25 08:54:13 +08:00			`Simulation verification and physical deployment of robot reinforcement learning algorithms, suitable for quadruped robots, wheeled robots, and humanoid robots. "sar" stands for "simulation and real"`
doc: add README 2024-03-06 17:32:15 +08:00
translate 2024-03-07 11:54:33 +08:00			`## Preparation`

feat: [Destructive Update]del submodules unitree_ros 2024-05-23 20:55:35 +08:00			`Clone the code`
translate 2024-03-07 11:54:33 +08:00
			```bash
feat: [Destructive Update]del submodules unitree_ros 2024-05-23 20:55:35 +08:00			`git clone https://github.com/fan-ziqi/rl_sar.git`
translate 2024-03-07 11:54:33 +08:00			```

docs: yaml 2024-04-05 22:03:00 +08:00			`## Dependency`

docs: update readme 2024-05-25 08:54:13 +08:00			`This project relies on ROS Noetic (Ubuntu 20.04)`

			`After installing ROS, install the dependency library`

			```bash
			`sudo apt install ros-noetic-teleop-twist-keyboard ros-noetic-controller-interface ros-noetic-gazebo-ros-control ros-noetic-joint-state-controller ros-noetic-effort-controllers ros-noetic-joint-trajectory-controller`
			```

translate 2024-03-07 11:54:33 +08:00			Download and deploy `libtorch` at any location
doc: add README 2024-03-06 17:32:15 +08:00
			```bash
			`cd /path/to/your/torchlib`
			`wget https://download.pytorch.org/libtorch/cpu/libtorch-cxx11-abi-shared-with-deps-2.0.1%2Bcpu.zip`
			`unzip libtorch-cxx11-abi-shared-with-deps-2.0.1+cpu.zip -d ./`
			`echo 'export Torch_DIR=/path/to/your/torchlib' >> ~/.bashrc`
			```

docs: yaml 2024-04-05 22:03:00 +08:00			`Install yaml-cpp`

			```bash
			`git clone https://github.com/jbeder/yaml-cpp.git`
			`cd yaml-cpp && mkdir build && cd build`
			`cmake -DYAML_BUILD_SHARED_LIBS=on .. && make`
			`sudo make install`
			`sudo ldconfig`
			```

fix: build bugs 2024-04-17 15:20:19 +08:00			`Install lcm`

			```bash
			`git clone https://github.com/lcm-proj/lcm.git`
			`cd lcm && mkdir build && cd build`
			`cmake .. && make`
			`sudo make install`
			`sudo ldconfig`
			```

translate 2024-03-07 11:54:33 +08:00			`## Compilation`
doc: add README 2024-03-06 17:32:15 +08:00
docs: del auto checkout 2024-03-14 12:42:00 +08:00			`Customize the following two functions in your code to adapt to different models:`
doc: add README 2024-03-06 17:32:15 +08:00
			```cpp
			`torch::Tensor forward() override;`
			`torch::Tensor compute_observation() override;`
			```

translate 2024-03-07 11:54:33 +08:00			`Then compile in the root directory`
doc: add README 2024-03-06 17:32:15 +08:00
			```bash
			`cd ..`
			`catkin build`
			```

docs: del auto checkout 2024-03-14 12:42:00 +08:00			`## Running`

docs: yaml 2024-04-05 22:03:00 +08:00			Before running, copy the trained pt model file to `rl_sar/src/rl_sar/models/YOUR_ROBOT_NAME`, and configure the parameters in `config.yaml`.
doc: add README 2024-03-06 17:32:15 +08:00
docs: del auto checkout 2024-03-14 12:42:00 +08:00			`### Simulation`
doc: add README 2024-03-06 17:32:15 +08:00
docs: del auto checkout 2024-03-14 12:42:00 +08:00			`Open a new terminal, launch the gazebo simulation environment`
doc: add README 2024-03-06 17:32:15 +08:00
			```bash
			`source devel/setup.bash`
docs: update gr1t1 2024-05-25 17:39:22 +08:00			`roslaunch rl_sar gazebo_<ROBOT>.launch`
doc: add README 2024-03-06 17:32:15 +08:00			```

docs: update gr1t1 2024-05-25 17:39:22 +08:00			Where \<ROBOT\> can be `a1` or `gr1t1`.

feat: [Destructive Update] NEW VERSISON 2024-05-24 16:24:14 +08:00			`Press 0 on the keyboard to switch the robot to the default standing position, press P to switch to RL control mode, and press 1 in any state to switch to the initial lying position. WS controls x-axis, AD controls yaw, and JL controls y-axis.`
doc: add README 2024-03-06 17:32:15 +08:00
docs: add R 2024-05-25 17:55:57 +08:00			`Press R to reset Gazebo environment.`

docs: add cyberdog deploment 2024-04-08 23:25:05 +08:00			`### Physical Robots`
docs: del auto checkout 2024-03-14 12:42:00 +08:00
docs: add cyberdog deploment 2024-04-08 23:25:05 +08:00			`#### Unitree A1`
docs: add physical deploment 2024-03-18 14:53:31 +08:00
docs: add cyberdog deploment 2024-04-08 23:25:05 +08:00			`Unitree A1 can be connected using both wireless and wired methods:`
docs: add physical deploment 2024-03-18 14:53:31 +08:00
docs: add cyberdog deploment 2024-04-08 23:25:05 +08:00			`* Wireless: Connect to the Unitree starting with WIFI broadcasted by the robot (Note: Wireless connection may lead to packet loss, disconnection, or even loss of control, please ensure safety)`
			`* Wired: Use an Ethernet cable to connect any port on the computer and the robot, configure the computer IP as 192.168.123.162, and the gateway as 255.255.255.0`

			`Open a new terminal and start the control program`
doc: add README 2024-03-06 17:32:15 +08:00
docs: del auto checkout 2024-03-14 12:42:00 +08:00			```bash
			`source devel/setup.bash`
docs: add cyberdog deploment 2024-04-08 23:25:05 +08:00			`rosrun rl_sar rl_real_a1`
docs: del auto checkout 2024-03-14 12:42:00 +08:00			```
doc: add README 2024-03-06 17:32:15 +08:00
docs: add cyberdog deploment 2024-04-08 23:25:05 +08:00			`Press the R2 button on the controller to switch the robot to the default standing position, press R1 to switch to RL control mode, and press L2 in any state to switch to the initial lying position. The left stick controls x-axis up and down, controls yaw left and right, and the right stick controls y-axis left and right.`

feat: [Destructive Update] NEW VERSISON 2024-05-24 16:24:14 +08:00			`OR Press 0 on the keyboard to switch the robot to the default standing position, press P to switch to RL control mode, and press 1 in any state to switch to the initial lying position. WS controls x-axis, AD controls yaw, and JL controls y-axis.`
docs: add Citation 2024-03-22 00:16:13 +08:00
feat: * add joint_names to config.yaml * add ReadTensorFromYaml * Due to the fact that the robot_state_publisher sorts the joint names alphabetically the mapping table is established according to the order defined in the YAML file 2024-04-28 16:51:46 +08:00			`## Add Your Robot`

			`In the following, let ROBOT represent the name of your robot.`

			`1. Create a model package named ROBOT_description in the robots folder. Place the URDF model in the urdf path within the folder and name it ROBOT.urdf. Create a namespace named ROBOT_gazebo in the config folder within the model file for joint configuration.`
			`2. Place the model file in models/ROBOT.`
			`3. Add a new field in rl_sar/config.yaml named ROBOT and adjust the parameters, such as changing the model_name to the model file name from the previous step.`
			`4. Add a new launch file in the rl_sar/launch folder. Refer to other launch files for guidance on modification.`
			`5. Change ROBOT_NAME to ROBOT in rl_xxx.cpp.`
			`6. Compile and run.`
feat: [Destructive Update]del submodules unitree_ros 2024-05-23 20:55:35 +08:00
			`## Reference`

			`[unitree_ros](https://github.com/unitreerobotics/unitree_ros)`
feat: * add joint_names to config.yaml * add ReadTensorFromYaml * Due to the fact that the robot_state_publisher sorts the joint names alphabetically the mapping table is established according to the order defined in the YAML file 2024-04-28 16:51:46 +08:00
docs: add Citation 2024-03-22 00:16:13 +08:00			`## Citation`

			`Please cite the following if you use this code or parts of it:`

			```
			`@software{fan-ziqi2024rl_sar,`
			`author = {fan-ziqi},`
docs: update readme 2024-05-25 08:54:13 +08:00			`title = {{rl_sar: Simulation Verification and Physical Deployment of Robot Reinforcement Learning Algorithm.}},`
docs: add Citation 2024-03-22 00:16:13 +08:00			`url = {https://github.com/fan-ziqi/rl_sar},`
			`year = {2024}`
			`}`
			```