66c418ef38 | ||
---|---|---|
.github | ||
src | ||
.gitattributes | ||
.gitignore | ||
CONTRIBUTORS.md | ||
LICENCE | ||
README.md | ||
README_CN.md |
README.md
rl_sar
Version Select: ROS-Noetic | ROS2-Foxy/Humble
This repository provides a framework for simulation verification and physical deployment of robot reinforcement learning algorithms, suitable for quadruped robots, wheeled robots, and humanoid robots. "sar" stands for "simulation and real"
feature:
- Support legged_gym based on IsaacGym and IsaacLab based on IsaacSim. Use
framework
to distinguish. - The code has two versions: ROS-Noetic and ROS2-Foxy/Humble
- The code supports both cpp and python, you can find python version in
src/rl_sar/scripts
[!NOTE] If you want to train policy using IsaacLab(IsaacSim), please use robot_lab project.
Preparation
Clone the code
git clone -b ros2 https://github.com/fan-ziqi/rl_sar.git
Dependency
This project uses ros2-foxy
(Ubuntu 20.04) or ros2-humble
(Ubuntu22.04) and requires the installation of the following ROS dependency packages:
sudo apt install ros-$ROS_DISTRO-teleop-twist-keyboard ros-$ROS_DISTRO-ros2-control ros-$ROS_DISTRO-ros2-controllers ros-$ROS_DISTRO-control-toolbox ros-$ROS_DISTRO-robot-state-publisher ros-$ROS_DISTRO-joint-state-publisher-gui ros-$ROS_DISTRO-gazebo-ros2-control ros-$ROS_DISTRO-gazebo-ros-pkgs ros-$ROS_DISTRO-xacro
Download and deploy libtorch
at any location
cd /path/to/your/libtorch
wget https://download.pytorch.org/libtorch/cpu/libtorch-cxx11-abi-shared-with-deps-2.0.1%2Bcpu.zip
unzip libtorch-cxx11-abi-shared-with-deps-2.0.1+cpu.zip -d ./
echo 'export Torch_DIR=/path/to/your/libtorch' >> ~/.bashrc
source ~/.bashrc
Install yaml-cpp
and lcm
. If you are using Ubuntu, you can directly use the package manager for installation:
sudo apt install liblcm-dev libyaml-cpp-dev
You can also use source code installation, click to expand
Install yaml-cpp
git clone https://github.com/jbeder/yaml-cpp.git
cd yaml-cpp && mkdir build && cd build
cmake -DYAML_BUILD_SHARED_LIBS=on .. && make
sudo make install
sudo ldconfig
Install lcm
git clone https://github.com/lcm-proj/lcm.git
cd lcm && mkdir build && cd build
cmake .. && make
sudo make install
sudo ldconfig
Compilation
Compile in the root directory of the project
cd ..
colcon build --merge-install --symlink-install
Running
In the following text, <ROBOT>_<PLATFORM> is used to represent different environments. Currently supported list:
isaacgym | isaacsim | |
---|---|---|
a1 | ✓ | ✓ |
go2 | ✓ |
Before running, copy the trained pt model file to rl_sar/src/rl_sar/models/<ROBOT>_<PLATFORM>
, and configure the parameters in config.yaml
.
Simulation
Open a terminal, launch the gazebo simulation environment
source install/setup.bash
ros2 launch rl_sar gazebo.launch.py rname:=<ROBOT> framework:=<PLATFORM>
(e.g. ros2 launch rl_sar gazebo.launch.py rname:=a1 framework:=isaacgym)
Open a new terminal, launch the control program
source install/setup.bash
(for cpp version) ros2 run rl_sar rl_sim
(for python version) ros2 run rl_sar rl_sim.py
Control:
- W and S controls x-axis, A and D controls yaw, and J and L controls y-axis.
- Press <Space> to sets all control commands to zero.
Real Robots
Unitree A1
Unitree A1 can be connected using both wireless and wired methods:
- Wireless: Connect to the Unitree starting with WIFI broadcasted by the robot (Note: Wireless connection may lead to packet loss, disconnection, or even loss of control, please ensure safety)
- Wired: Use an Ethernet cable to connect any port on the computer and the robot, configure the computer IP as 192.168.123.162, and the gateway as 255.255.255.0
Open a new terminal and start the control program
source install/local.bash
ros2 run rl_sar rl_real_a1
Press the R2 button on the controller to switch the robot to the default standing position, press R1 to switch to RL control mode, and press L2 in any state to switch to the initial lying position. The left stick controls x-axis up and down, controls yaw left and right, and the right stick controls y-axis left and right.
Or press 0 on the keyboard to switch the robot to the default standing position, press P to switch to RL control mode, and press 1 in any state to switch to the initial lying position. WS controls x-axis, AD controls yaw, and JL controls y-axis.
Unitree Go2
- Connect one end of the Ethernet cable to the Go2 robot and the other end to the user's computer. Then, enable USB Ethernet on the computer and configure it. The IP address of the onboard computer on the Go2 robot is 192.168.123.161, so the computer's USB Ethernet address should be set to the same network segment as the robot. For example, enter 192.168.123.222 in the "Address" field ("222" can be replaced with another number).
- Use the
ifconfig
command to find the name of the network interface for the 123 network segment, such asenxf8e43b808e06
. In the following steps, replace<YOUR_NETWORK_INTERFACE>
with the actual network interface name. - Open a new terminal and start the control program:
source install/local.bash ros2 run rl_sar rl_real_go2 <YOUR_NETWORK_INTERFACE>
- Go2 supports both joy and keyboard control, using the same method as mentioned above for A1.
Train the actuator network
Take A1 as an example below
- Uncomment
#define CSV_LOGGER
in the top ofrl_real_a1.cpp
. You can also modify the corresponding part in the simulation program to collect simulation data for testing the training process. - Run the control program, and the program will log all data after execution.
- Stop the control program and start training the actuator network. Note that
rl_sar/src/rl_sar/models/
is omitted before the following paths.ros2 run rl_sar actuator_net.py --mode train --data a1/motor.csv --output a1/motor.pt
- Verify the trained actuator network.
ros2 run rl_sar actuator_net.py --mode play --data a1/motor.csv --output a1/motor.pt
Add Your Robot
In the following text, <ROBOT>_<PLATFORM> is used to represent your robot environment.
- Create a model package named
<ROBOT>_description
in therl_sar/src/robots
directory. Place the robot's URDF file in therl_sar/src/robots/<ROBOT>_description/urdf
directory and name it<ROBOT>.urdf
. Additionally, create a joint configuration file with the namespace<ROBOT>_gazebo
in therl_sar/src/robots/<ROBOT>_description/config
directory. - Place the trained RL model files in the
rl_sar/src/rl_sar/models/<ROBOT>_<PLATFORM>
directory, and create a newconfig.yaml
file in this path. Refer to therl_sar/src/rl_sar/models/a1_isaacgym/config.yaml
file to modify the parameters. - Modify the
forward()
function in the code as needed to adapt to different models.
- If you need to run on the physical robot, modify the file
rl_sar/src/rl_sar/src/rl_real_a1.cpp
as needed.
Contributing
Wholeheartedly welcome contributions from the community to make this framework mature and useful for everyone. These may happen as bug reports, feature requests, or code contributions.
Citation
Please cite the following if you use this code or parts of it:
@software{fan-ziqi2024rl_sar,
author = {fan-ziqi},
title = {rl_sar: Simulation Verification and Physical Deployment of Robot Reinforcement Learning Algorithm.},
url = {https://github.com/fan-ziqi/rl_sar},
year = {2024}
}
Acknowledgements
The project uses some code from the following open-source code repositories:
- unitreerobotics/unitree_guide
- mertgungor/unitree_model_control
- The code in src/rl_sar/scripts/actuator_net.py is modified from scripts/actuator_net in the Improbable-AI/walk-these-ways repository.