- Updated SACPolicy to conditionally compute losses for grasp critic based on num_discrete_actions. - Simplified forward method to return loss outputs as a dictionary for better clarity. - Adjusted learner_server to handle both main and grasp critic losses during training. - Ensured optimizers are created conditionally for grasp critic based on configuration settings. |
||
---|---|---|
.. | ||
common | ||
configs | ||
scripts | ||
templates | ||
__init__.py | ||
__version__.py |