Commit Graph

2 Commits

Author SHA1 Message Date
Eugene Mironov 55c414da09 [Port HIL_SERL] Final fixes for the Reward Classifier (#598) 2025-03-24 09:09:38 +00:00
Yoel 97b8214cd2 Reward classifier and training (#528) 2025-03-24 09:04:12 +00:00
Co-authored-by: Daniel Ritchie <daniel@brainwavecollective.ai>
Co-authored-by: resolver101757 <kelster101757@hotmail.com>
Co-authored-by: Jannik Grothusen <56967823+J4nn1K@users.noreply.github.com>
Co-authored-by: Remi <re.cadene@gmail.com>
Co-authored-by: Michel Aractingi <michel.aractingi@huggingface.co>