Commit Graph

2 Commits

Author SHA1 Message Date
Eugene Mironov f2e0426af0 [Port HIL_SERL] Final fixes for the Reward Classifier (#598) 2025-03-24 13:40:47 +01:00
Yoel 0ebdae8a40 Reward classifier and training (#528)
Co-authored-by: Daniel Ritchie <daniel@brainwavecollective.ai>
Co-authored-by: resolver101757 <kelster101757@hotmail.com>
Co-authored-by: Jannik Grothusen <56967823+J4nn1K@users.noreply.github.com>
Co-authored-by: Remi <re.cadene@gmail.com>
Co-authored-by: Michel Aractingi <michel.aractingi@huggingface.co>
2025-03-24 13:20:43 +01:00