Top
Best
New

Posted by ash_at_hny 7/6/2025

Reinforcement Learning from Human Feedback (RLHF) in Notebooks(github.com)
72 points | 1 comments
kcdom1000f 7/6/2025|
Hl
careful_ai 7/6/2025||
[dead]
bobvylan 7/6/2025|
[dead]