Top
Best
New

Posted by ash_at_hny 16 hours ago

Reinforcement Learning from Human Feedback (RLHF) in Notebooks(github.com)
71 points | 1 comments
kcdom1000f 13 hours ago|
Hl
careful_ai 10 hours ago||
[dead]
bobvylan 9 hours ago|
[dead]