Posted by onurkanbkrc 4 hours ago

Reinforcement Learning from Human Feedback (arxiv.org)
49 points | 3 comments
klelatti 3 hours ago
Web version with links, etc:

https://rlhfbook.com/

verdverm 2 hours ago
Last time I saw Nathan say something about the book, he was actively working on the next version and looking for feedback. Check his socials.
leggerss 24 minutes ago
You could say he's also learning from human feedback