Reinforcement Learning from Human Feedback

Posted by onurkanbkrc 4 hours ago

49 points | 3 comments

klelatti 3 hours ago|

Web version with links, etc:

verdverm 2 hours ago|

Last time I saw Nathan say something about the book, he's actively working on the next version and looking for feedback, check his socials

leggerss 24 minutes ago||

You could say he's also learning from human feedback

iisweetheartii 3 hours ago|

[dead]