H
Hacker News
Top
Best
New
Posted by panthertrax 14 hours ago
Reward models for LMs are fundamentally broken
(twitter.com)
2 points
|
0 comments