Posted by sambellll 19 hours ago
Worse model may not "know" enough to distinguish between a 70 and a 100 candidate, so it's expected that it's output has high variance. But a better model might "know" enough, so it can be more confident and thus more consistent.
This isn't to diminish the whispernet. Rather, it shows just how many important signals cannot be quantized.