Posted by cafkafk 15 hours ago
Would love to see the benchmarks if someone actually pulls something like that off.
I use LM studio and qwen3.5 35B - but never figured out if it is swapping or not.
Om am unrelated note, does anyone know a model that can help with this use case: