Posted by cafkafk 14 hours ago
Model name: Intel(R) Xeon(R) CPU E3-1265L V2 @ 2.50GHz
Mainboard Product Name: P8Z77 WS
GPU 05:00.0 VGA compatible controller: NVIDIA Corporation AD106 [GeForce RTX 4060 Ti 16GB] (rev a1)
05:00.1 Audio device: NVIDIA Corporation AD106M High Definition Audio Controller (rev a1)
Memory: 32GBThis works.
https://pcpartpicker.com/products/motherboard/#s=20028,20029...
(He has a fully maxed out “last Intel” Mac Pro and laments the lack of replacement).
Plus many boards also support CXL for RAM expansion over PCI 5!
Source: building a hybrid inference business for regulated industry workloads.
Totally just vibes based, I think it goes up to 20+ tps when it's not under load (and that's me trying to be conservative). For context, reading speed at 250 wpm would be around 5 to 6 tokens per second.