My years-old M1 macbook with 16GB of ram runs them just fine. Several Geforce 40...

behnamoh · on Dec 9, 2023

1. I wouldn't consider Mac Studio ($7,000) a customer product.

2. Yes, and my MBP M1 Pro can run quantized 34b models. My point was that when you do MoE, memory requirements suddenly become too challenging. A 7b Q8 is roughly 7GB (7b parameters × 8 bits each). But 8x of that would be 56GB, and all of that must be in memory to run.

jart · on Dec 10, 2023

Why? $7k for a Mac Studio isn't much if we consider the original IBM PC adjusted for inflation cost $6k.