Comment Might be interesting (Score 1) 34
Grok-1 doesn't seem like a terrible deal for something halfway between ChatGPT 3.5 and 4. 314B parameters, 8 experts, 2 experts per token... If that translates to something like 70B active parameters per token, mere mortals with lots of RAM should be able to get a few tokens/s on a CPU.
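For anyone who wants to check the napkin math, here is a rough sketch. The only numbers from the comment above are 314B total parameters, 8 experts, and 2 experts per token. Everything else is my assumption: 4-bit quantization (0.5 bytes per parameter), roughly 100 GB/s of memory bandwidth, and the simplification that all weights live in the experts (in practice attention and embedding weights are shared, so the real active count will differ). It also assumes generation is purely memory-bandwidth bound, which is only approximately true.

```python
# Back-of-envelope estimate for running a mixture-of-experts model on a CPU.
# Assumptions (not from the model card): 4-bit weights, 100 GB/s memory
# bandwidth, all parameters split evenly across experts, and decoding speed
# limited only by how fast the active weights can be read from RAM.

def active_params(total_params: float, num_experts: int, experts_per_token: int) -> float:
    """Naive count of parameters touched per token."""
    return total_params * experts_per_token / num_experts


def tokens_per_second(active: float, bytes_per_param: float, bandwidth_bytes_per_s: float) -> float:
    """Upper bound on tokens/s if every active weight is read once per token."""
    return bandwidth_bytes_per_s / (active * bytes_per_param)


TOTAL_PARAMS = 314e9      # from the comment
NUM_EXPERTS = 8           # from the comment
EXPERTS_PER_TOKEN = 2     # from the comment
BYTES_PER_PARAM = 0.5     # assumed: 4-bit quantization
BANDWIDTH = 100e9         # assumed: ~100 GB/s system memory bandwidth

active = active_params(TOTAL_PARAMS, NUM_EXPERTS, EXPERTS_PER_TOKEN)
ram_gb = TOTAL_PARAMS * BYTES_PER_PARAM / 1e9
tps = tokens_per_second(active, BYTES_PER_PARAM, BANDWIDTH)

print(f"active parameters per token: {active / 1e9:.1f}B")
print(f"RAM needed for weights:      {ram_gb:.0f} GB")
print(f"rough tokens/s ceiling:      {tps:.1f}")
```

Under those assumptions the naive split gives a bit under 80B active parameters per token, around 157 GB of RAM just for the weights, and a ceiling of a couple of tokens per second. Change BYTES_PER_PARAM or BANDWIDTH to match your own hardware.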