Comment An Anecdote (Score 1) 90
Well I'm not quite sure either but let me tell you what I experienced on an older Intel iMac Pro - (2017).
I loaded up the largest model possible just to see what it would do... I entered some initial question, I forget what, and then got about 10-20 minutes of a "thinking" message.
Then, I got... an "H".
A few minutes later... an "I".
Yeah it too about 30 minutes to begin a message with "Hi", I gave up after a few hours.
So 20 tokens a second is sounding pretty good compared with that!