The summary seems wrong: the Qwen-2.5-max blog post compares against V3, not R1. This Qwen model doesn't appear to be trained to reason, unless I missed something. It's just a traditional LLM base model.
No, only that people can't read, which is nothing new. This is the literal quote from the DeepSeek-V3 paper: "Note that the aforementioned costs include only the official training of DeepSeek-V3, excluding the costs associated with prior research and ablation experiments on architectures, algorithms, or data."
In other words, this is just what DeepSeek has been saying from the very beginning in its public tech report.
The vast majority of the world does not need or even want a "network is the computer" philosophy.
Umm... the most-used apps today are exactly that - remote web apps. There's not really that big a conceptual difference between streaming HTML/CSS/JS and streaming X11 commands.
Try that on the Earth sometime.
The main point is to save time on porting, from OS-specific APIs to maintaining constantly breaking cross-compiling chains for OSes you never actually use personally.
Exactly, and N26 and Revolut are getting pretty big in Europe. They are app-only, with literally no physical branches. (Revolut isn't quite a bank, just a credit card, though that's often enough; N26 is a real one.) It turns out that banking can be quite simple, a breeze with a well-designed app, and completely remote.
A meeting is an event at which the minutes are kept and the hours are lost.