The smallest (and only) open weight model that gets Opus or Sonnet level coding performance is MiniMax M2.5, and you need about 512GB of VRAM for that model (with enough room for input tokens). At 128GB you are looking at Opus 4.0 / Haiku 4.5 level models like Qwen 3.5 122B-A1 at Q4 or Qwen3-Coder-Next 80B-A3B at Q8.
I think it's likely we will have small language models that specifically target coding that are at Opus 4.6 quality on 128-256GB of VRAM in the next couple years, I don't think we are there yet.
It is illegal* to ask if candidates are married.
It is illegal* to ask if candidates have children.
It is illegal* to ask if candidates live with their parents.
* In America.
AI-generated code is just this generation's version of copying code from the web or from another part of the codebase. Sometimes that person understood the code fully, and sometimes they just checked to see if the output matched what they expected.
The only uniquely dangerous thing about this recent iteration of that problem is the massive scale.
The CEOs of these companies are trying to justify inflated stock prices that were high based on the expectation of future growth.
No, CEOs are trying to show their board, investors, and activist investors that they have a plan for how to take advantage of AI and can at least keep up with their competitors use of AI, if not surpass them. I work at a large enterprise (close to 50k employees) and VPs are being told that they need to find ways for AI to have an impact on their department or their leaders will find someone who can. If it isn't happening fast enough consultants are brought in to take over their department's transformative roadmap and leaders who can't keep up are relegated to being SMEs until they are eventually replaced. I'm not in the room when that message is given, but I've seen the rapid shift of VPs who were raising alarms nearly immediately turn into AI cheerleaders.
If you work for a publicly traded or VC backed company I assure you your CEO does not have a choice on whether to jump on the AI bandwagon. That's not how hype driven bubbles work.
The moving cursor writes, and having written, blinks on.