Comment Re: What is a "harmful response?" (Score 1) 59

by ljw1004 on Sunday June 28, 2026 @03:16PM (#66214420) Attached to: How a Seemingly Harmless Image Can Jailbreak Vision-Language AI Models

I don't think "continuous" means what you think it means. The reason you can do this is because the models are continuous.

One of the defining papers in this field is the 2013 paper "Intriguing properties of neural networks" by Szegedy et al. In their own words from the abstract, "we find that deep neural networks learn input-output mappings that are fairly discontinuous to a significant extent".

I'm using the word "continuous" in the same sense as them. (Perhaps it does indeed mean what I think it means...)

Comment Re: What is a "harmful response?" (Score 2) 59

by ljw1004 on Sunday June 28, 2026 @08:32AM (#66213938) Attached to: How a Seemingly Harmless Image Can Jailbreak Vision-Language AI Models

In image processing like this article is talking about, the classic example of a harmful response is that your car's camera sees a "speed limit 30" sign, but a small sticker on it makes the image processor believe it saw a "speed limit 70" sign.

(This is an actual demonstrated attack. It means that pranksters could cripple self-driving.)

The thing about these image classifiers is that they're not "continuous". You can make it see a stop sign as a right-of-way sign, or a green light.
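Here's a minimal sketch of that effect. It uses a toy linear scorer that I made up for illustration, not a real vision model: the weights, the pixel values and the eps of 0.02 are all assumptions. Nudging every pixel by at most 2% of its range, each in the worst-case direction, is enough to flip the label.

```python
# Toy linear "classifier" over a flattened image. Hypothetical, for illustration only.

def score(weights, pixels):
    """Positive score means label A (say "speed limit 30"); negative means label B."""
    return sum(w * p for w, p in zip(weights, pixels))

def perturb(weights, pixels, eps):
    """Shift each pixel by exactly eps in whichever direction lowers the score."""
    return [p - eps * (1 if w > 0 else -1) for w, p in zip(weights, pixels)]

N = 10_000                                             # number of pixels
weights = [1.0 if i % 2 == 0 else -1.0 for i in range(N)]
pixels = [0.5 + 0.01 * w for w in weights]             # mid-grey image, leaning to label A
adversarial = perturb(weights, pixels, eps=0.02)

original_score = score(weights, pixels)
adversarial_score = score(weights, adversarial)
largest_change = max(abs(a - p) for a, p in zip(adversarial, pixels))
```

No single pixel moves by more than 0.02 on a 0..1 scale, yet the sign of the score flips, because ten thousand tiny nudges all push the same way. Real attacks on deep networks are more involved, but this is the basic flavor.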

Comment Re: Lack of math skills? (Score 1) 110

by ljw1004 on Sunday June 07, 2026 @01:43PM (#66179466) Attached to: Failing CS Grades Soar At UC Berkeley As Professors See Greater AI Usage

[invariant-heavy and proof-heavy guidance to the AI] How do you do that?

My main AGENTS.md has ten lines about the most important coding principles:

- Prefer functional-style code, where variables are immutable "const", there's almost no "if/else" branching, and most functions are side-effect free.
- Code should have comments, and functions should have docstrings. The best comments are ones that introduce invariants, or prove that invariants are being upheld, or indicate which invariants the code relies upon. ...

I am adamant about clean engineering. What I look for:
- Invariants are the best way to document all aspects of code. These include code invariants (stating what assumptions a function makes about shared data, and how it upholds them), and architecture invariants (for instance the main index.js never touches state except through component accessors). ...

You must document *meaning* of every field, and also enums and disjoint type fields.
- "Meaning" says briefly what the field/enum represents. From a well-written meaning, a smart reader will be able to deduce all the invariants around this field/enum, and deduce how it will be used in the code.
- It is hard work to distill a good meaning! You must put considerable effort into it. ...

The instruction on "meaning" ended up carrying a lot of weight with the AI. It adopted the habit of putting a comment starting with "// Meaning: " on every single field and function, and they're honestly, genuinely good ones! Single-line sentences on fields that convey a lot.
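To give a flavor of the style, here's a small made-up sketch in Python (so the comments start with "# Meaning:" rather than "// Meaning:"). The Cart type and its functions are hypothetical, not taken from my codebase or from the AI's output.

```python
from dataclasses import dataclass, replace

@dataclass(frozen=True)  # immutable, in the spirit of "const" everywhere
class Cart:
    # Meaning: prices, in cents, of the items the shopper has added and not yet removed.
    item_prices_cents: tuple[int, ...]
    # Meaning: percentage taken off the subtotal at checkout; always within 0..100.
    discount_percent: int

def add_item(cart: Cart, price_cents: int) -> Cart:
    """Return a new cart with one more item.

    Relies on: price_cents >= 0.
    Upholds: the input cart is never mutated; discount_percent is unchanged.
    """
    return replace(cart, item_prices_cents=cart.item_prices_cents + (price_cents,))

def total_cents(cart: Cart) -> int:
    """Return the amount owed after the discount.

    Invariant: 0 <= result <= subtotal, because discount_percent is within 0..100
    and every price is non-negative.
    """
    subtotal = sum(cart.item_prices_cents)
    return subtotal * (100 - cart.discount_percent) // 100
```

From the two "Meaning" lines alone a reader can already guess the invariants that the docstrings then spell out, which is the point of the exercise.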

Separately, I have a LEARNINGS.md file which I have the AI auto-update every time it gets course corrected by me. Over the first two weeks there were a lot of course corrections, but now there are only a few a day. The file ended up carrying my senior engineer wisdom, more or less, the kind of things I normally mentor to junior developers on the team over several years. Here's an extract: https://gist.github.com/ljw100...

Comment Re: Lack of math skills? (Score 1) 110

by ljw1004 on Sunday June 07, 2026 @04:13AM (#66178818) Attached to: Failing CS Grades Soar At UC Berkeley As Professors See Greater AI Usage

My CS degree had a lot of theorem proofs in it, invariants, that kind of thing. I've always had the habit of aiming to prove my code correct under all possible circumstances. Usually not a formal proof, but using the same skeleton as a formal proof would.

It got me a job on the C# language design team (when I tried to prove an algorithm correct, couldn't, discovered a counter-proof that the runtime had a flaw).

As I mentor junior devs and review their code, I'm always telling them to reason about their invariants better and document them.

Now in the age of AI, I find that invariant-heavy and proof-heavy guidance to the AI ends up getting its work done quicker and at higher quality. OpenAI mentioned the same thing in a blog post in February.

Sure, there are many paths to professional success and engineering excellence that don't involve this kind of CS heavy approach. But, there are many that do...

Comment Re: perceived (Score 1) 240

by ljw1004 on Monday May 25, 2026 @10:42AM (#66159700) Attached to: Will Big Tech Layoffs Bring a Culture Shift to Anxiety and Job Insecurity?

I think a tool that lets one person do the work of 20 will result in 17 people being laid off, as the company triples its rate of work. (I can see triple being feasible and achievable in the software industry, but maybe not more)

Comment Re: Hell hath frozen over! (Score 1) 102

by ljw1004 on Friday May 08, 2026 @01:10PM (#66134328) Attached to: First Segment of the Fehmarnbelt Tunnel Is In Place

Oh, and a "train" is a bit like 100 cars back to back.

Comment Re:Prices are sticky (Score 5, Informative) 103

by ljw1004 on Thursday May 07, 2026 @07:37PM (#66133228) Attached to: CEOs Want Tariff Refunds As Earnings Take a Hit

Anyone expecting corporations to not try to make a profit and extract maximum value for their shareholders ignore that that's their fiduciary duty.

"this belief is utterly false. To quote the U.S. Supreme Court opinion in the recent Hobby Lobby case: 'Modern corporate law does not require for-profit corporations to pursue profit at the expense of everything else, and many do not.'"

https://www.nytimes.com/roomfo...

"We ... show that [the Shareholder Primacy Norm] is not a legal requirement, at least under the guise of shareholder value maximization. This is in contrast to the common assertion that managers are legally constrained from addressing corporate social responsibility issues if doing so would be inconsistent with the economic interests of shareholders."

https://papers.ssrn.com/sol3/p...

Comment Re:Huh? (Score 2) 22

by ljw1004 on Wednesday May 06, 2026 @01:37PM (#66130630) Attached to: Microsoft Gives Up On Xbox Copilot AI

> Am I the only one that can't imagine any possible value an AI assistant would bring to a game?

I use AI assistants lots when playing games!

At the moment it's Minecraft. I want to figure how to build something, e.g. a golem farm. I look for tutorials online but (1) they're all videos which I hate watching, (2) they're all hyper-specific and concrete, "place this block here then that block there", but what I want to understand are the foundational principles so I can know how to adapt the golem farm to my own purposes -- what are the mechanics, how do they spawn, how does water flow, what is the SOLUTION SPACE of possibilities.

Gemini AI has been really good at this kind of thing.

The other time is when I get stuck, or want advice on how to make a character build to achieve a certain end. Once again the online advice is typically in the form of "walkthroughs", do step 1 then step 2 then step 3, in other words just one possible way to play the game, and it's too easy to accidentally read too far and spoil the rest of it. I don't want that. I like the feeling of openness and possibilities. I again ask Gemini, and it gives me advice on just the particular bit I'm stuck on, and is better at showing for me the available options.

Comment Re: modernized to C99, then unmodernized to using (Score 4, Informative) 46

by ljw1004 on Monday May 04, 2026 @04:22AM (#66126648) Attached to: NetHack 5.0 Released

Lua remains the commonest choice today for games to offer scripting/modding. It's pretty much the industry standard. (Outside C# for Unity).

Comment Re: While they are at it ... (Score 2) 33

by ljw1004 on Saturday February 07, 2026 @01:44PM (#65974740) Attached to: New Bill in New York Would Require Disclaimers on AI-Generated News Content

Fox News is just about always truthful. You just have to watch out for the tricks they use (on 95%+ of their stories)...

(1) non-representative selection. Headline "illegal immigrant murders local mother", which is true in this case, but they don't report the other 99 murders that weren't by immigrants, and don't report a general trend of immigrants causing less crime overall per capita. (I made up this specific example to illustrate their trick)

(2) report quotes: headline "Biden's senility was covered up, says person". They are 100% factually reporting that the person did indeed say this.

In both cases the reader is left with an untrue impression despite the stories containing only truth. It's because it's not the whole truth.

Comment This article was AI generated slop (Score 2) 11

by ljw1004 on Monday November 03, 2025 @04:26PM (#65770700) Attached to: arXiv Changes Rules After Getting Spammed With AI-Generated 'Research' Papers

The irony is, this article itself was AI-generated slop with ridiculous duplication. Maybe a low-effort AI-assisted piece by an author who couldn't be bothered.

Comment Re:Fear is the appropriate response. (Score 1) 89

by ljw1004 on Friday August 22, 2025 @02:55PM (#65608550) Attached to: KPMG Wrote 100-Page Prompt To Build Agentic TaxBot

The hallucination problem _cannot_ be fixed. It is a fundamental part of the mathematical model.

I think it can. I've been working on getting an LLM (Claude Sonnet 3.7) to add missing type annotations to Python code. When I naively ask it "please add types" then, like you said, it has about a 60% success rate and 40% hallucination rate, as measured by "would an expert human have come up with the same type annotations, and did they pass the typechecker".

But when I have a much more careful use of the LLM, micromanaging what sub-tasks it does, then it has a 70% success rate, and 30% rate of declining because it didn't have confidence to come up with an answer. Effectively there were no more hallucinations. (I got these numbers by spot-checking 200 cases).

So I think hallucination can be solved for some tasks, by the right kind of task-specific micromanagement and feedback loops.
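The general shape of such a feedback loop can be sketched like this. It is a simplified illustration rather than my actual pipeline: the function name, the max_attempts limit and both injected callables are hypothetical stand-ins (one for the LLM's proposals, one for running a real typechecker).

```python
from typing import Callable, Iterable, Optional

def annotate_or_decline(
    candidates: Iterable[str],
    passes_typecheck: Callable[[str], bool],
    max_attempts: int = 3,
) -> Optional[str]:
    """Return the first proposed annotation that passes verification, else None.

    Invariant: any non-None result has been accepted by passes_typecheck,
    so an unverified guess is never returned.
    """
    for attempt, candidate in enumerate(candidates):
        if attempt >= max_attempts:
            break  # out of budget: stop trying
        if passes_typecheck(candidate):
            return candidate
    return None  # decline instead of guessing

# Stand-ins: a fixed list plays the LLM, a lambda plays the typechecker.
accepted = annotate_or_decline(["lst", "list[int]"], lambda c: c == "list[int]")
declined = annotate_or_decline(["Any", "object"], lambda c: False)
```

The design choice is that "no answer" is a valid outcome. That's what turns would-be hallucinations into declines.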

Comment Re: Where does all this money come from? (Score 1) 19

by ljw1004 on Thursday August 07, 2025 @03:45PM (#65573506) Attached to: OpenAI Pays Bonuses Ranging Up To Millions of Dollars To 1,000 Researchers, Engineers

OpenAI has $12bn annual revenue, about 3% that of Apple, which is about $3 million per employee per year (compared to $2 million per employee per year at Apple).

I think OpenAI has a huge amount of growth potential even just from predictable growth over the next several years, even if steep changes towards AGI don't come.

Comment Re:why (Score 1) 70

by ljw1004 on Wednesday June 25, 2025 @02:43PM (#65475750) Attached to: HDMI 2.2 Finalized with 96 GB/s Bandwidth, 16K Resolution Support

All good in theory, except that you likely need something like a 200" TV so actually tell the difference between 8k and 16k.

Like I said, I figured 8k would be enough resolution for soccer. As for 16k, I imagine that something with bandwidth for 16k would translate that bandwidth into a higher frame rate for 8k (16k has four times the pixels of 8k), which would be ideal for soccer.

[Lawrence of Arabia] Let me guess, you are watching these classics at 1080p, or at best 4k.

I watched Lawrence of Arabia on a Cinerama screen. It was breathtaking. I expect that the higher resolutions described here will help more places (like movie theaters) display higher quality prints. I suspect they'll open up new avenues like fake windows or full-wall screens in residences.

Comment Re:why (Score 2) 70

by ljw1004 on Wednesday June 25, 2025 @01:14PM (#65475442) Attached to: HDMI 2.2 Finalized with 96 GB/s Bandwidth, 16K Resolution Support

Do you watch soccer? 4k resolution means a player's head is about 14 pixels high, not enough to make out much beyond a blob of color; their jersey is 60 pixels high, enough to make out the number but not much more. Doubling the vertical resolution (i.e. going to 8k) would likely be enough to let you make out similar detail to what you'd see in real life. (Frame rate is another issue: HDMI 2.0 allows 4k at 60 Hz which is too slow when panning in a soccer game; HDMI 2.1 allows 4k at 120 Hz which is probably enough). I think that 16k is probably the right bandwidth to get soccer looking good.
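For a rough sense of how resolution and frame rate trade off against bandwidth, here's a back-of-envelope sketch. It counts active pixels only, and it assumes 24 bits per pixel with no blanking intervals, link encoding overhead or compression, so real HDMI link rates will differ. The function name and the MODES table are mine, for illustration.

```python
def raw_gbit_per_s(width, height, fps, bits_per_pixel=24):
    """Uncompressed active-pixel data rate in Gbit/s (ignores blanking and encoding overhead)."""
    return width * height * fps * bits_per_pixel / 1e9

# Common consumer pixel dimensions for each nominal resolution.
MODES = {"4k": (3840, 2160), "8k": (7680, 4320), "16k": (15360, 8640)}

rate_4k60 = raw_gbit_per_s(*MODES["4k"], fps=60)
rate_4k120 = raw_gbit_per_s(*MODES["4k"], fps=120)
rate_8k60 = raw_gbit_per_s(*MODES["8k"], fps=60)
rate_16k60 = raw_gbit_per_s(*MODES["16k"], fps=60)
```

Each step up (4k to 8k, 8k to 16k) quadruples the pixel count, so at a fixed raw data rate you can trade one step of resolution for four times the frame rate.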

Do you do VR? 4k per eye isn't good enough for VR yet. It's possible that 16k will be, but we might still need more.

Do you watch the gorgeous film classics like Lawrence of Arabia? One of the (many) things that make it look great is that it was shot on 65mm, equivalent to about 12k resolution.
