Forgot your password?
typodupeerror

Comment Re:We need humility, not arrogance (Score 1) 152

The LLM already has knowledge of vulnerabilities in its training data.

It sounds like you're a little behind on the progress in this area. You should try some of the top commercial models out for a few coding analysis tasks and see how they perform and their limitations.

If you're not comfortable using the commercial models, you can install some locally if you have appropriate hardware though they won't be the same quality as the ones from Anthropic of OpenAI

Comment Re:We need humility, not arrogance (Score 1) 152

Ok, maybe it's not a 100% valid opinion for previous LLMs though it's definitely more correct for them.

For some bugs like "When the user clicks the a button the light flashes blue but it's suppose to flash yellow", that's not a vulnerability but there could be a spec or historical reason why it had that behavior which isn't possible to reason from just looking at the code alone

Comment Re:We need humility, not arrogance (Score 1) 152

That's a valid opinion for previous LLM models but more recent ones (especially Anthropic's new model) have larger context windows and better parsing of code which lets them find issues that aren't "simple toy examples with obvious specifications."

You don't need a formal spec to determine that a webpage shouldn't crash the web browser. There are certain vulnerabilities which are "obvious" to determine the program shouldn't be doing that once found.

Now for logical bugs (e.g. the program does a valid action but not the expected one) that aren't vulnerabilities you're right, there's not really any way to determine those without knowing the expected result.

Comment Re:Anyone else tired of that word? (Score 1) 37

Problem is when people started labelling anything made by AI as slop regardless of quality. Ironically even stuff made by humans that is too good is sometimes labeled as slop because people suspect AI was involved.

It's definitely an appropriate word for the large amount of AI generated content being put out on the Internet, but it's also over used in a lot of cases

Comment Re:This is such BS (Score 1) 22

I think you're missing the point here.

If you google "hack the Mexican government" you're not going to get any meaningful results, but if you prompt an advanced LLM to do so then apparently it can deliver results. Claude is like a script kiddie on steroids in this case since it knows all the existing vulnerabilities and tools to exploit them which will do the trick for a lot of targets.

It's lowering the bar for hackers so they don't even have to know much about computer system

Slashdot Top Deals

I am here by the will of the people and I won't leave until I get my raincoat back. - a slogan of the anarchists in Richard Kadrey's "Metrophage"

Working...