
Comment No (Score 2, Insightful) 47

The behavior, known in the research community as sycophancy, stems from how these models are trained: reinforcement learning from human feedback, or RLHF, rewards responses that human evaluators prefer, and humans consistently rate agreeable answers higher than accurate ones.

No, it's because in the training corpus most of the responses to "are you sure" that anyone bothered to record will involve someone being corrected.

Comment Re:Better hope he saved enough... (Score 1) 38

How about all the women who accused Bill Clinton?

You can have Bill Clinton. We don't give a fuck. He was a rapey piece of shit, which many of us have been pointing out; check my posting history. That pales compared to the Trump-Epstein child rape and cannibalism consortium, but still, you can have him too.

Comment Re:This is so incredibly much bullshit (Score 1) 252

It's depressing to me just how many so-called "nerds" around here are little more than shelled out muppets repeating the party line.

You mean the "global warming is a myth" party line deliberately created by Big Oil and spread among the "I'm such an individual I get all my information from youtube videos" flock of fuckheads?
