Become a fan of Slashdot on Facebook

 



Forgot your password?
typodupeerror

Comment Re:It just shows (Score 1) 63

I chose RC boat because some people are using this competing at "stupid human trick" as an example of intrinsic proof of LLMs being able to supersede humans.

In the scenario of an olympic swimming competition, an autonomous boat vs a manned boat would show no difference to each other, both would compete the task much better than a human. It's a useless test to measure general utility. Just like a person swimming a 1500 meter distance is not really a useful indicator on its own of how useful they are. These Math Olympiads are similar in that they are not particularly indicative of people being useful. The same ways that we would stress a human in impressive ways does not mean that a computer coming at it from a different approach should be considered to have broadly superseded the humans.

Yes, a real boat is valuable and LLM can be valuable when utilized correctly, but in the face of exaggerated hype some pessimistic reality check is called for to balance expectations.

Comment LLMs can't think and they don't need to (Score 2) 91

LLMs have a great deal of utility and extend the reach of computing to a fair amount of scope that was formerly out of reach to computing, but they don't "think" and the branding of the "reasoning" models is marketing, not substantive.

The best evidence is reviewing so-called "reasoning chains" and how the mistakes behave.

Mistakes are certainly plausible in "true thinking", but the way they interact with the rest of the "chain" is frequently telling. It flubs a "step" in the reasoning and if it were actual reasoning, that should propagate to the rest of the chain. However when a mistake is made in the chain, it's often isolated and the "next step" is written as if the previous step said a correct thing, without ever needing to "correct" itself or otherwise recognize the error. What has been found is that if you have it generate more content and dispose of designated "intermediate" content you have a better result, and the intermediate throwaway content certainly looks like what a thought process may look like, but ultimately it's just more prose and mistakes in the content continue to have an interesting behavior of isolation rather than contaminating the rest of an otherwise ok result.

Comment Re:No"AI" cannot think (Score 1) 91

When the model is used for inference, yes. But I assume he was speaking to the awkwardness of training. Take a machine vision that has never been trained on dogs and cats, feed it a dozen labeled images of cats and dogs to retrain it to add dog/cat recognition. Then try to do inference on that model and it will be utterly useless still for dog/cat recognition. Take a model trained on normal images. Then have it try recognition on a fisheye lense. It will fail because it has no idea. You might hope to retrain it to recognize the fisheye distortion generically, but generally that won't work and you have to retrain it to catch the fisheye variant of everything you want. After retraining, if you just slap the fisheye variant into a normal picture, the model is likely to be oblivious to the anomaly.

Don't know if this is an argument about 'thinking', but it is certainly a difference in 'learning', that AI models need to consume way way more than a human to start to give useful results. A model training to operate a car is still oddly dodgy even after consuming more car operating hours than a human will consume in a lifetime. In areas where we have enough training data to feed, this is fine, but it's certainly different than a human learning, which can do something better to extend less training to better results.

Comment Re:It just shows (Score 1) 63

I'm not in denial, the LLMs and other forms of AI have utility, but expectations have to be mitigated.

Was in a discussion with a software executive a couple weeks back who said he fully anticipates he can lay off every one of his software developers and testers in the next year and only have to retain the 'important' people: the executives and sales people.

People see articles like this show how LLMs enable computing to reach another tier of 'stupid human tricks', which is certainly novel, but people overextend exactly what this means. We put too much focus on whether the LLMs can navigate tests explicitly designed to be graded and passable for humans, with effort and imagine that means they are able to chart the less well trodden set of problems that have an unknown or maybe no solution at all.

There's a lot of stuff LLMs can do that computing wasn't able to do before, and that does potentially speak to a great deal of the things humans do today, but we have to temper the wild expectations people are getting.

Comment Re:Or how about this novel solution? (Score 3, Insightful) 58

It actually is that hard, sometimes.

Often, jobs have a culture which have become structured so that you must be responsive, if not 24/7, then at the least during your work hours, to IMs. Step away from your desk for 30m to eat lunch or whatever? People are going to start calling you in many of these (IMO toxic) environments.

And frankly, it's required for some jobs (like in support roles). You've got to be available and IM is used for coordinating on the ground.

I've told people I am simply not available on IM platforms on my phone, I won't even install them if I can avoid it. This has caused some backlash, admittedly, but it's sanity worth preserving. If it's important, think it out a bit more and send me an email.

There's no good solution for this, unfortunately, particularly when everyone's set on using Slack for everything.

Comment Re:It just shows (Score 1) 63

The point is we have a myriad of "tests that are hard for humans, but don't necessarily translate to anything vaguely useful". In academics, a lot of tests are only demanding of reasoning ability because the human has limited memory. Computers short on actual "reasoning" largely make up for it by having just mind boggling amounts of something more akin to recall than reasoning (it's something a bit weirder, but as far as analogies go, recall is closer).

It's kind of like bragging that your RC boat could get a gold medal in the Olympic 1500 meter freestyle. It didn't complete the challenge in the same way, and that boat would be unable to, for example, save someone that is about to drown, because the boat can just go places, it can't do what that human swimmer could do. That person swimming 1500m itself is a useless feat of interest, and not really directly useful in and of itself.

Comment Re:I remember what I was relieved... (Score 1) 265

There's more to be relieved about. The US accepted weapons grade fissionables and exchanged it at 1 to 7 for reactor grade. At the time, the mushy brains yelled and screamed that the US was "Giving away Uranium!". To be fair, the reporting that got them screaming did not mention the fact it involved trading in weapons grade (stuff that goes boom) for reactor grade (stuff that hums). Of course the wingnut media didn't report that - it wouldn't make their base scream in horror.

Comment Re: More things wrong with the world. (Score 1) 80

The commenter clearly seemed to think the world was going to be supremely unfair to the CEO (turns out 'exec' is ambiguous, as the man, his wife, the mistress, and the mistress' husband are all executives one place or another). You said the exec deserves to lose because of his actions, which seems to be inconsistent. The commenter's stance is based on his blatant assumption that the wife was not earning money and the mistress was just some gold digger, and that even if the wife wasn't earning money, that if the split happens it's unfair for her to get a cut of the CEOs wealth that he earned.

The assertions of misogyny are because he filled in the gaps he didn't know with assumptions consistent with negative stereotypes of women in these situations. He jumped right to the fiction of the struggling man paying huge alimony to some indolent ex-wife living a life of luxury. That the mistress was only in it for gold
digging.

Comment Re:More things wrong with the world. (Score 1) 80

You seem to have just been hit with the headlines and manufactured a scenario where he is a rich guy married to a stay at home wife, with a gold digging mistress.

My spouse was interested enough to bother to dig in and the reality is that the CEO, the wife, the mistress, and the mistress' husband are all four rich with income, so alimony is likely not even a factor. Similarly, the assets being split is unlikely to be lopsided.

From what I've seen in actual life, that all seems to be a rich person trope, and an exaggeration. Those I've known with modest lifestyles that get divorced seem not to have encountered a whole lot of financial duress due to that (maybe child support, but not the wife). I was at a business lunch where three people started bemoaning this as if it were true, that their former spouses are just draining them of all their cash. However, one of them had just been talking about his brand new BMW M5 that his 'lame' former wife would never let him buy and another chimed in with the same experience, albeit with a more humble Kia Stinger. Broadly they all seemed to be doing quite well and the wife would have otherwise been stuck high and dry largely at their "man of their house" mindset that didn't have her earn an income, which is fine, but expect them to be able to use some of your income even after the relationship falls apart.

Comment Re:Too slow, they're already past that. (Score 0) 24

Unless YOUR DNA turned up at a crime scene,

Many times the initial match is made from a consanguinity/divergence comparison. Then the cops come banging at the door, then the lawyers get involved, the expenses mount, and even when you're found innocent and not a match, you still have the legal bills, the loss of reputation because you are a "person of interest", maybe you lost your job when they put you in jail pending trial and you can't afford bail (remember that bail charges are a fee and you don't get them back, ever) and the strife and stress of being victimized by a system that is less "justice" and more "law".

While some advertise that their tests are accurate out to fourth and fifth cousin, second cousins share only about 4% of their DNA with each other. Your life could be turned topsy turvy over a less than 4% finding.

The only solution I see other than not permitting arrests based solely on consanguinity (direct match to actual crime scene sample is another matter - that should be allowed of course) is to require every birth certificate have a DNA test. And recall - not everyone gets a birth certificate. That would tend to also (almost) solve another problem: Convictions for rape and the appropriate person charged child support.

Slashdot Top Deals

The last thing one knows in constructing a work is what to put first. -- Blaise Pascal

Working...