GooberToo - Slashdot User

Comment Re:Flash is costly? (Score 5, Informative) 37

by Rei on Friday April 26, 2024 @07:16AM (#64426690) Attached to: Seagate Joins the HDD Price Hike Party, Blames AI for Spike in Demand

Creating the training dataset is the *last* step. I have dozens of TB of raw data which I use to create training datasets that are only a few GB in size. Of which I'll have a large number sitting around at any point in time.

Take a translation task. I start with several hundred gigs of raw data. This inflates to a couple terabytes after I preprocess it into indexed matching pair datasets (for example, if you have an article that's published in N different languages, it becomes (N * N-1) language pairs - so, say, UN, World Bank, EU, etc multilingual document sets greatly inflate). I may have a couple different versions of this preprocessed data sitting around at any point in time. But once I have my indexed matching pair datasets, I'll weighted-sample only a relatively small subset of it - stressing higher-quality data over lower quality and trying to ensure a desired mix of languages.

But what I do is nothing compared to what these companies do. They're working with common crawl. It grows at a rate of 200-300 TB per month. But the vast majority of that isn't going to go into their dataset. It's going to be markup. Inapplicable file types. Duplicates. Junk. On and on. You have to whittle it down to the things that are actually relevant. And in your various processing stages you'll have significant duplication. Indeed, even the raw training files... I don't know about them, but I'm used to working with jsons, and that adds overhead on its own. Then during training there's various duplications created for the various processing stages - tokenization, patching with flash attention, and whatnot.

You also use a lot of disk space for your models. It's not just every version of the foundation you train (and your backups thereof) - and remember that enterprise models are hundreds of billions to trillions of FP16 parameters in their raw states - but especially the finetune. You can make a finetune in like a day or so; these can really add up.

Certainly disk space isn't as big of a cost as your GPUs and power. But it is a meaningful cost. As a hobbyist I use a RAID of 6 20TB drives and one of 2 4TB SSDs. But that's peanuts compared to what people working with common crawl and having hundreds of employees each working on their own training projects will be eating up in an enterprise environment.

Comment Once again... (Score 2) 44

by Rei on Friday April 26, 2024 @06:55AM (#64426664) Attached to: TSMC Unveils 1.6nm Process Technology With Backside Power Delivery

Taiwan reinforces its silicon shield.

Comment Putting numbers into perspective (Score 4, Interesting) 137

by Rei on Friday April 26, 2024 @06:51AM (#64426660) Attached to: Honda To Spend $11 Billion On Four EV Factories In North America

This is all to produce a peak of 240k EVs per year. Production "starts" in 2028. It takes years for a factory to hit full production. Let's be generous and say 2030.

Honda sold 1,3 million vehicles in the US alone last year - let alone all of North America, including both Canada and Mexico. If all those EVs were just for the US it'd be 18% of their sales, but for all of North America, significantly less.

In short, Honda thinks that in 2030 only maybe 1/7th to 1/8th of its North American sales will be EVs. This is a very pessimistic game plan.

Comment Re:Gotta start somewhere (Score 1) 158

by Rei on Thursday April 25, 2024 @03:56PM (#64425228) Attached to: Ford Just Reported a Massive Loss on Every Electric Vehicle It Sold

It's pretty easy to offer consumers a great deal when you're willing to dump wheelbarrows of cash into a furnace to do so.

Comment Re:Gotta start somewhere (Score 5, Informative) 158

by Rei on Thursday April 25, 2024 @11:51AM (#64424310) Attached to: Ford Just Reported a Massive Loss on Every Electric Vehicle It Sold

Ford made the Ford Ranger EV 1998 to 2002, then the Ford Focus Electric from 2011 to 2018 before switching to the Mach-E. They are not "new at it". They're just bad at it.

To be fair, I have a lot more hope for Ford than GM, as Farley seems to actually understand the critical importance of turning things around and the limited timeframes to do so, unlike GM, which still seems to only care about press.

Comment Re:How much is really delayed maintenance? (Score 1) 116

by Rei on Thursday April 25, 2024 @07:48AM (#64423632) Attached to: Updating California's Grid For EVs May Cost Up To $20 Billion

Copper is not "the last mile". It's the last five meters. If that. When people talk about "the grid", they're not talking about the wiring in your walls. Which you don't have to redo anyway for adding an EV. Nobody has to touch, say, your kitchen wiring to add an EV charger.

"The grid" is the wiring leading up to your house. Those conductors are alumium, not copper. Occasionally the SER/SEU cable will occasionally be copper, but even that's generally alumium these days. And that's only to the service connection point (not even to the transformer - to the point of handoff between grid-owned and the homeowner-owned, generally right next to the house), e.g. after the service drop line with overhead service that descends down to the building. The "last mile" is absolutely not copper. Approximately zero percent of modern grid-owned wiring is copper, and even the short customer-owned connection from the drop line into the house is usually alumium.

Grids are not copper. Period. This isn't the year 1890 here.

And no, grid operators don't make money selling power. They make money providing the grid through which power is sold.

I have never seen a single utility that charges a flat grid access fee to residential consumers, anywhere on Earth.

Distinction can be hard to grasp for someone utterly ignorant on the subject

Says a guy who thinks that there's a mile of copper leading up to your house.

Comment Re:How much is really delayed maintenance? (Score 5, Interesting) 116

by Rei on Wednesday April 24, 2024 @08:27PM (#64422864) Attached to: Updating California's Grid For EVs May Cost Up To $20 Billion

The grid is not made of copper. You thought it was? Copper is for home wiring, if that. Up to that point, it's alumium, bundled with steel on major lines for tensile strength. Does it look like copper to you?

As for the article: grid operators don't build out grids on a lark. They do it to sell power, because they make money selling power. If people want to buy more power because they want to charge an EV, then that's more money available for them. EVs are a boon to grid operators. They're almost an ideal load. Most charging done at night, steady loads, readily shiftable and curtailable with incentives, etc. Daytime / fast charging isn't, but that's a minority. And except in areas with a lot of hydro, most regions already have the ample nighttime generation capacity; it's just sitting idle, power potential unsold. In short, EVs can greatly improve their profitability. Which translates to any combiation of three things:

1) More profits
2) A better, more reliable grid
3) Lower rates

* ... depending on the regulations and how competitive of an environment it is.

As for the above article: the study isn't wrong, it's just - beyond the above (huge) problem - it is based on stupid assumptions. Including that there's zero incentives made for people to load shift when their vehicles charge, zero battery buffering to shift loads, and zero change in the distribution of generation resources over the proposed timeframe. All three of these are dumb assumptions.

Also, presenting raw numbers always leads to misleading answers. Let me rephrase their numbers: the cost is $7 to $26 per person per year. The cost of 1 to 5 gallons of gas per year at California prices..

Comment Re:Israel (Score 1) 122

by Rei on Wednesday April 24, 2024 @02:55PM (#64421976) Attached to: Biden Signs TikTok 'Divest or Ban' Bill Into Law

You sure you're responding to the right person?

Comment Re: Humans won't go extinct from climate change (Score 1) 124

by Reziac on Wednesday April 24, 2024 @01:44PM (#64421652) Attached to: Ex-White House Cyber Policy Director: Microsoft is a National Security Risk

Funny thing, Montana is a big grain-producing state, and we have possibly the most unpredictable, and definitely the most absurdly-variable climate in North America.

https://montanakids.com/facts_...

Oh, and we also grow potatoes, but only in very limited areas (potatoes need more predictable conditions), whereas grain is grown here pretty much anywhere the ground is near enough to level.

Comment Re:Israel (Score 2) 122

by Rei on Wednesday April 24, 2024 @01:29PM (#64421588) Attached to: Biden Signs TikTok 'Divest or Ban' Bill Into Law

Funny that to you, "Israel" and "Jews" are synonymous. As if all Jewish people unconditionally support all actions of the state of Israel, even those which are highly controversial within Israel itself.

This false synonymy creates an extremely harmful backlash. Stop doing it.

Comment Re:Titan or Bust! (Score 1) 70

by Rei on Wednesday April 24, 2024 @10:22AM (#64420924) Attached to: NASA Officially Greenlights $3.35 Billion Mission To Saturn's Moon Titan

Ukraine is not free

Give me a list of Ukrainian prime ministers since 2000, and compare it to a list of Russian presidents since 2000 . Thanks in advance.

Even before the conflict it was the poorest and most corrupt country in Europe

This is not even remotely true. Ukraine's Rule of Law Index in 2022 was 0,50; contrast with NATO members Turkey at 0,42 and Hungary at 0,52. And its scores were dragged down by the consequences of the war in Donbas.

with a military second in size in Europe only to Russia (hence the poverty)

Ukraine's percentage of GDP spent before the current invasion was 3,2%, and that was *with* the ongoing Donbas conflict . By contrast, the US, at peace, spends 3,45% of its GDP on the military. For some European contrasts:

Azerbaijan: 4,5%
Armenia: 4,3%
Russia: 4%
Greece: 3,7%

Before the 2014 Russian invasion, Ukraine's percentage of GDP spent on the military was 1,6%.

Comment Re:Terraforming on the same trip (Score 1) 70

by Rei on Wednesday April 24, 2024 @08:49AM (#64420598) Attached to: NASA Officially Greenlights $3.35 Billion Mission To Saturn's Moon Titan

ED: Just saw your second paragraph. But the things you speculate on are not exactly common on Titan, if they even exist on the surface at all (it's an icy crust ,not a rocky one). And either way, it'd be much easier with compounds other than methane.

And no, there doesn't seem to be meaningful amounts of nitrates in the atmosphere at least. You can see a list here. Nitrogen compounds are cyanide and nitrile compounds.

Comment Re:Terraforming on the same trip (Score 1) 70

by Rei on Wednesday April 24, 2024 @08:45AM (#64420584) Attached to: NASA Officially Greenlights $3.35 Billion Mission To Saturn's Moon Titan

Metabolized with what oxidizer?

It's just the opposite - methane on Titan is like nitrogen on Earth; it's things like acetylene and free hydrogen that are the potential energy sources, and to a lesser extent the more common (but less reactive) higher mass alkanes, etc.

The main problem is that LAWKI isn't even remotely compatible with existing in the cryogenic environment of Titan. There are a lot of interesting alternative chemistries, but they require basically redesigning life from scratch. We're simply not up to this task with our current technology.

Comment Re:Amphibious? (Score 1) 70

by Rei on Wednesday April 24, 2024 @08:42AM (#64420576) Attached to: NASA Officially Greenlights $3.35 Billion Mission To Saturn's Moon Titan

Sadly it's being launched to near the equator, not the poles :( The geophysicists won out over the geochemists...

Comment Re:Titan or Bust! (Score 1) 70

by Rei on Wednesday April 24, 2024 @08:40AM (#64420574) Attached to: NASA Officially Greenlights $3.35 Billion Mission To Saturn's Moon Titan

Freedom is great and neglectable, until you very suddenly don't have it.

History has not ended. The world order that makes life nice and comfy for you is not a given into the future.

Slashdot Top Deals