Forgot your password?
typodupeerror

Comment Distilling allows plausible deniability (Score 2) 51

With progressive layer by layer distillation Apple can make aggressive changes in architecture, all while letting Google take all the blame for the piracy.

I think there is a lot of potential to improve architectures for local, beyond MoE and what Apple "pioneered" with LLM in a Flash (the low rank predictor approach was actually first described in a paper from 2013 they didn't cite). Google's spark transformer for instance is already far more elegant than MoE and low rank predictors, beyond that there is also unexplored potential of forced temporal coherence in the active set.

Only Apple and Tiny AI are likely to truly push sparsity in production. Going beyond MoE with sparsity and being forced to accept low single digit percentage compute utilisation during training on NVIDIA's expensive HBM based GPUs is too counter-intuitive for most researchers to accept, even if they really should.

Comment Re:Why was original post modded ??? (Score 3, Informative) 136

https://www.npr.org/2026/05/28...

"In his application to enter the Senior Executive Service level ranks that RUSHsubmitted to his former U.S. Government employer on October 25, 2018, RUSH stated he was agraduate of the United States Air Force Test Pilot School, and he was the current Director of Testfor a 145-person, 18-aircraft joint Army/Navy weapons test organization, despite his militaryrecords, discussed above, indicating that he separated from the Navy in 2015. In this sameapplication, RUSH stated he had an eleven-year tenure as a Thesis/Dissertation advisor at the Air14.Force Institute of Technology."

LOL.

Comment Re:Imagine This Happening in the USA (Score 1) 27

If the Micron's 1 US memory factory had been in production already, it would have happened. A sudden 10x increase in margins and leverage will get any worker moving.

It will be interesting to see what happens at CXMT ... will Xi put the workers in tiger chairs for their bosses? Probably, but there is enough money on the line they might fight any way.

Comment Re:That kind of thinking brings in new players.. (Score 1) 70

They all say they use NAND like processing. In 3D NAND the full stack of alternating layers is created first, it's literally impossible to use lithography on the internal layers beyond what comes from on top. As far as high resolution goes, that's just the holes.

https://semiengineering.com/3d...

There's an old paper which title catches the essence of 3D NAND well ... "3D memory: etch is the new litho".

PS. I'm ignoring the logic, but that's not repeated with the cell layers.

Comment Re:That kind of thinking brings in new players.. (Score 1) 70

NAND 3D flash has only one high resolution exposure, for the entire stack of storage layers. The per layer litho for the staircase contacts is low resolution. Cells which can be formed in similar manner have been proposed for DRAM.

https://ieeexplore.ieee.org/ab...
https://www.niar.org.tw/en/xmd...
https://neosemic.com/neo-semic...

I have my doubts it can actually work reliably, would be nice though.

Comment Re:Cartel (Score 1) 70

US has proposed laws to not only ban ASML from selling China DUV scanners but even servicing what's already there. Would be the end of China's memory and flash fabs.

Though that would likely push China into a full on trade war, catapult flash prices into the stratosphere, and hell might even get EU to get into a tradewar with the US. EU is cowardly, but even they will have a breaking point. It comes on top of the tariffs, alternative forms of protectionism and the ICC shenanigans.

Slashdot Top Deals

fortune: not found

Working...