
IBM's New Mainframe 7nm CPU Telum: 16 Cores At 5GHz, Virtual L3 and L4 Cache (arstechnica.com) 90
Long-time Slashdot reader UnknowingFool writes:
Last week IBM announced their next generation mainframe CPU Telum. Manufactured by Samsung's 7nm node, each Telum processor has 8 cores with each core running at a base 5GHz. Two processors are combined in a package similar to AMD's chiplet design. A drawer in each mainframe can hold 4 packages (sockets), and the mainframe can hold 4 drawers for combined 256 cores.
Different from previous generations, there is no dedicated L3 or L4 cache. Instead each core has a 32MB L2 cache that can pool to become a 256MB L3 "virtual" cache on the same processor or 2GB L4 "virtual" cache on the same drawer. Also included to help with AI is a on-die but not on-core inference accelerator running at 6TFLOPS using Intel's AVX-512 to communicate with the cores.
Different from previous generations, there is no dedicated L3 or L4 cache. Instead each core has a 32MB L2 cache that can pool to become a 256MB L3 "virtual" cache on the same processor or 2GB L4 "virtual" cache on the same drawer. Also included to help with AI is a on-die but not on-core inference accelerator running at 6TFLOPS using Intel's AVX-512 to communicate with the cores.
"me too" (Score:1)
Re:"me too" (Score:5, Insightful)
Mainframes aren't designed to serve the same load profile as personal computers. They are designed to handle e.g. 200 channels of input requiring database retrieval, decimal arithmetical calculations, database storage, and output with multiple levels of error correction every second of the day, 365 days year, with 2-3 years between maintenance outages. A processor designed for that environment probably won't run Photoshop or the latest 3D game very well.
Re: (Score:1)
Comment removed (Score:5, Informative)
Re:"me too" (Score:5, Informative)
The predecessor IBM z15 chip was designed to enable industry-leading seven nines availability for IBM Z and LinuxONE systems. Telum is engineered to further improve upon availability with key innovations including a redesigned 8-channel memory interface capable of tolerating complete channel or DIMM failures and designed to transparently recover data without impact to response time.
That's what you're paying for with mainframe CPUs, not a PassMark score.
Re: (Score:2)
So ... idling Windows 11 then.
Re:"me too" (Score:5, Interesting)
Clearly you are too young to know IBM's relevance.
IBM are more like the innovator of things that then trickle down to consumer CPUs.
Without IBM, AMD would not even exist anymore. Thrice over. (E.g. the original Athlon was based on innovation brought in by IBM.)
Re: (Score:1)
Re: (Score:2)
and how would you know what's "in the rest of the company anymore"?
Cache details don't mean much to me (Score:3)
OTOH - max 256 x 5GHz Cores!
I suspect there are some benchmarks trembling a bit right now.
Re: (Score:3)
It'll run JCL so fast, you'll think you just completed the Kessel run in less than 12 parsecs.
Re: (Score:2)
Mainframes are not about raw number crunching performance, you can build PCs that can compute rings around this thing for much much less money. They can't shovel data as rapidly, though.
Re: (Score:2)
Off you go then, build this PC.
Re: (Score:2)
No spare funds to waste. Feel free to paypal me at the address above, though, and put your money where your fat fucking mouth is.
Re: (Score:2)
Aww diddums, got called out on your BS and had a hissy fit? Shame.
Re: (Score:2)
I assume the much less money means high 5 figures rather than mid 6.
Nope (Score:4, Informative)
No, it doesn't use AVX-512, not even to talk to the AI thingummie on the dies. Or at least, it doesn't say to in TFA. It says IBM does NOT do things like intel's AVX-512. Which makes much more sense than intel's AVX-512 ever did.
The article is otherwise very thin on details, and so is best skipped for something better. Just like 99%+ of today's /. "summaries".
Re: (Score:2)
Re: (Score:1)
Re: (Score:2)
Or at least, it doesn't say to in TFA
There were 2 articles. The 2nd one [arstechnica.com] says: "The new inference accelerator is placed on-die, which allows for lower latency interconnects between the accelerator and CPU cores—but it's not built into the cores themselves, a la Intel's AVX-512 instruction set."
Re: (Score:2)
Yes, indeed. But it doesn't say what you think it says. That sentence reads "[IBM's] inference accelerator is not built into the cores, whereas intel's AVX-512 instruction set is."
It's a comparison. "IBM does a thing differently from how intel does this other thing." Concluding that IBM used AVX-512 for anything at all, as the summary does, indicates something went wrong with the reading comprehension.
Is this good? I have no frame of reference (Score:2)
From a quick google, I see I can buy an AMD 128 core dual socket mobo/CPU set for 14k. I know you can get 64 cores in a single chip for a few thousand. I've heard you can't really buy an IBM mainframe for less than the cost of a nice house. Is this interesting for anyone starting a new ap
Re:Is this good? I have no frame of reference (Score:5, Informative)
You don’t buy mainframes to crunch numbers. If you used a bank or credit card then you used a mainframe. That is the target market. As for virtualization IBM was doing it in the 1970s on the S/370 line. The whole architecture is so alien compared to anything from the pc realm. I’m not sure if it’s still true but up until very recently their new systems had native binary compatibility going back to the 1960s on the S/360 system.
Re: (Score:1)
I have a 6 six core cpu at home, basically a mainframe from several years ago.
Re:Is this good? I have no frame of reference (Score:5, Informative)
Not even remotely, at all.
Or does it have a backplane? (Not to be confused with a mainboard?) :)
Does it natively ONLY run virtualized environments of different kinds? With hardware support for obscure things like different byte sizes, different byte orders, environments that don't even have a concept of a file and use records instead, and use EBCDIC?
Does it offer CPU interconnects that can cross boards, racks, and even buildings if necessary?
Does it do error detection and correction on ALL the things natively in hardware?
Can you swap cards, drives, RAM, PSUs, and even CPUs while it's running, without it even blinking? (Unless it has blinkenlights for that purpose.
Don't confuse a mainframe with a server.
You literally may not even be able to turn the thing on and enter something, unless you know how. (Though that may have changed, but I remember front displays with obscure codes, serial ports and opening up the thing being the only ways to set it up with anything you and I would call an operating system that you could then use.)
Re:Is this good? I have no frame of reference (Score:4, Informative)
Nope. You're thinking of *supercomputers*. Mainframes aren't about calculation speed, they're about massive data throughput with high reliability and availability. Your CPU might have the computing speed of a mainframe from several years ago, but the speed at which your CPU gets data from RAM is far too slow to handle a mainframe workload.
Re: (Score:2, Informative)
You really shouldn't comment on things you obviously have no idea about. IBM z Series mainframes have totally insane redundancy/resilience and ridiculous I/O channels. They are not for raw number crunching. They are for stuff like processing millions of financial transactions quickly and incredibly reliably. They run some of the most critical parts of the banking system (and other critical infrastructure, like the UK railways and Postal service). Your six core system is a toddler's toy compared to even an a
Re:Is this good? I have no frame of reference (Score:5, Informative)
Oh brother. Here are the specs for the current generation z15.
Up to 190 cores for customer use (there are a bunch more cores used for things like IO processing and hot swap spares). Each core running at 5.2GHz
Up to 40TB usable memory. That is fully redundant memory.
IO bandwidth of 1152 GBps
5.3 million IO/s
Please enlighten us as to which 6 core PC that is
Re:Is this good? I have no frame of reference (Score:4, Interesting)
Until Y2K was well past one of my former employers was paying IBM part of the cost of maintaining the 1401 architecture emulation subsystem on the 370 and 390 series. "Part" because they weren't the only organization in the world running 1401 assembler code in the year 2000.
Re: (Score:2)
Re: (Score:2)
I haven't done much with mainframes over the last 50 years either, but that market is driven not as much by performance and capacity as it is reliability and the ability to repair faults while operating. Cost is a factor but the buyers of mainframes expect to employ their hardware over much longer than those who chase the latest Intel/AMD chipset so the cap-ex is amortized over a long time and is in fact much less than what they pay for SLA contracts.
Re: (Score:1)
Not a mainframe expert, but how is deploying a fully redundant architecture using PC-based hardware less desirable? I can imagine from the cost perspective it would be much cheaper. Even more so if you factor in the labor.
In other words, what does a mainframe provide that could not be replicated?
Re:Is this good? I have no frame of reference (Score:4, Informative)
Re: (Score:2)
Re: (Score:2)
Certainly. It is called Geographically Dispersed Parallel Sysplex.
Re: (Score:3)
Because it’s more complicated and just as expensive to maintain a room full of pcs. Can you guarantee your new pc solution behaves exactly like the system it replaced? Unlikely. Mainframes do.
If you can't guarantee it, you've fucked up (Score:2)
Because it’s more complicated and just as expensive to maintain a room full of pcs. Can you guarantee your new pc solution behaves exactly like the system it replaced? Unlikely. Mainframes do.
I've heard this many times and really really don't understand it. I have worked with business systems for decades. In Java, for example, the inputs and outputs in a well designed system are well documented and unit tested. Recently, functional programming became fashionable and the situation has even improved. If you don't like Java, C#, Kotlin, Go, and various others are worthy modern alternatives.
Theoretically all business software can be mapped out with a flowchart. All known inputs and outputs
Comment removed (Score:5, Informative)
You can gaurantee same behavior between hardware (Score:2)
You bring up the software, but the point about the mainframe was that the software will work EXACTLY as before, even if you hop between 3 generations of mainframe HARDWARE. Meanwhile, on the x86 side, there have been function discrepancies just moving between Sandy Bridge Xeons and Ivy Bridge Xeons, or between different Opteron generations, or from Opteron to EPYC.
There's also the massive I/O where a single mainframe can outperform a large cluster when it comes to connecting to hundreds or thousands of units simultaneously.
In terms of Google, Netflix etc, they all deal with data that it doesn't matter if it gets temporarily lost and has to be redone if a node goes down. That's unacceptable in banking, insurance etc etc, but for Google, it's long been stated that it's acceptable. If a search node physically breaks down, no matter, the user will just type in their search again after the error message. Another trade-off is the sheer amount of floor space and wattage that is burned up by the likes of Google, Microsoft, Amazon etc
I have worked with Java for almost 25 years now. Code I wrote in 1997 works IDENTICALLY today, with one exception, it's faster and more efficient. That's the whole point of the JVM. When you write code correctly, it works identically even between operating systems. Unless you go out of your way to be stupid, business code works identically in Java 1.0 as it does in Java 17 pre-release. No recompilation needed...and as long as you don't use any newly assigned keyword as variables, you can even continue
Re:Is this good? I have no frame of reference (Score:5, Informative)
Cluster of virtual machines - all of your services are on different hardware, if any of that hardware fails / has to go down for maintenance you damn well better hope that your failover plan works and switches out to a new VM seamlessly or you lose that service. You also can't hotswap hardware as easily. Failovers also mean either 1: you are running a VM at idle constantly on OTHER hardware than the original host, or 2: it takes time for you to spin up a new VM and sync the service and DBs for that service ( which have to be mirrored realtime on yet more different machines, eating bandwidth ) VS. the mainframes scheduler hitting a polling event and noticing Hey, CPU's 124-136 are dead, execute this on group 200 instead.
Mainframe - Oh, one drawer of CPU's took a dump? Pull it out and keep on running, while you wait for replacements to arrive. No interruption at all to any of your services since they all run on a physical cluster. Also, when the new CPU's arrive for the drawer you just plug them back into the mianframe and it will start using them redistributing the load internally.
Re: (Score:1)
You can't do it with PC-based hardware.
Unless you introduce mainframe technology, and make it essentially a mainframe.
E.g. how would you *guarantee* prevention of all bit flips *everywhere*? That includes the CPU, all busses, and all memories/buffers of any kind. PCs are only getting there, now that it's becoming a real problem for consumers. (CRC and the like only goes so far. There are already tons of collisions. You just don't notice because usually it doesn't matter. A game might show something wrong. A
Re: (Score:2)
Imagine, a CPU fails and the machine keeps going. A stick of RAM fails and the machine keeps going.
Depending on requirements, it may be cheaper to deal with warm or cold failover, or to engineer your own solution in software. Or it might be cheaper to go the mainframe route.
Re: (Score:1)
Very high availability and massive bus bandwidth --
I am reminded of the old mainframe when I was in college 30 yrs ago.... the maintenance dept. told me it had only been rebooted once in 30 years because of a building fire, that machine ran everything for ~5,000 students and faculty. And of course, the accounting dept.
Most mainframes, you can hot-swap pretty much anything without even causing a blip. Fully error detecting and correcting at every level, etc.
Re: (Score:2)
A fully redundant architecture using generic hardware needs some form of interconnect to keep all the nodes in sync... Generally the throughput and latency of that interconnect will be a lot worse than what the mainframe can offer.
Re: (Score:3, Interesting)
Not only it's ridiculously expensive and ridiculously slow, but it's also the only bad-endian arch kept alive.
In Debian, we have s390x as an arch with no porters and no users, that's somehow kept alive only because IBM pays Ubuntu (not Debian!) to keep it alive. Which costs us developer time as maintainers are supposed to fix arch-specific problems there, for no benefit to the distribution, just to let IBM nickel and dime legacy customers.
I have no problem with ppc64el which has at least some users -- but
Re: (Score:3)
You're ignorant, a mainframe can do the transactional work of two thousand x86 servers. Look it up. Your MIPS-hot little x86-64 would be IO bottlenecked trying to do mainframe work.
Re: (Score:2, Funny)
Your MIPS-hot little x86-64 would be IO bottlenecked
Well, my home desktop has a RAID0 of 8 Optane NVMe-s tucked into every spare PCIe lane (2 in NVMe slots, 2 in 4x card slots, 4 on a 4x4x4x4x carrier board; 1 NVMe slot is taken by a legacy (flash) disk, a 16x slot by the GPU, a 1x slot by a RS232 card). For lukewarm data, there's that legacy NVMe, for cold stuff a piece of spinning rust.
Now let's see that dinosaur of yours get a better IO latency.
And that's just a 2018 consumer-tier machine. Any large server today has a few TB of pmem -- which has ~100x b
Re:Is this good? I have no frame of reference (Score:5, Informative)
Is this supposed to be a joke? Current generation mainframes support 384 FICON ports, 48 25bps ethernet ports, 96 1000baseT ethernet ports. Total IO bandwidth of 1152GBps. 5.3 million IO/sec
Re: (Score:2)
Is this supposed to be a joke? Current generation mainframes support [...] 48 25bps ethernet ports
Meanwhile, that "ordinary x86" you downplay takes 400Gbps ethernet cards. There's a wee bit of difference in latency between 25Gbps and that. And durable storage which works at memory speeds, instead of IBMs "our UPSes never fail" which leads to a big oops when they do. Granted, your mainframe can take more network ports, but you need to separate it into many VMs (as the OS can't do coherency in one piece), we can compentarize this into physically separated machines just as well. There's not much gain f
Re: (Score:2)
Wow, that is amazing! Oh, wait, it isn't. Mainframes have been doing that for over 30 years. https://en.wikipedia.org/wiki/... [wikipedia.org]
Re: (Score:2)
You couldn't run a thousand programs all doing a couple thousand transactions of IO per second simultaneously on your storage, your system would be essentially deadlocked waiting for itself. A mainframe can have hundreds of FICON channels each with up to 32K devices each. No comparison to desktops.. for the job of transactions on data in storage. Maybe for other types of jobs your desktop, or "supercomputer array" of them, might do better but that's not where mainframe lives.
You should read over mainfra
Re: (Score:3)
Re: (Score:2)
Where does this idiotic 'slow' idea come from?
Re: (Score:2)
Code that only works on little-endian machines, unless it is specifically written as e.g. a device driver, is broken. It's a tragedy that we no longer have a popular big endian architecture forcing shitty programmers to fix their bugs.
Besides upsides of little-endian, what would be the benefit, pray tell, of having to support different endianness? It's akin to having different AC sockets or different charset encodings (or worse, all the incompatible variants of IBM's encryption standard (EBCDIC)).
The sooner we get rid of US 110V-only plugs, non-UTF8, and bad-endian, the better.
Re: (Score:2)
Big endian isn't going away. TCPIP is big endian, as are most IETF network protocols. So is Java.
What kind of idiot thinks EBCDIC is an 'encryption standard'?
Re: (Score:3)
I suspect the primary use case if for banks and other industries with Jurassic code who have decided their application(s) would cost more to refactor and test than they consider "worth it" so this is just the next in a long line of hardware refreshes as their older equipment ages out (again).
Re: (Score:3)
Re: (Score:3)
Jurassic code? there is plenty of new mainframe code too, a pile of x86-64 with only the ability to have high GFLOPS and MIPS won't cut it when needing million of transactions per second, there would be I/O bottleneck. Mainframes do what your commodity processors in parallel can't.
Re: (Score:2)
It is the near-no-downtime aspect, that you can replace essentially anything with the system still running.
Re: (Score:2)
Mainframes are extremely high performance... for a certain kind of work.
So you're interested in pure number crunching? What if you had banking or insurance code, written in a mix of COBOL, Java and C+++, that required you to do over a million transactions per second and had to up with no downtime at all? That's the mainframe market, the very back end of your bank and your insurance company and your global airline reservation system.
Re:Is this good? I have no frame of reference (Score:5, Interesting)
I worked at IBM for about 4 years in & after college system testing Linux and middleware on the mainframe. When I first started, I wasn't particularly impressed with them compared to commodity x86 servers but over time I started to recognize some of the power and positive aspects that they provided over racks full of commodity servers.
First off, they are I/O monsters. They were and are still widely used for transaction processing. It helps that IBM develops either specialized hardware instructions or separate hardware components to accelerate or offload workloads. Virtualization support has been baked into the hardware for decades, including partitioning resources at the hardware level. Crypto, networking, and other I/O can be offloaded to "cheaper" processors or SoCs, leaving the million dollar CPs (central processors) for your application. Over the past 15 years since I moved on they've added additional specialized hardware and capabilities to stay abreast of current computing needs/trends.
Folks seem to not want to think about this in the age of microservices, but there's a significant cost to scaling out in terms of complexity, performance, and HW/SW maintenance. It can be more convenient, simpler, and possibly less expensive to scale up, particularly when IBM lets you connect them together in a parallel sysplex configuration for even more redundancy. It definitely takes big piles money to do so, but it could provide a decent ROI if you don't need clusters or data centers worth of gear and as many people to keep it going – think "data center in a box" or "cloud in a box" (albeit expensive).
(I left IBM to go work on x86 virtualization and cloud computing, so I've also got plenty of experience at the opposite end of the spectrum too.)
Re: (Score:1)
Don't forget it's not all script kiddie WhatWG devil may care spaghetti code, but sometimes literally life and death code that may make or brake entire economies. The kind of code where 10 lines a day is a lot for one person.
Comment removed (Score:5, Interesting)
Re: (Score:2)
Your 400hp engine can only develop safely develop that much power for short bursts. A 4,400hp railway loco can develop full power continuously for hours on end.
Re: (Score:2)
Regarding your questions about comparing core counts:
IBM is very good at designing hardware to achieve excellent symmetric multi-processing performance. Just naively throwing a bunch of cpu's together into a system results in rapidly diminishing per
Re: (Score:2)
Is this good? How does this compare to x86 and Linux? I've never had the misfortune of working with IBM mainframes, but I know they're legendarily expensive and have heard they're underwhelming in performance terms.
A mainframe is not a supercomputer made to crunch lots of numbers outright. A mainframe is for handling a large number of I/O transactions with redundancy, high availability, and reliability. They are not made to simulate weather; they are made to handle tens of thousands of financial transactions at once while checking the accuracy of the transaction multiple times. Also parts of the mainframe can be replaced while it is operational and 6 nines or uptime is normal.
Comment removed (Score:4, Informative)
Imagine a beowulf of these... (Score:1)
In hot grits, with Natalie Portman.
Meh.
Kids today.
Re:It's amazing we can build such things (Score:5, Informative)
https://www.theregister.com/20... [theregister.com]
pdp11 running a nuclear power plant.
Re: (Score:1)
https://www.theregister.com/20... [theregister.com]
pdp11 running a nuclear power plant.
Well that's not fair. You chose probably the best computer ever built. I remember the Xerox 9700 that had a PDP 11/42 as the heart of the machine. Later Xerox made the entire computer except for the CPU. The machine used to print 2 pages/second with overlays and could do merging at the same time. Two sided printing. That one also had a split bus.
Used to love the PDP 11. I don't think I've had my hands on one since the 1990s.
I didn't know one ran a nuclear power plant. Thanks.
Re: (Score:1)
Fun fact: Technically, life in a phenomenon that reverses entropy (locally). By turning chaos into highly structured... well... structures. Which, again technically, equals a reversal of time. (We can only tell the arrow of time from the growth of entropy. But entropy only grows because statistically, chaos is a more likely outcome of a set of moving particles, than order. Quantum physics has no requirement that the unlikely outcome can not happen. It does not distinguish between forward and backward. Only
Re: (Score:1)
Stupid indeed. Let's see Intel's ridiculously measly PCIe lanes keep up with a machine designed, top to bottom, with almost a century of evolution, for massive IO throughput. :)
Xeon / Epyc (Score:2)
I mean, really, mainframes these days are merely souped-up PC servers running old COBOL and FORTRAN legacy software.
Re: (Score:1)
I mean, really, mainframes these days are merely souped-up PC servers
The ignorance on this thread is astounding (but not surprising). You have literally no idea about the ridiculous resilience, reliability, scalability and insane I/O capabilities of a modern z series IBM mainframe.
Re: (Score:3)
Re: (Score:2)
Mainframe enlightenment would be out of range for most, but guessing that the homelabbing [reddit.com] trend is exposing a decent amount of PC-only folks to the world of server hardware. Similar in spec to what you got, I caught the bug a few years ago with a Dell R720 and it's been pretty addictive. The parallel rise of cheap auctioned hardware and VM technology in general dovetailed nicely over the last decade or so.
(Btw, yes, the NSA is indeed stealing my porn but the proof is too long to fit into the bottom of thi
Re: (Score:2)
Comparable in performance
If you don't have any use cases, you have no reason to know what IO even is.
Or what performance would mean.
For you, it has worse performance; it doesn't run any of your games.
No such thing as virtual cache (Score:1)
You either have it or you don't.
Sounds what they really have is a cache that's reconfigurable between how much is exclusive and how much is shared. Nifty feature, could be really useful, but needs a better name. Calling it "virtual" is a bad idea.