Quote from: _MT_ on March 05, 2020, 15:03:00
Quote from: Valantar on March 04, 2020, 09:41:51
Which obviously makes something like this impossible to implement for a semi-custom console APU - they would need to discard >90% of chips! That would not only be idiotic, but would drive console prices through the roof (with at least a 10x price increase for the SoC).
...
Zen2 on 7nm has its efficiency sweet spot around the ~3.2-3.5GHz mark
Not entirely true. The cost of a chip is not just the cost of manufacture. And even in manufacture, I'm not that familiar with integrated circuit manufacture, but in general, there are often significant one-time expenses. Also, if you take a modular approach (multiple chiplets), the situation improves (you're discarding fewer cores/ smaller piece of silicon). Yes, a 10 % yield in a chip for a high volume consumer product is not going to be acceptable.
That's pretty high. A few years ago, there was a paper where they determined 200 MHz to 1.6 GHz as a "sweet window" with efficiency being pretty flat in this range. I'm not sure what they studied, but it was a phone SoC, I believe. In general, as density goes up (nodes get finer), leakage current becomes a bigger and bigger problem. The cost of simply having the core "switched on" is getting relatively larger which is what's driving the bottom end of the range. At the high end, voltage is the killer. The higher the frequency, the higher the required voltage for stable operation. And power rises with the square of voltage. A 10 % increase in voltage is a 21 % increase in power. I wouldn't really expect the sweet spot to be significantly above 2 GHz going by the specifications of Epyc Rome.
You can't just go and compare clock/voltage sweet spots across different nodes - or even different architectures on the same node - so that comparison is entirely invalid. Also, PC chips generally use high performance libraries for their designs rather than the high density and/or high efficiency ones mobile SoCs use, which has dramatic effects on clock scaling and where the efficiency curve starts rising dramatically. Mobile SoCs are far more efficient at low clocks, but can't clock even close to as high as PC chips, with efficiency typically plummeting somewhere between 2 and 3 GHz with a hard stop close after that. On the other hand the efficiency curves of desktop chips tend to be relatively flat for a long time, meaning they don't get particularly efficient at low clocks (hence why tiny ARM cores have better battery life), but scale quite high before becoming inefficient, and still keep going for a while after that before becoming unstable. And as I said, the per-core power draw of a Zen2 core in a Threadripper 3990X (which, barring binning differences, is the same as any other Zen2 core) is just ~3W at 3.45GHz, while a single core boosting to 4.5 can consume around 6x that number if not more. In other words, that's pretty much the sweet spot for that chip on that node. Subtracting a few hundred MHz for binning and for margin to allow as many chips as possible to pass QC and you have a 3-3.2GHz very efficient console CPU.
As for manufacturing, it's highly unlikely that a console will use chiplets. While this would bring down die sizes and lower prices that way, packaging would be more expensive, and you'd need a third I/O die as both CPU and GPU need equal access to the unified memory of the system, complicating things quite a bit on the manufacturing side. Of course this is guesswork, but a monolithic die is still by far the most likely. You are of course right that R&D is a very significant part of the cost of the chip though, so I should have said a 10x increase in the manufacturing cost of the SoC.