The expected performance of the Nintendo Switch 2 easily reaches the performance of the Playstation 4 Pro, which delivers almost 4.2 TFLOPS! Switch 2 can also easily surpass the performance of the Playstation 4 Pro!
1. -- Expected GPU of the handheld Switch 2: Nvidia T239 Ampere , at 1007.25 MHz (clock is adjustable with software) = 3.09 TFLOPS performance in FP32.
-- GPU of another handheld, Rog Ally: AMD 780M (in Z1 Extreme APU), at 2700 MHz (clock specified as Boost Clock, also adjustable) = approx. 4.1 TFLOPS performance in FP32.
2 – An example of change in performance as TFLOP with two graphics cards, both with the same graphics processor of the other, both as GTX 1080, with different boost clocks:
-- ASUS ROG STRIX GTX 1080 GAMING: 8.873 TFLOPS in FP32 at Boost Clock 1733 MHz.
-- Colorful iGame GTX 1080 Fire Ares X-TOP AD LE: 9.784 TFLOPS in FP32 at boost clock 1911 MHz.
In the example with the grapics cards with the GTX 1080 by increasing the boost clock by "only" 178 MHz, the performance of the GTX 1080 was increased by 0.911 Tflops!
Nintendo had the clock speed of the Switch 1 increased later, after the release, for more performance gain!
3. What many people don't seem to know: If two identical graphics cards (GPUs) are placed on two different main boards, on which ram memory and processors (CPUs) are also located, whereby the distances between the first board on a stationary console are much greater than on the second board as on a handheld, then the following happens: The handheld's system performs much better overall because the information has to travel much shorter distances between the CPU and GPU and the storage medium! Therefore, a smaller handheld objectively performs significantly better than a larger console, even if the GPUs and CPUs would have the same performance individually.
-- In short: The Switch 2 can "easily" achieve the performance of Rog Ally, which uses the Z1 Extreme chip, by increasing the boost clock by a few hundred MHz, as well as the performance of the Playstation 4 Pro, which is specified with almost 4.2 TFLOPS.