Quote from: dasdkas on June 15, 2020, 12:51:52
This is interesting but then why A100 is a monolithic die?
The A100 die does not feature RT cores, so this could be plausible considering this fact and this fact only.
Neverthless, with that being said, considering that a frame on Turing using RT and Tensor cores goes as follows: FLOAT --> RT CORE --> INT+FLOAT --> TENSOR CORE; it seems inefficient, that the RT part of the frame is offloaded, processed and then brought back again to run through the Tensor cores. Furthermore, a lot of gains will be achieved in the denoising part, meaning, using the Tensor cores.
But I'm no expert. So, maybe someone can add something to the discussion. I'm defnitely looking forward to what NVIDIA engineers come up with!