Quote:
AIDA64 / Memory Read
Asus ROG Flow Z13 GZ302EA-RU073W
AMD Ryzen AI Max+ 395, Radeon RX 8060S
121931 MB/s
Let's add what has already been said in notebookchat.com/index.php?topic=226695 ("HP ZBook Ultra G1a 14 workstation review: MacBook Pro alternative for gamers")
Strix Halo has a 256-bit memory bus, so its theoretical bandwidth is:
memory bandwidth [GB/s] = 8000 (MT/s) * 64 (bits per channel) * 4 (channels, aka quad-channel, = the 256-bit memory bus width) / 8 (bits to bytes) / 1000 (MB to GB)
= 256 GB/s
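For anyone who wants to redo the arithmetic with other memory speeds, here is a minimal Python sketch of the formula above. The function name and parameters are my own, and it assumes the 8000 figure is the transfer rate in MT/s.

```python
def theoretical_bandwidth_gb_s(transfer_rate_mt_s: float, bus_width_bits: int) -> float:
    """Peak memory bandwidth in GB/s for a given transfer rate and bus width."""
    bytes_per_transfer = bus_width_bits / 8          # bits to bytes
    mb_per_second = transfer_rate_mt_s * bytes_per_transfer
    return mb_per_second / 1000                      # MB to GB


# 4 channels * 64 bits per channel = 256-bit bus, as in the post above
peak = theoretical_bandwidth_gb_s(8000, 64 * 4)
print(peak)  # 256.0
```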
The practical benchmark value is often 70-80% of the theoretical value, so:
256 GB/s * 0.75 = 192 GB/s
LLM tokens per second (an upper bound, since every generated token requires reading all model weights from memory once) = 192 GB/s / 39.6 GB (Llama-3.3-70B-Instruct-Q4_K_M.gguf)
= 4.85
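The same estimate as a small Python sketch, so the expected and the AIDA64-measured bandwidth can be compared side by side. The helper name is made up for illustration. It assumes token generation is purely memory-bandwidth-bound and ignores compute, KV cache reads and other overhead, so these are rough ceilings rather than predictions.

```python
def tokens_per_second_upper_bound(bandwidth_gb_s: float, model_size_gb: float) -> float:
    """Rough ceiling: all model weights are read once per generated token."""
    return bandwidth_gb_s / model_size_gb


MODEL_SIZE_GB = 39.6                 # Llama-3.3-70B-Instruct-Q4_K_M.gguf
practical_gb_s = 256.0 * 0.75        # 75% of the theoretical 256 GB/s = 192 GB/s
measured_gb_s = 121931 / 1000        # AIDA64 Memory Read result, MB/s to GB/s

print(round(tokens_per_second_upper_bound(practical_gb_s, MODEL_SIZE_GB), 2))  # 4.85
print(round(tokens_per_second_upper_bound(measured_gb_s, MODEL_SIZE_GB), 2))   # 3.08
```

So if the AIDA64 number reflected what the GPU actually gets, the ceiling would drop from about 4.85 to about 3.08 tokens per second.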
Obviously, the measured 121931 MB/s (about 122 GB/s, i.e. less than 50% of the theoretical 256 GB/s) doesn't align with the roughly 192 GB/s that should be measured. Notebookcheck, could you run an LLM, e.g. Llama-3.3-70B-Instruct-Q4_K_M.gguf, and see how many tokens per second you get?