Quote from: Dont_Look_Up on April 15, 2025, 20:28:18
ZBook Ultra has a larger battery than the Flow Z13 (74.5 vs. 56 watt-hours)

Quote from: Dont_Look_Up on April 15, 2025, 20:28:18
shows how unoptimised the default power profiles are.
Quote from: Worgarthe on April 15, 2025, 14:13:45
Edit: So the game runs at 30 fps but the total system power is 7.6 to 8.4 W, which is completely insane and way too impressive 😳
Quote"Dual-Channel" seems wrong even on HP's datasheet and quickspecs. Ignoring that LPDDR5 channels are 32bit, the Ryzen AI Max 256-bit memory bus should have 4 64bit channels ("Quad-channel"). AMD's specs do not list a channel number. Memory channels are not mentioned on this site's review nor in Apple's specifications (though some sites say the MacBookPro M4 Max 128GB has 8 channels). For accuracy maybe the channels should be omitted here as in the MacBookPro review. (Or make it "multi-channel" if you need to contrast with "single-channel".)Good point, it could be 8*32-bit, instead of 4*64-bit like in normal desktop systems or both. But it leads to the same following bandwidth speed calculation:
Quote
You can't use the AIDA memory benchmark result for this formula ..

Right, ty for making me look it up again: Strix Halo is a 256-bit chip, so its theoretical bandwidth is:

8000 (MT/s) * 64 (bits per channel) * 4 (channels, i.e. the 256-bit memory bus width) / 8 (bits to bytes) / 1000 (MB to GB)
= 256 GB/s

The practical benchmark value is often 70-80% of the theoretical value, so 256 GB/s * 0.75 = ~192 GB/s.

tokens per second = 192 GB/s / 39.6 GB (Llama-3.3-70B-Instruct-Q4_K_M.gguf)
                  = 4.85
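For reference, the same arithmetic written out as a short Python sketch. This is only a back-of-the-envelope estimate: the 0.75 efficiency factor and the 39.6 GB model file size are the assumptions from the post above, not measurements.

Code:
# Back-of-the-envelope LLM inference estimate for Strix Halo.
# Constants come from the discussion above; 0.75 is an assumed
# bandwidth efficiency, not a measured value.
MTS = 8000            # LPDDR5X transfer rate in MT/s
BUS_WIDTH_BITS = 256  # memory bus width (4*64-bit or 8*32-bit)
EFFICIENCY = 0.75     # practical bandwidth is often 70-80% of theoretical
MODEL_GB = 39.6       # Llama-3.3-70B-Instruct-Q4_K_M.gguf file size

theoretical_gbs = MTS * BUS_WIDTH_BITS / 8 / 1000  # 256.0 GB/s
practical_gbs = theoretical_gbs * EFFICIENCY       # ~192 GB/s

# A dense model streams all of its weights once per generated token,
# so bandwidth / model size gives a rough upper bound on tokens/s.
print(practical_gbs / MODEL_GB)  # ~4.85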
Quote from: tokens per second on April 15, 2025, 08:55:01
I agree. You can still roughly calculate the speed (works for "dense" models like the Llama-70B):
Code:
tokens per second = bandwidth / filesize
                  = 121177 MB/s / 39600 MB (Llama-3.3-70B-Instruct-Q4_K_M.gguf)
                  = 3.06

You can't use the AIDA memory benchmark result for this formula because it's just a classic CPU-related workload with its own intrinsic bottlenecks. I am sure the iGPU can achieve much more throughput, near the theoretical limit of the memory bus.
Quote from: Dont_Look_Up
No, he is playing "Celeste" at 1080p with full effects for 9.5 hours!
Check his video at 34 min (I can't post the link).

Ok, thank you for the info with the timestamp. I know the vid is long, so that's helpful and I appreciate it. I will check in a moment.
Quote from: Worgarthe on April 15, 2025, 11:29:22
Quote from: Dont_Look_Up on April 15, 2025, 11:25:34
He gets 9.5 hours of battery life while gaming!
Playing Minesweeper and Solitaire?
Quote from: Dont_Look_Up on April 15, 2025, 11:25:34
He gets 9.5 hours of battery life while gaming!

Playing Minesweeper and Solitaire?
Quote
3. The biggest bummer: as always, there is not a single test regarding AI model inference using llama.cpp or ollama. This notebook has been made especially for this purpose, so please add some benchmarks using at least a 70b Q4 model and do the same on a similarly performing MacBook Pro.

I agree. You can still roughly calculate the speed (works for "dense" models like the Llama-70B):
Code:
tokens per second = bandwidth / filesize
                  = 121177 MB/s / 39600 MB (Llama-3.3-70B-Instruct-Q4_K_M.gguf)
                  = 3.06
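The same estimate as a minimal Python helper. The 121177 MB/s figure is the AIDA64 memory read result cited above; whether that number reflects what the iGPU can actually pull is debated further up the thread.

Code:
# Rough dense-model speed estimate: every generated token has to
# stream the full weight file through memory once.
def tokens_per_second(bandwidth_mb_s: float, filesize_mb: float) -> float:
    return bandwidth_mb_s / filesize_mb

# AIDA64 read bandwidth vs. Llama-3.3-70B-Instruct-Q4_K_M.gguf
print(tokens_per_second(121177, 39600))  # ~3.06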