I could be wrong, but there's a mix up here between NBC's 4K and 4K QD32 results. Still, considering the subject's 256 GB capacity (with SSDs, larger capacity = better performance) and physical size, QD32 read/write speeds around 800 MBps is a noticeable improvement from PM961, or even the higher-end SM961. Benchmarks in other 4K areas aren't bad either.
Random 4K, especially QD1 or just "4K" mainly determines the "snappiness" of UI experience, and QD1~4 is what matters most for a typical user. Which explains why "Real life performance was excellent – application start times or file copy transfer speeds were very fast." I doubt majority of people actually need 4 digit sequential speeds on the SP, though it's nice to know the potential. As for QD32, it's rather relevant to server level performance, but I guess I'm excited anyway.
I'm no expert, so here's a much better explanation:
http://www.overclock.net/t/1231707/can-someone-explain-the-different-crystaldiskmark-tests
http://www.violin-memory.com/blog/understanding-io-random-vs-sequential/