Topic summary

Posted by Richard Costello
 - March 19, 2021, 18:26:39
Quote from: Mate on March 14, 2021, 15:29:10
@Richardd Castello You are wrong - M1 uses more transistors than Intel/AMD core.

M1 = 16 billion / AMD = 40 billion, so don't talk out of your arse, 'Mate'. The size of the transistors and the relative size of the cores mean more supplemental silicon can be on the chip, hence SoC. That will mean more transistors on the M1, but not per core. But that conversation is just confusing for the layman, which is who my post was aimed at. Same for the microcode and x86 internals not being RISC-like. Out-of-order execution etc. used to be classed as a RISC-like feature. But again, that's overcomplicating things.

You are probably aware, as I am, that ARM isn't really RISC anymore anyway. But in layman's terms the explanation needs relatively simple descriptions. It's called trying to help, rather than being a nob.
Posted by _MT_
 - March 15, 2021, 12:48:25
Quote from: riklaunim on March 14, 2021, 19:05:17
AFAIK eGPU on Apple is limited to AMD cards only. Newer Nvidia cards are not supported.
Unless something has changed, no current eGPU will work on M1. I imagine the problem is simply that Apple isn't interested in releasing drivers. Potentially to keep the market captive.

On their Intel platforms, they use GPUs from AMD, so macOS does have drivers for them.
Posted by _MT_
 - March 15, 2021, 12:08:55
Quote from: Mate on March 14, 2021, 15:29:10
Additionally, ARM64 is almost as complex as x86-64. It is a reduced instruction set but still executes ~1000 different instructions. Also, x86 CPUs are not RISC inside. Every CPU (M1 too) translates its instruction set into micro-ops and then executes them.
Well, real processor designs are converging. RISC folk found out they could never compete without more complex instructions. And CISC folk found the sets growing too complex, necessitating a sort of hardware emulation. You can also see wide designs getting faster and fast designs getting wider.

There are fundamental differences between ARM and x86. A big factor for x86 is backward compatibility. It's a strength and a weakness at the same time. They benefit from huge proliferation and conservatism in IT. But it's also tying their hands. They have to live with decisions made decades ago, under completely different circumstances. The rub is that once the market becomes open enough to change, they can take advantage of it as well. Either starting something new or redefining what x86 is. Essentially, what we are seeing is that there are many ways to skin a cat, so to speak. Design work is about compromises. It isn't religion, it isn't dogma.

Another step we might see in the not so distant future is a sort of super-core design with many-way SMT. For example, instead of having 8 cores with 2-way SMT (meaning 16 threads in total), you could have a single core with 20-way SMT (20 threads processed simultaneously). With dynamic partitioning of resources between the many logical cores. Achieving higher utilization. Of course, we might never see it if it doesn't work out.
Posted by riklaunim
 - March 14, 2021, 19:05:17
Quote from: [email protected] on March 14, 2021, 17:54:16
This is all well and good, but you cannot use an eGPU box with a 3080 card
AFAIK eGPU on Apple is limited to AMD cards only. Newer Nvidia cards are not supported.
Posted by [email protected]
 - March 14, 2021, 17:54:16
This is all well and good, but you cannot use an eGPU box with a 3080 card or 64 GB of RAM, and there are other issues that need to get resolved on these M1 processor systems. Will I get a new Mac with an Apple processor? Yes, but not for at least 2 more years, when everything is working as it should.
Posted by Jan Onderwater
 - March 14, 2021, 17:08:44
I did some calculations myself, assuming the CPU scales at 80% efficiency and the GPU at 90%.
Every generation (M1 -> M2) gains 10% in single-core speed.


M1
             Geekbench                   Cinebench
En  HP  GPU  Single  Multi    Metal      Single  Multi
4   4   4    1.709   7.398    21.982     1.498   7.508
4   8   8    1.709   11.387   39.564     1.498   10.800
4   12  12   1.709   16.856   59.346     1.498   15.594
4   16  16   1.709   22.325   79.128     1.498   20.387
4   24  24   1.709   33.262   118.692    1.498   29.974
8   24  32   1.709   33.712   158.256    1.498   31.187
12  36  64   1.709   50.568   316.512    1.498   46.781
16  48  128  1.709   67.424   633.024    1.498   62.374
24  72  256  1.709   101.136  1.266.048  1.499   86.342

M2
             Geekbench                   Cinebench
En  HP  GPU  Single  Multi    Metal      Single  Multi
4   4   4    1.880   8.137    24.180     1.648   8.259
4   8   8    1.880   12.526   43.524     1.648   11.882
4   12  12   1.880   18.542   65.286     1.648   17.155
4   16  16   1.880   24.558   87.048     1.648   22.429
4   24  24   1.880   36.590   130.572    1.648   32.976
8   24  32   1.880   37.083   174.096    1.648   34.310
12  36  64   1.880   55.625   348.192    1.648   51.466
16  48  128  1.880   74.166   696.384    1.648   68.621
24  72  256  1.880   111.250  1.392.768  1.648   102.931

M3
             Geekbench                   Cinebench
En  HP  GPU  Single  Multi    Metal      Single  Multi
4   4   4    2.068   8.951    26.598     1.813   9.085
4   8   8    2.068   13.778   47.880     1.813   10.800
4   12  12   2.068   20.394   71.820     1.813   15.594
4   16  16   2.068   27.011   95.760     1.813   20.387
4   24  24   2.068   40.246   143.640    1.813   29.974
8   24  32   2.068   40.787   191.520    1.813   31.187
12  36  64   2.068   61.181   383.040    1.813   46.781
16  48  128  2.068   81.574   766.080    1.813   62.374
24  72  256  2.068   122.362  1.532.160  1.813   86.342
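For anyone who wants to play with the assumptions, here is a minimal Python sketch of the arithmetic. The 10% per-generation gain and the 80% CPU scaling factor come from the post. The rule that each extra performance core adds 80% of one single-core score is only a guess at the method: it reproduces the single-core rows and the 8-to-12-core step of the M1 table, but the post does not state its formula and not every row follows it. The function names are made up for illustration.

```python
GEN_GAIN = 1.10     # from the post: +10% single-core per generation
CPU_SCALING = 0.80  # from the post: CPU scales at 80% efficiency


def single_core(base_score: float, generations: int) -> float:
    """Single-core score after the given number of 10% generation gains."""
    return base_score * GEN_GAIN ** generations


def add_perf_cores(multi_score: float, single_score: float, extra_cores: int) -> float:
    """Assumed model: every added core contributes 80% of a single-core score."""
    return multi_score + extra_cores * CPU_SCALING * single_score


m1_single = 1709  # "1.709" in the table (period = thousands separator)

print("M2 single-core:", round(single_core(m1_single, 1)))
print("M3 single-core:", round(single_core(m1_single, 2)))
# Going from 8 to 12 performance cores, starting at the tabulated 11.387:
print("M1, 12 perf cores:", round(add_perf_cores(11387, m1_single, 4)))
```

Changing GEN_GAIN or CPU_SCALING shows how sensitive the large-core-count projections are to those two guesses.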
Posted by Lee Rutter
 - March 14, 2021, 15:42:13
Wow, thank you for this amazing, layman's detail. Greatly in alignment with what I know, and then some.  :)

Apple has been working on their M1 for half a decade and, obviously, the iPhone and iPad have gone through many iterations, so Apple has been quietly working to bring everything together so that they are on par or ahead of the game.

I am elated that Apple has worked on the M1 processor for so long and looking forward to the M1X as that is the one I will be buying once it rolls out and hopefully very soon.


Quote from: Richardd Costello on March 13, 2021, 22:54:08
Layman's terms as best I can. [...]
Posted by Mate
 - March 14, 2021, 15:29:10
@Richardd Castello

You are wrong - the M1 uses more transistors than an Intel/AMD core. However, it can also execute more instructions per cycle, so it can compete with x86 CPUs without boosting to 4 GHz+. The main reason for the M1's efficiency is the smaller manufacturing process (they bought the whole TSMC 5nm capacity for wagons of cash) - that alone gives 30-40% less energy consumption. Add big.LITTLE (another non-Apple technology) and voilà, we have the super energy-efficient M1. That CPU is no miracle - AMD could easily smash its performance per watt under load if only AMD had access to 5nm nodes.

Additionally, ARM64 is almost as complex as x86-64. It is a reduced instruction set but still executes ~1000 different instructions. Also, x86 CPUs are not RISC inside. Every CPU (M1 too) translates its instruction set into micro-ops and then executes them.
Posted by Markus
 - March 14, 2021, 14:41:50
Quote: Miani came to this estimate by comparing the differences in results between the A12 and A12Z (1.68x faster) and the A14 and M1 (1.87x faster) and then picking an average for the M1X.

You did what? What a waste of time!
Posted by Richardd Costello
 - March 13, 2021, 22:54:08
Layman's terms as best I can.

RISC = simple instructions, executed quickly; to do something complicated you need a group of instructions. Simplicity means fewer transistors, which means less power.

CISC = complex instructions, executed slowly; you can do complicated things with one instruction. Complexity means more transistors, which means more power (consumption).

Most of the time, CPUs execute simple instructions so RISC is the way to go for performance. BUT years ago, the CPU of choice for the IBM PC was chosen by the accounting department rather than the engineering department. They went for a chip that they could get a good discount on because of existing deals and chose the Intel 8088 (an 8086 variant).

This choice gave Intel lots of money to develop their chip family, making it more and more complex and crucially utilising faster and faster clock speeds. Other companies couldn't keep up, hence Apple dumping the PowerPC RISC design and jumping on the Intel bandwagon because even IBM couldn't keep up with Intel's clock speeds.

The simplicity of the RISC design means you need a relatively small number of transistors to achieve 1 instruction per clock cycle. 80x86 needed many (variable number of) cycles per instruction. Intel developed the 80x86 line up to the Pentium 4 when they hit a problem. They simply couldn't get them to run faster than 3.4GHz - they were melting. So at that point they switched the architecture to the Core line which is more RISC like but internally decodes 80x86 to maintain compatibility. Still complex.

Ultimately Intel has managed to achieve 1 instruction per clock cycle (like RISC) but to achieve this they have had to have ever more complicated pipelining of instructions, meaning even more transistors, more power, more heat.

Think of a processor like a production line in a car factory. It might take 3 days to make a car, but if you have many stations all working on cars moving along a production line, then after 3 days of production the cars actually come off the line one per minute. CPUs are like this, with many instructions being worked on at the same time.
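The production-line picture can be put into numbers with a toy model. The sketch below assumes an ideal pipeline (one stage per cycle, no stalls or branch mispredictions), which no real CPU achieves. The function names and the 5-stage depth are illustrative choices, not a description of any particular chip.

```python
def pipelined_cycles(n_instructions: int, stages: int) -> int:
    """Cycles for an ideal pipeline: fill it once, then retire one instruction per cycle."""
    if n_instructions == 0:
        return 0
    return stages + n_instructions - 1


def unpipelined_cycles(n_instructions: int, stages: int) -> int:
    """Cycles if each instruction must pass through every stage before the next one starts."""
    return n_instructions * stages


STAGES = 5  # assumed pipeline depth, purely illustrative

for n in (1, 10, 1000):
    print(n, "instructions:",
          unpipelined_cycles(n, STAGES), "cycles unpipelined vs",
          pipelined_cycles(n, STAGES), "cycles pipelined")
```

A single instruction still takes the full 5 cycles (the "3 days per car"), but over a long run the cost approaches one cycle per instruction (the "one car per minute").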

What has happened recently with the M1 is that Apple has got close to Intel's clock speeds. The M1 is rumoured to be running at 3.2GHz, which is very fast for the ARM architecture. This speed means that the ARM can process its instructions at the same rate as the Intel chips but crucially using a LOT fewer transistors.

This has a twofold advantage for Apple. Firstly, their chip uses a lot less power (fewer transistors), generating a lot less heat. Secondly, it makes their processor core very small compared with an Intel core.

The physically small core and low power means they can add more cores but crucially also means they can add more supplemental hardware to the die. They can have memory controllers and the RAM itself on chip - and can subsequently run the RAM a lot faster than your typical DDR RAM units. The whole SOC (system on a chip) thing means everything is faster.

Apple have also been able to produce their chips using smaller manufacturing methods than Intel can reliably achieve. Further adding benefits.

Intel has always been heading for a dead end with CISC; sooner or later RISC was going to dominate once manufacturing and high clock speeds became mass market.

Intel have now reached that dead end. It started with mobile, where battery life was the main issue. Now it's the desktop, where clock speed and memory performance are key. Soon it will be the server market, where 128 or 256 core ARM CPUs will trundle along feeding us our data using less electricity and crucially requiring less cooling. That's why Nvidia want to buy ARM.
Posted by LL
 - March 13, 2021, 21:45:06
Estimate?! Someone guessing is now an estimate?
Posted by Mate
 - March 13, 2021, 18:52:37
#8
In the old days computers had RAM measured in kilobytes. In those times the memory footprint of application code was important. It's one of the main reasons why we are using the CISC-based x86 architecture for personal computers. Memory was expensive and RISC CPUs needed more RAM. Now it's not a problem anymore. Additionally, instructions are now only a small part of the memory used by a program - in consequence, the difference between the memory footprints of CISC (x86) and RISC (ARM) is significantly smaller than 20 or 30 years ago.
Posted by 8&8
 - March 13, 2021, 18:11:09
RISC-V, and that is still in silicon; note that carbon nanotube processors already exist using this tech...

Posted by Dorby
 - March 13, 2021, 16:11:06
Quote from: _MT_ on March 13, 2021, 14:48:02
Quote from: george on March 13, 2021, 11:21:10
can someone explain in layman terms how is this possible?
I don't know if this is layman enough, but one thing you need to understand is that frequency impacts efficiency. It stems from the relationship between voltage and frequency. The higher the operating frequency, the higher the voltage required to ensure stable operation. Power rises with the square of voltage. That's why this relationship is a very important characteristic of a processor from an efficiency standpoint. Firestorm cores operate at a much lower frequency than their x86 counterparts under heavy load. They operate much closer to the optimum efficiency point. As a rule of thumb, 3 GHz is roughly considered a knee point for efficiency in modern x86 processors. Meaning that efficiency quickly deteriorates beyond this point. That's why base frequencies of mobile processors are where they are. Running around 5 GHz is definitely not good for efficiency.
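The voltage/frequency point can be illustrated with the usual first-order model for switching power, P = C * V^2 * f. The sketch below is only an illustration: the effective capacitance and the two voltage/frequency operating points are made-up round numbers, not measurements of any real processor.

```python
def dynamic_power(capacitance_f: float, voltage_v: float, frequency_hz: float) -> float:
    """First-order switching power in watts: P = C * V^2 * f."""
    return capacitance_f * voltage_v ** 2 * frequency_hz


C_EFF = 1e-9  # assumed effective switched capacitance in farads (illustrative)

low = dynamic_power(C_EFF, 0.9, 3e9)   # assumed operating point: 0.9 V at 3 GHz
high = dynamic_power(C_EFF, 1.3, 5e9)  # assumed operating point: 1.3 V at 5 GHz

print(f"frequency ratio: {5e9 / 3e9:.2f}x")
print(f"power ratio:     {high / low:.2f}x")
```

With these assumed numbers the clock goes up by about two thirds while power more than triples, because the higher voltage enters squared on top of the higher frequency.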

There are two basic approaches to increasing performance of a processor. Working faster (frequency) and doing more simultaneously (so called width). Historically, these two approaches were considered mutually exclusive. Modern processors combine them. But they still pull in different directions. M1 is a wider design. Widest on the market. Actually, modern x86 designs are internally not that dissimilar. The challenge is that you've got to feed the core with instructions. In the case of an x86 processor, it means decoding x86 instructions and turning them into internal micro instructions. But the instruction set is very complex. And there is a big complication in the form of variable length of instructions. You don't know where the next instruction starts without looking at the previous instruction. Which complicates the design of decoders. And it makes going as wide more challenging. This is where SMT comes in. By processing two (or more) threads simultaneously, you provide more instructions for the core to chew on, working its magic (like out-of-order execution, optimizing utilization of resources).
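The decoding problem can be sketched with a toy example. The encoding here is hypothetical and much simpler than real x86: the first byte of every instruction simply states its total length. With fixed-width instructions every start address is known up front, so many decoders can work in parallel. With variable-length instructions each start depends on the length of the one before it.

```python
from typing import List


def fixed_width_starts(code: bytes, width: int = 4) -> List[int]:
    """Start offsets for fixed-width instructions: computable without reading the code."""
    return list(range(0, len(code), width))


def variable_length_starts(code: bytes) -> List[int]:
    """Start offsets for the toy variable-length encoding (first byte = total length)."""
    starts = []
    pos = 0
    while pos < len(code):
        starts.append(pos)
        length = code[pos]  # must inspect this instruction to find the next one
        if length == 0:
            raise ValueError("invalid zero-length instruction in toy encoding")
        pos += length
    return starts


fixed = bytes(12)                               # three 4-byte instructions
variable = bytes([1, 3, 0xAA, 0xBB, 2, 0xCC])   # lengths 1, 3 and 2

print(fixed_width_starts(fixed))
print(variable_length_starts(variable))
```

The loop in variable_length_starts is inherently sequential, which is the point being made above: real decoders need extra hardware (or guesses) to find several instruction boundaries in the same cycle.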

Also, the relationship between frequency and performance is not exactly straightforward. A processor is faster than memory. A big factor in performance is how much time you spend waiting for data. The higher the frequency, the more cycles get wasted by waiting. And again, SMT can come to the rescue, masking latency.
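A quick back-of-the-envelope calculation shows why more cycles get wasted at higher frequency. The 100 ns memory latency below is an assumed round figure, not a measurement. The only fact used is that 1 GHz equals one cycle per nanosecond.

```python
def stall_cycles(latency_ns: float, freq_ghz: float) -> float:
    """Core cycles that elapse while the core waits latency_ns for data."""
    # 1 GHz is one cycle per nanosecond, so cycles = nanoseconds * GHz.
    return latency_ns * freq_ghz


DRAM_LATENCY_NS = 100  # assumed round figure for a trip to main memory

for ghz in (3, 5):
    cycles = stall_cycles(DRAM_LATENCY_NS, ghz)
    print(f"{ghz} GHz: {cycles:.0f} cycles pass during one {DRAM_LATENCY_NS} ns wait")
```

The wait takes the same wall-clock time either way, so the faster core simply idles for more cycles unless something like SMT gives it other work to do in the meantime.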

Apple's designs are extreme in more ways than one. It's hard to say what's going on if you don't have access to internal information. But those are the two main factors. They are so efficient mainly because they run at such a low frequency compared to x86 processors. And they are competitive because they're very wide yet their frequency isn't too low. There are many little things going on. Consider how Apple focuses on low latency (many benchmarks are latency sensitive). And then there is the fact that Apple uses the most advanced manufacturing process on the market, better than what AMD and Intel are using. Which means higher efficiency and being able to cram more transistors into the same space.

Desktop processors consume 100+ W primarily because they can. Efficiency is more of a consequence of increasing performance rather than a target. The power budget for a high-performance personal computer can be over 500 W. Mobile x86 processors are derivatives of desktop designs (and power-saving technologies do trickle back). Here we see the opposite: designs from the highly power-constrained world of mobile phones upscaled into the world of personal computers. A single core in my desktop computer has a higher power budget than an entire iPad. And Apple is succeeding in this effort, which was considered very difficult if not borderline impossible. And it doesn't look like the design is running out of breath.
Wow, as a layman this was really helpful. Thanks! :)
Posted by Reddragon72
 - March 13, 2021, 16:09:30
Quote from: Lucas on March 13, 2021, 15:49:25
The only problem with the new Mac CPUs is that they use RISC instructions, which require a lot of RAM, and if the user buys a machine with 8 GB then a lot of data is written to the internal SSD, which in turn reduces its lifespan. This is not theorizing, it's being observed on the new laptops as we speak, where SSDs will potentially be used up in 2-3 years.
It would not be a big problem if Apple did not use soldered SSDs, so once one dies you need a new laptop.

Not exactly right here. ARM processors do more with less RAM. You're thinking of old, and I mean first-gen, PowerPC chips. There is a reason Android is popular, and that is that it is cheap to buy and doesn't require a ton of RAM. Today Android and iOS use way less RAM than Windows does, and they run more processes in the background than Windows. They also have a much smaller and less exercised disk cache or paging cache than Windows. RISC instructions today require less CPU and less RAM to perform the same task than CISC. In fact, I am running a Linux distro on my LG phone alongside Android, and the Linux distro on my phone is faster than my 8-core x86 desktop Linux distro. And my phone distro runs everything with only 2 GB of shared RAM, while the desktop chews up 6 of my 16 just for the distro.

ARM would dominate if it were designed like the x86 world and you could mix and match parts to your heart's content.