Comment by tlb
The 128- and 256-core ARM server chips (like from Ampere) are pushing server performance in interesting ways. They're economically viable now for trivially parallelizable things like web servers, but possibly game-changing if your problem can put that many general-purpose cores to work.
The thing is, there aren't that many HPC applications for that level of parallelism that aren't better served by GPUs.