AMD EPYC Milan-X specifications leak, revealing up to 64 cores, 280W TDP, 768MB L3 cache.

Published by

Click here to post a comment for AMD EPYC Milan-X specifications leak, revealing up to 64 cores, 280W TDP, 768MB L3 cache. on our message forum
https://forums.guru3d.com/data/avatars/m/80/80129.jpg
I wonder if they are going to go the IBM route with virtual caches in the future. Seems like a much better solution for enterprise.
https://forums.guru3d.com/data/avatars/m/248/248627.jpg
I think amd is really onto something now with all the add on cache they're going to be throwing onto even their desktop CPU's
https://forums.guru3d.com/data/avatars/m/229/229509.jpg
That's a LOOOOOOT of cache! Is this like the L4 that intel stuck onto the desktop Broadwell parts then, or something else? I remember that giving a rather spicy boost to performance that kept the architecture on par with far more recent things...
https://forums.guru3d.com/data/avatars/m/271/271560.jpg
Denial:

I wonder if they are going to go the IBM route with virtual caches in the future. Seems like a much better solution for enterprise.
yes for traditional enterprise you are probably right as long as "better" means cheaper. but for high tech enterprise nothing beats on die cache. and massive on die cache at that.
data/avatar/default/avatar36.webp
Cache is king
https://forums.guru3d.com/data/avatars/m/268/268248.jpg
Hehe i guess we are not more than a year ot two to get cpus with 1gb cache ! Not on desktop thought 😛
https://forums.guru3d.com/data/avatars/m/80/80129.jpg
tunejunky:

yes for traditional enterprise you are probably right as long as "better" means cheaper. but for high tech enterprise nothing beats on die cache. and massive on die cache at that.
It is on die - 256MB of Shared L2/L3 virtual cache per 8 core chip. It obviously comes with a latency penalty but the benefits of the L3 being shared with the L2 supposedly outweighs that.
https://forums.guru3d.com/data/avatars/m/248/248994.jpg
Denial:

It is on die - 256MB of Shared L2/L3 virtual cache per 8 core chip. It obviously comes with a latency penalty but the benefits of the L3 being shared with the L2 supposedly outweighs that.
The latency is still, surely, an order of magnitude lower than accessing DDR?
https://forums.guru3d.com/data/avatars/m/266/266726.jpg
BLEH!:

That's a LOOOOOOT of cache! Is this like the L4 that intel stuck onto the desktop Broadwell parts then, or something else? I remember that giving a rather spicy boost to performance that kept the architecture on par with far more recent things...
This is better, this is legit l3 cache, thats added on top of the existing cache and connected via tsvs (Through Silicon Via), there is no latency penalty, unlike an l4 cache which would be slower.
https://forums.guru3d.com/data/avatars/m/271/271560.jpg
Venix:

Hehe i guess we are not more than a year ot two to get cpus with 1gb cache ! Not on desktop thought 😛
can you imagine (crazy sci - fi music) we're living in the fuuuutuuure
https://forums.guru3d.com/data/avatars/m/246/246171.jpg
I'd be really curious to see what would happen if someone were to [somehow] put in a 64MB (not a typo) stick of RAM in, install Windows XP, and play some early 2000s game just to see how it runs. Obviously it's utterly useless, but it's weird to think about having more cache than you have RAM, and enough cache to actually run your entire system and a simple game. Hard to wrap your head around.
https://forums.guru3d.com/data/avatars/m/229/229509.jpg
schmidtbag:

I'd be really curious to see what would happen if someone were to [somehow] put in a 64MB (not a typo) stick of RAM in, install Windows XP, and play some early 2000s game just to see how it runs. Obviously it's utterly useless, but it's weird to think about having more cache than you have RAM, and enough cache to actually run your entire system and a simple game. Hard to wrap your head around.
CacheRAMdrive? Cachedrive?
https://forums.guru3d.com/data/avatars/m/80/80129.jpg
Kaarme:

The latency is still, surely, an order of magnitude lower than accessing DDR?
Yeah - the idea behind IBM's implementation is that it massively reduces L3 latency at the expense of slightly increased L2. If you get lucky, you can hit L3 for nearly identical latency as L2 but in the vast majority of scenarios the latency of your L3 is going to be significantly faster than traditional L3 - for example the L3 seen here. In traditional PC workloads you're not going to see much of a benefit but in use cases IBM's cloud servers, or Epyc's servers are being used, having a fast L3 is extremely useful. Keep in mind IBM's system is also scaling up to ~512mb of combined L2/L3 per module and 8GB of cache (virtual L4 in this case) in a 32 chip system. AMD's stacking here would be adjacent to this.
data/avatar/default/avatar40.webp
In before 1-2 gigs of cache and cache drives - can it run crysis from cache ?