Zen and Vega DDR4 Memory Scaling on AMD's APUsby Gavin Bonshor on June 28, 2018 9:00 AM EST
- Posted in
- Raven Ridge
- Ryzen 3 2200G
- Ryzen 5 2400G
We have previously explored the importance of memory scaling within AMDs Ryzen CPUs: the question being answered today is how much of an effect on performance does the memory frequency have when Zen is paired with AMD’s own Vega graphics core. We run a complete suite of tests on AMD's Ryzen 3 2200G ($99) and Ryzen 5 2400G ($169) APUs with memory speeds from DDR4-2133 to DDR4-3466 using a kit of G.Skill Ripjaws V.
Recommended Reading on AMD Ryzen APUs
- Marrying Vega and Zen: The AMD Ryzen APU Review
- Overclocking The AMD Ryzen APUs: Guide and Results
- Delidding The AMD Ryzen APUs: How To Guide and Results
- AMD Ryzen 5 2400G and Ryzen 3 2200G Core Frequency Scaling: An Analysis
- Best CPUs for Gaming: Q2 2018
Memory Scaling on AMD Ryzen APUs
While adding Vega to Zen may be a new concept, the premise of the APU combing compute and graphics on the same chip remains the same. Graphics is often a memory bound operation - the speed at which the data can be accessed by the graphics is directly tied to the frame rate, and we have seen on chips in the past that the speed of the memory (or an interim cache) can vastly help accelerate the performance of the graphics. Graphics is usually the focus here, as faster memory only assists CPU workloads that are memory limited.
For example, see our articles on:
- Memory Scaling on Ryzen 7
- Memory Frequency Scaling on Intel's Skull Canyon NUC
- Memory Scaling on Haswell CPU, IGP and dGPU: DDR3-1333 to DDR3-3000
- DDR4 Haswell-E Scaling Review: 2133 to 3200
One of the main issues with memory right now is pricing. With the price of DDR4 having risen over the course of 2017 and with no signs of slowing in 2018, building a new desktop system has looked more expensive over the course of the last couple of years: the inflation of GPU pricing has also certainly contributed to those woes. While the general outlook on the current DDR4 DRAM market is that for a user wanting extra speed, more money must be spent is true, how that equates into actual performance becomes more relevant than ever before. On pricing, for example, here is a Corsair Vengeance LPX 2x8 DDR4-2666 memory kit over on Amazon:
The price of this memory when launched was $142, which decreased down to as low as $57 on sale but was an average of $75 during early 2016. Over the course of 2017 and 2018, this very popular memory kit is now trading at $179, having reached a high of $200. To put that in perspective, this kit launched at a cost of $8.88 per GB, went down as low as $3.56 per GB, and is now at $11.19 per GB. This is almost certainly a sellers market, not a buyers market. People are often spending money on capacity over speed. The goal of this article is to determine how much speed actually matters, especially when we look at lower-cost processors like the AMD Ryzen APUs.
Our APU with some other G.Skill TridentZ DRAM in an SFF test
For a user looking to build a budget system without focusing too much on high-end performance applications such as CAD or content creation, the Ryzen 3 2200 and Ryzen 5 2400 APUs has a lot to offer, especially when money is a highly limiting factor on purchasing decisions. As was within our Ryzen 5 2400G review, we concluded that AMD’s Ryzen 2000 series pairing offers the best value and performance compared against what’s currently on offer on both sides of the APU/CPU marketplace (Intel or AMD) when an iGPU is featured on chip.
Memory Scaling on APUs: More Infinity Fabric
Most of the following analysis in this section was taken from our previous Memory Scaling on Ryzen 7 article.
While we already know due to the previous testing we did with the Ryzen with what effect memory frequency has on the Zen cores, and AMD added a new element to this when it equipped the Ryzen 3 2200G and Ryzen 5 2400G with Vega. As per with the rest of the Ryzen processor range from AMD, each chip combines multiple technologies, but relatively speaking, the most relevant one which has the ability to affect memory performance on the Ryzen 2000 series is called Infinity Fabric.
The Infinity Fabric (hereafter shortened to IF) consists of two fabric planes: the Scalable Control Fabric (SCF) and the Scalable Data Fabric (SDF). The SCF is all about control: power management, remote management and security and IO. Essentially when data has to flow to different elements of the processor other than main memory, the SCF is in control. The SDF is where main memory access comes into play. There's still management here - being able to organize buffers and queues in order of priority assists with latency, and the organization also relies on a speedy implementation. The slide below is aimed more towards the IF implementation in AMD's server products, such as power control on individual memory channels, but still relevant to accelerating consumer workflow.
AMD's goal with IF was to develop an interconnect that could scale beyond CPUs, groups of CPUs, and GPUs. In the EPYC server product line, IF connects not only cores within the same piece of silicon, but silicon within the same processor and also processor to processor. Two important factors come into the design here: power (usually measured in energy per bit transferred) and bandwidth.
The bandwidth of the IF is designed to match the bandwidth of each channel of main memory, creating a solution that should potentially be unified without resorting to large buffers or delays. Discussing IF in the server context is a bit beyond the scope of what we are testing in this article, but the point we're trying to get across is that IF was built with a wide scope of products in mind. On the consumer platform, while IF isn't necessarily used to such a large degree as in server, the potential for the speed of IF to affect performance is just as high.
Test Bed and Hardware
As per our testing policy, we take a premium category motherboard suitable for the socket, and equip the system with a suitable amount of memory. With this test setup, we are using the BIOS to set the CPU core frequency using the provided straps on the MSI B350I Pro AC motherboard. The memory is set to the range of speeds as given for our testing.
|Processors||AMD Ryzen 3 2200G||AMD Ryzen 5 2400G|
|Motherboard||MSI B350I Pro AC|
|Cooling||Thermaltake Floe Riing RGB 360|
|Power Supply||Thermaltake Toughpower Grand 1200 W Gold PSU|
|Memory||G.Skill Ripjaws V
|Integrated GPU||Vega 8
|Discrete GPU||ASUS Strix GTX 1060 6 GB
1620 MHz Base, 1847 MHz Boost
|Hard Drive||Crucial MX300 1 TB|
|Case||Open Test Bed|
|Operating System||Windows 10 Pro|
With the aim being to procure a set of consistent results, the G.Skill Ripjaws V DDR4-3600 kit was set to latencies of 17-18-18-38 throughout each of the different straps tested. Due to an inability to support 100 MHz straps on our motherboard, the XMP profile was enabled in the BIOS on the memory Ripjaws V kit and the latency timings adjusted to 17-18-18-38 manually on the MSI B350I Pro AC motherboard. Each of the straps set for the aim in continuity of testing for frequency scaling.
A side note on our previous experience with memory scaling. In the past we introduced a concept of a Performance Index (PI) for each memory kit, to give a rough performance comparison metric between memory brands. This PI was defined as the data rate (such as DDR4-2400) divided by the CAS Latency (such as the 17 in 17-18-18) rounded down to the nearest whole number. In previous articles like this, typically the memory with the highest PI scored the best overall, especially in gaming, although in combinations with similar PIs, the one with the highest frequency was often ahead. We will revisit this concept later in the review.
In this review, we will be testing the following combinations of data rate and latencies:
|DDR4-2133||17-18-18||2133 / 17 = 125||1.35 V|
|DDR4-2400||17-18-18||2400 / 17 = 141||1.35 V|
|DDR4-2667||17-18-18||2667 / 17 = 157||1.35 V|
|DDR4-2866||17-18-18||2866 / 17 = 169||1.35 V|
|DDR4-3333*||17-18-18||3333 / 17 = 196||1.35 V|
|DDR4-3466||17-18-18||3466 / 17 = 204||1.35 V|
*Corresponds to XMP Profile 1 on this memory kit
AGESA and Memory Support
At the time of the launch of Ryzen, a number of industry sources privately disclosed to us that the platform side of the product line was rushed. There was little time to do full DRAM compatibility lists, even with standard memory kits in the marketplace, and this lead to a few issues for early adopters to try and get matching kits that worked well without some tweaking. Within a few weeks this was ironed out when the memory vendors and motherboard vendors had time to test and adjust their firmware.
Overriding this was a lower than expected level of DRAM frequency support. During the launch, AMD had promised that Ryzen would be compatible with high speed memory, however reviewers and customers were having issues with higher speed memory kits (DDR4-3200 and above). These issues have been addressed via a wave of motherboard BIOS updates built upon an updated version of the AGESA (AMD Generic Encapsulated Software Architecture), specifically up to version 188.8.131.52, but now preceded by 184.108.40.206 (thanks to AMD's unusual version numbering system).
Whilst the maturity of the Ryzen platform is no longer an issue generally faced, the AGESA 220.127.116.11 microcode specifically focused on supporting the new Raven Ridge Ryzen 3 2200G and Ryzen 5 2400G APUs was announced before launch; we covered these BIOS updates for AMD's Ryzen APUs back in February at launch.
The whole purpose of the today's testing is to evaluate the scalability on AMD's Zen architecture and to see if the performance can be influenced by increasing the DRAM frequency. It would be foolish to not establish the effect on gaming performance and to see if memory frequency has a direct impact on frame rates given that previous generations of AMD APU have been reported to do so.
In this article we cover:
- Overview and Test Bed (this page)
- CPU Performance
- Integrated Graphics Performance
- Discrete Graphics Performance with a GTX 1060
Post Your CommentPlease log in or sign up to comment.
View All Comments
iwod - Thursday, June 28, 2018 - linkWhen are we going to get faster memory? Haven't we stuck with DDR4 for quite long? Even with DDR5 it doesn't seems to scale well with APU's needs.
Chaitanya - Thursday, June 28, 2018 - linkBiggest problem is most modules on market are still stuck with Jedec 2133Mhz and there are hardly handful of Jedec 2666Mhz kits on market. Xmp sucks and it needs to die soon. Ddr5 it seems is just a capacity solution to Ddr4 rather than a speed problem.
Maxiking - Thursday, June 28, 2018 - linkI don't think you understand here what's going on. Jedec is just the standard, a set of predetermined memory speed settings Bios will run RAM at. It doesn't matter if rams are Jedec 2133 or Jedec 2666mhz, both specifications are painfully slow. When you run above those Jedec specs, you use XMP profiles. Nothing more, nothing less. If you set up rams in XMP and copy Jedec specs, they will offer the same performance.
If anything, it is Jedec what sucks. It takes them years to update and create new memory standards, so DIMM manufacturers have to overclock on their own via XMP.
Samus - Thursday, June 28, 2018 - linkDDR4 for too long? It’s only been around for 4 years!
Stuka87 - Thursday, June 28, 2018 - linkIt was released 4 years ago, but very little used it. As I recall only Haswell-E used in in 2014. 2016 is when DD4 pretty much became the standard and mainstream chips were using it.
Andy Chow - Friday, June 29, 2018 - linkHow is JEDEC 2133 slow? It's 17 GB/s. If you run it in quad channel, it's 68 GB/s. I seriously doubt that bottlenecks most workloads. Just look at the 3DPM results, when you're actually doing calculations, and you actually decrease performance with faster memory, probably because the JEDEC standards aren't defined so the RAM and the memory controller aren't behaving perfectly which random io read/write queues. And I bet if you used registered 2133 DDR4, you would actually see a performance increase, even through there's 2-3 more controllers in the way.
JEDEC is obviously very prudent and conservative when defining their specs, but by no way are there current specs slow. If your workload is simple and linear (gpu, compression, encryption), then DDR4 isn't the recommended RAM type, HBM is, and the HBM JEDEC specs pre-date the DDR4 ones. DDR4 is optimized for low random io latency, whereas HBM is optimized for sequential io bandwidth. Most datacenter and server workload needs are io latency bottlenecked, not bandwidth bottlenecked, so I doubt on the DDR side the next generations will increase pre-fetch sizes above what they already are, regardless of how that would benefit games, encryption or compression (the last two are asic solved in the corporate world).
bananaforscale - Saturday, June 30, 2018 - linkThis. There are *very* few workloads that benefit from more than two channels. Heck, Ryzen 2 is less memory speed dependent than Ryzen was. Now, *latency* could go way down and it would benefit stuff.
invasmani - Tuesday, July 10, 2018 - linkNot accurate and 2666MHz Cas 9 for example on a good memory kit has a amazing performance index of 296. I run 2000MHz Cas 7 w/ a performance index of 285 it's actually better overall than the kit is rated for at DDR4 3200MHz Cas 16 by a long shot.
Ket_MANIAC - Friday, June 29, 2018 - linkHow long exactly have you been stuck with DDR4 and what would you do with DDR5 on a desktop, if I may?
Andy Chow - Friday, June 29, 2018 - linkNo. We had DDR3 for 11 years before DDR4 came out. DDR4 came out in 2014. DDR5 won't come out before 2019 at the earliest, the specs aren't even final as of today, so you don't really know what you are talking about.
The reason that DDR ram will never be good with graphics compared to GDDR is because DDR is optimized for cisc operations while GDDR is optimized for risc operations. You could run GDDR on an apu (the ps4 does this), but this makes the CPU run slower.
An apu will always be a sub-standard solution performance wise. It's a solution that aims to be good for low-cost, low-power consumption, or small form factor. It will always deliver mediocre performance.