Investigations into Socket 939 Athlon 64 Overclocking
by Jarred Walton on October 3, 2005 4:35 PM EST- Posted in
- CPUs
RAM Latency
We talked about memory latency a bit before, and we estimated the total latency. Using CPU-Z and the included Latency.exe utility, we can get an actual real value for memory latency - or at least, more real than our estimate. There are different ways of measuring memory latency, but we simply took the highest reported value from the table that was generated. For every system, that value was in the bottom-right corner of the table, indicating a data set of 32MB and a stride of 512. While the values may or may not be entirely accurate, they should at least be consistently measured across all the tested configurations. Here are the results, in CPU cycles as well as nanoseconds. Remember that at higher CPU speeds, each cycle is faster, so pay more attention to the nanoseconds graph.
The value RAM is obviously slower than the performance RAM across the tests. This is not at all surprising. You can see how the latency of the performance RAM trends downward until we were forced to drop to 2T or PC2700 in order to reach a stable result, while the value RAM fluctuates more. Every time that we have to drop the speed, i.e. from PC3200 to PC2700, there is an initial increase in latency, followed by a trend downward until we have to drop to the next RAM ratio.
While the value RAM is clearly slower than the expensive RAM, the big question is: how much performance do you actually lose by opting for more economical RAM? This is one of the areas that we are interested in testing for this article, and we'll comment on the results throughout the benchmarks.
RAM Possibilities
One area of the BIOS that's missing (for now) is support for additional RAM ratios. The reality is that the RAM speed is derived from the CPU clock with a divider. Ideally, we'd like to see a BIOS that gives direct access to the CPU divider rather than hiding it behind approximate memory speeds. That would allow for the selection of a larger range of options, but we're not sure if that's something that AMD controls in the CPU or if the BIOS programmers can do this. Here's a list of the standard RAM Settings, CPU multipliers and the resultant memory dividers that were available in the Infinity BIOS.
We have a LanParty UT nF3 250Gb that includes support for many in-between options from DDR200 through DDR500. Some of the interesting inclusions are DDR240, DDR280, DDR300, DDR360, DDR440, and DDR500. (CPU-Z actually failed to report the divider on a couple of those settings, but the resultant RAM speed was still read properly.) While additional memory dividers on the high end won't really help tweakers looking to get the most from the TCCD, BH5, or CH5 RAM, they can be particularly useful when using value RAM. All you need to do is select the appropriate divider to get your RAM under DDR400 - assuming standard value RAM. That way, you wouldn't take as much of a performance hit by running something like 10x250 (2.5 GHz). You could select a 13X divider rather than the standard 15X divider.
Additional ratios can even be useful for tuning performance RAM. For example, OCZ VX would not run stable above DDR500 (10x250), and we had to use 2-3-3-8-1T timings even then. (The 3.2V limit of the motherboard was at least partially to blame.) Running at 10x270, we had to drop to DDR333 (CPU/12), which resulted in the RAM running at DDR450 rather than the DDR540 result that would have been required for the normal CPU/10 ratio. However, if we could have selected a CPU/11 ratio, we could have run the RAM at DDR490 and gained a bit more performance. The additional ratios aren't required, but they would be nice to have.
If the last two paragraphs didn't make sense, then you can guess why we don't get additional access to RAM dividers. Experienced users might know how to make use of the settings, but many people would simply get confused; a "Catch-22".
We talked about memory latency a bit before, and we estimated the total latency. Using CPU-Z and the included Latency.exe utility, we can get an actual real value for memory latency - or at least, more real than our estimate. There are different ways of measuring memory latency, but we simply took the highest reported value from the table that was generated. For every system, that value was in the bottom-right corner of the table, indicating a data set of 32MB and a stride of 512. While the values may or may not be entirely accurate, they should at least be consistently measured across all the tested configurations. Here are the results, in CPU cycles as well as nanoseconds. Remember that at higher CPU speeds, each cycle is faster, so pay more attention to the nanoseconds graph.
The value RAM is obviously slower than the performance RAM across the tests. This is not at all surprising. You can see how the latency of the performance RAM trends downward until we were forced to drop to 2T or PC2700 in order to reach a stable result, while the value RAM fluctuates more. Every time that we have to drop the speed, i.e. from PC3200 to PC2700, there is an initial increase in latency, followed by a trend downward until we have to drop to the next RAM ratio.
While the value RAM is clearly slower than the expensive RAM, the big question is: how much performance do you actually lose by opting for more economical RAM? This is one of the areas that we are interested in testing for this article, and we'll comment on the results throughout the benchmarks.
RAM Possibilities
One area of the BIOS that's missing (for now) is support for additional RAM ratios. The reality is that the RAM speed is derived from the CPU clock with a divider. Ideally, we'd like to see a BIOS that gives direct access to the CPU divider rather than hiding it behind approximate memory speeds. That would allow for the selection of a larger range of options, but we're not sure if that's something that AMD controls in the CPU or if the BIOS programmers can do this. Here's a list of the standard RAM Settings, CPU multipliers and the resultant memory dividers that were available in the Infinity BIOS.
Standard AMD Memory Ratios | |||
RAM Setting | CPU Multiplier | RAM Divider | True RAM Speed |
DDR200 | 9X | CPU/18 | DDR200 |
DDR266 | 9X | CPU/14 | DDR257 |
DDR333 | 9X | CPU/11 | DDR327 |
DDR400 | 9X | CPU/9 | DDR400 |
DDR200 | 10X | CPU/20 | DDR200 |
DDR266 | 10X | CPU/15 | DDR267 |
DDR333 | 10X | CPU/12 | DDR333 |
DDR400 | 10X | CPU/10 | DDR400 |
DDR200 | 11X | CPU/22 | DDR200 |
DDR266 | 11X | CPU/17 | DDR259 |
DDR333 | 11X | CPU/14 | DDR314 |
DDR400 | 11X | CPU/11 | DDR400 |
DDR200 | 12X | CPU/24 | DDR200 |
DDR266 | 12X | CPU/18 | DDR267 |
DDR333 | 12X | CPU/15 | DDR320 |
DDR400 | 12X | CPU/12 | DDR400 |
We have a LanParty UT nF3 250Gb that includes support for many in-between options from DDR200 through DDR500. Some of the interesting inclusions are DDR240, DDR280, DDR300, DDR360, DDR440, and DDR500. (CPU-Z actually failed to report the divider on a couple of those settings, but the resultant RAM speed was still read properly.) While additional memory dividers on the high end won't really help tweakers looking to get the most from the TCCD, BH5, or CH5 RAM, they can be particularly useful when using value RAM. All you need to do is select the appropriate divider to get your RAM under DDR400 - assuming standard value RAM. That way, you wouldn't take as much of a performance hit by running something like 10x250 (2.5 GHz). You could select a 13X divider rather than the standard 15X divider.
Additional ratios can even be useful for tuning performance RAM. For example, OCZ VX would not run stable above DDR500 (10x250), and we had to use 2-3-3-8-1T timings even then. (The 3.2V limit of the motherboard was at least partially to blame.) Running at 10x270, we had to drop to DDR333 (CPU/12), which resulted in the RAM running at DDR450 rather than the DDR540 result that would have been required for the normal CPU/10 ratio. However, if we could have selected a CPU/11 ratio, we could have run the RAM at DDR490 and gained a bit more performance. The additional ratios aren't required, but they would be nice to have.
If the last two paragraphs didn't make sense, then you can guess why we don't get additional access to RAM dividers. Experienced users might know how to make use of the settings, but many people would simply get confused; a "Catch-22".
101 Comments
View All Comments
edlight - Tuesday, January 3, 2006 - link
I've found a way to overclock and retain the Power Now/Cool'n'Quiet.I let the motherboard do it's Cool'n'Quiet thing but I don't load the AMD driver. I run CrystalCPUID, which lets me set up the multiplier and voltage of each of the 3 cpu steps.
The voltage setting of the motherboard, for my Gigabyte, has to be on Auto for Crystal to be able to change it.
The highest Crystal can go with my 1.4v 3000+ Winnie is 1.45v.
This let me take it up to 3800+ -- a speed of 2.4. 240 x 10.
So it's running at 1.2 -- 240 x 5 -- most of the time. I set it at 1.2v there and froze it there and p95'd it overnight, as I did to the other 2 speeds.
For me this is a great compromise between running "cool 'n quiet" and high performance.
It's only a small percentage speed jump to 2.6, but requires alot of voltage and heat.
I can't say what the maximum voltage would be for a Venice. Crystal lets me choose higher voltages than 1.45, but it doesn't actually set them.
RaulAssis - Wednesday, December 21, 2005 - link
Some people reported that the Cool 'n' Quiet feature could work in a OC system. Maybe not all bioses support correct scaling of voltages when the system is OC and the Cool 'n' Quiet feature is turned on.JarredWalton - Wednesday, December 21, 2005 - link
With any moderate OC, CnQ is going to cause problems. It dynamically adjusts multipliers and voltages... something that will usually screw up an overclocked system. I would strongly discourage trying to use CnQ with an OC'ed setup. Some motherboard BIOSes actually disable CnQ automatically if you enable overclocking features.mrmoti - Wednesday, November 30, 2005 - link
If I understand correctly:Performance RAM running at DDR400 2-2-2-8 and Value RAM running at DDR400 2.5-3-3-8
At same OC on the processor, Performance RAM outperformed the Value RAM by 5% to 10%, being the price something between 80% to 100% more.
So, what's the impact of runnig faster memory at high lateny? Say DDR500 at 3-4-4-8
Because looking at the table of estimated latencies, (Performance) DDR400 2-2-2-5 has an estimated latency of 46.5, where (Value) DDR400 2.5-3-3-7 has an estimated latency of 49.75, an improve of 6.5% being in the range of 5% to 10% better.
By the same table, DDR500 3-4-4-8 has an estimated latency of 42.4, an improvement of 8.8% over the Performance DDR400 and 14.7% over the Value DDR400, based only in latencies.
Can anybody run a benchmark confirming/denying this?
Being the case that the price of DDR500 with those timings is in the middle between Performance and Value RAM
T Rush - Sunday, October 16, 2005 - link
One of the main focuses of this article seems to be value -vs- performance RAMs when over clocking, but you chose to run the performance RAM at settings where is doesn't perform, shame on you Jarred Walton, very disappointedIf you look at the settings you used to test the two RAMs at...
http://images.anandtech.com/reviews/cpu/amd/athlon...">http://images.anandtech.com/reviews/cpu...niceover...
...you see that the MAX speed you where able to run the OCZ Rev2 at was not in it's "performance envelope", as the OCZ Rev2 is one of the worst performers in this speed range http://www.anandtech.com/memory/showdoc.aspx?i=256...">http://www.anandtech.com/memory/showdoc.aspx?i=256... <OCZ Rev2 at 266MHz, and all the other "performance RAMs" beat it
But if you look at how the OCX Rev2 does work at much higher speeds, where it does perform...
http://www.anandtech.com/memory/showdoc.aspx?i=256...">http://www.anandtech.com/memory/showdoc.aspx?i=256...
...you find that it is performing much differently than what you tested at, and would have shown a much larger performance lead over the value RAM
How did you get this on to Anandtech? How could you show such a bad comparison of value -vs- performance RAM on a site which has always shown so much information about how these RAMs perform?
I not only blame you, but also the editors for not catching how badly you have managed to make performance RAM look. It is clear you were trying to prove that cheap RAM can falsely perform as well as high-end performance RAMs. If you truly wanted to show what performance RAM can do when over clocking you either needed to run the OCZ Rev2 at much faster speeds, or use a different performance RAM that works well at the sub 270MHz speeds you tested at.
JarredWalton - Sunday, October 16, 2005 - link
This is one set of RAM run through extensive tests on one platform. I've seen the same RAM run faster in some other systems, but not a whole lot. Just because some DIMMs reach DDR636 doesn't mean that all of them do. I could run this RAM at 3-4-4-8-2T timings at DDR600, but it actually ran worse than 2.5-3-3-8-1T with the lower memory ratio.What is clear is that I wasn't trying to "prove" anything. I was running some comparison tests with a system using two different types of RAM, and I'm sorry that you don't like the results. What I did prove was that someone one a budget could build a very fast system. An FX or San Diego core with higher quality RAM and a better motherboard would be better overall, but price/performance it would get stomped by this <$1000 setup.
T Rush - Monday, October 17, 2005 - link
I don't find the OCZ Rev2 to be a good example of the high performance RAM everyone thinks it is, as it doesn’t perform well at the speeds you (and most everyone else) use…not compared to other good over clocking RAMsGranted some of the other performance RAMs do cost much more than the value RAMs, and even more than the OCZ Rev2, but they would have shown a greater performance difference than the value RAM which in your tests was not able to run any faster than its stock rated speeds or timings
Your testing shows that running RAM at faster speeds adds very little performance over stock speed value RAM, and that is because the timings/speed relationships of that peculiar performance RAM at those peculiar higher speeds were not good.
As I said before, all the other performance RAMs beat it, and perform much better at speeds under 270MHz than the OCZ Rev2 does
Using the right RAM at the right speeds to run the best timings is the true art to over clocking, as RAM timings and speeds can allow the AMD64 to perform at much higher levels when over clocked
I do not disagree with your results, as that is how those RAMs perform:
Value RAM only being able to run its rated speeds and timings, not being able to over clock at all.
OCZ Rev2 running at higher speeds but with such bad timings that it shows very little performance gains unless you are able to run it at CAS 2.5 in the 300MHz range
I am not a fan of the OCZ Rev2 because of this, but I am a fan of performance RAM over value RAM, even on a budget system.
If you read clue22’s reply “so basically what the everybody is saying about the value RAM vs. low latency more expensive RAM is that for the athlon 64 it is basically a waste of money (i.e. you only get about 5% performance gain), but usually spend 100% or more money to get the "better" RAM.”
…and cyptonomicon’s “and its nice to see those ram comparisons. good to see those results on the latest a64 platform and confirm once again that the ram makes only a few percentage points difference”
…next intellon’s “I understand how/why the memory quality is not too imoprtant (5-9% increase for 100 bucks = not worthy)”
Clearly by using the OCZ Rev2 you did not show what spending a little bit more for better performing RAM can do. You have shown that running RAM at speeds with timings where it doesn’t perform well is a waste of money, but this does not answer any questions about value -vs- performance RAM
JarredWalton - Monday, October 17, 2005 - link
I've got X2 benchmarks with four different types of RAM in the works:OCZ VX
OCZ Plat Rev2 (TCCD)
Mushkin Value
PDP 2x1GB 2-3-2-5-1T
Other than the fact that 2GB of RAM helps out certain tasks (BF2 load times!), the total performance difference with those configurations is still not huge. With a 3.5V RAM voltage, the VX would do better, but even then the difference isn't above 10%.
T Rush - Thursday, October 20, 2005 - link
for a budget system I would say the socket 754 is better...as the motherboards and CPUs are cheaper...and you can get ClawHammer CPUs with the larger performance 1MB L2 cachethe only thing you miss out on with the 754 is the dual channel memory mode(which only adds very little performance anyway)...but by over clocking the core:memory speed you can easily match the performance gained by the greater bandwidth of the dual channel mode (this could be why the socket 939 doesn't show large gains from overclocking with the memory 'in-sync'..as it can't use all the bandwidth the faster memory gives)
with a mid-range system you could pick a 10X multi 3200+ Venice, or even a 11X multi 3500+ Venice(either of those would have a much better CPU multi for overclocking than the 3000+'s 9X multi) which would allow you to keep the HTT/HTL speed at a more reasonable level(270MHz and 245MHz to reach 2.7GHz CPU speed...where a 9X multi CPU would need a 300MHz HTT speed to run the CPU that fast)
...but the 3200+ and 3500+ are costing $190 to $250...so for not much more you could have a performance San Diego core on the 939 platform (3700+ 11X multi SD is only $267 now)
...so for $80 to $40 more I would go for the larger San Diego core...I would also spend the ~$20 more for CAS 2 RAM (over $90 Value RAM)...thats like just $100...for a computer with much better parts...and say you use this system for 18 months, that works out to less than $6 a month for a using higher quality parts
Deathcharge - Saturday, October 15, 2005 - link
Jarred this was a great article and did come at a great time as i am in the market for buying a bang for the buck system. One thing you didnt mention (although i saw that in the CPU-z screen shots) is the CPU steppinghttp://www.amdcompare.com/us-en/desktop/default.as...">http://www.amdcompare.com/us-en/desktop/default.as...
the 3200+ venice core comes in 3 different stepping and i belive the one you used in your article is the E3 stepping which is being replaced with the E6 stepping. Any info on how well the new stepping OC? initial reports from around the net indicate that it doesnt OC very well for some reason would love to read your comments on this.
Do you know if it is possible to OC to 2500 or 2600O with stock HSF as i would really like to save the money spent on the TT-90 and get a 7800GT (as opposed to x800xl). one final thing would OCZ value VX require active cooling?
thanks and keep up the good work, really enjoyed reading it and would look forward to future articles