In a previous entry, I started discussing the issues related to memory bandwidth for a read-only kernel on a sample AMD Opteron system. The naive implementation gave a performance of 3.393 GB/s when compiled at “-O1” (hereafter “Version 001”) and 4.145 GB/s when compiled at “-O2” (hereafter “Version 002”). Today… read more
Computer Hardware
Optimizing AMD Opteron Memory Bandwidth, Part 1: single-thread, read-only
Opteron/PhenomII STREAM Bandwidth vs CPU and DRAM Frequency
In a recent post (link) I showed that local memory latency is weakly dependent on processor and DRAM frequency in a single-socket Phenom II system. Here are some preliminary results on memory bandwidth as a function of CPU frequency and DRAM frequency in the same system. This table does not… read more