John McCalpin's blog – Page 2 – Dr. Bandwidth explains all….

August 28, 2023, Filed Under: Cache Coherence Implementations, Cache Coherence Protocols, Computer Architecture

“Memory directories” in Intel processors

One of the (many) minimally documented features of recent Intel processor implementations is the “memory directory”. This is used in multi-socket systems to reduce cache coherence traffic between sockets. I have referred to this in various presentations as: “A Memory Directory is one or more bits per cache line… read more

April 25, 2023, Filed Under: Computer Architecture, Computer Hardware, Performance

The evolution of single-core bandwidth in multicore processors

The primary metric for memory bandwidth in multicore processors is that maximum sustained performance when using many cores. For most high-end processors these values have remained in the range of 75% to 85% of the peak DRAM bandwidth of the system over the past 15-20 years — an amazing accomplishment… read more

April 11, 2022, Filed Under: Computer Architecture, Computer Hardware, Computer Interconnects

Why don’t we talk about bisection bandwidth any more?

Is it unimportant just because we are not talking about it? I was recently asked for comments about the value of increased bisection bandwidth in computer clusters for high performance computing. That got me thinking that the architecture of most internet news/comment infrastructures is built around “engagement” — effectively amplifying… read more