Intel(R) Tuning Assistant topicMis-aligned Memory References

On Intel(R) Pentium(R) M processors, a misaligned access that crosses a cache line boundary does incur a penalty. A Data Cache Unit (DCU) split is a memory access that crosses a 64-byte line boundary. Unaligned accesses may cause a DCU split and stall Pentium M processors.

Note

Some algorithms (such as the Motion Estimation and Motion Compensation algorithms, found in video codecs) by nature tend to cause frequent misaligned references. In some cases alternative coding strategies that reduce misaligned references may actually require more total clock cycles.

This insight is relevant when:
The quotient of the counters (Misaligned Data Memory Reference/Data Memory References (all)) is poor. A value of 0.00 should be considered good, and a value of 0.002 should be considered poor.

Advice:

If you are writing in assembly:

If you are writing in a higher-level language: