Implement 64-bit atomic ops for ARMv6K+.

This implements the "quasi-atomic" 64-bit operations using LDREXD/STREXD
when those instructions are available (e.g. our ARMv7-A devices).

Also, folded in an expanded version of AtomicSpeed.c that has been
floating around my hard drive for a while.

(This time, without Condition Code Roulette.)

Change-Id: I90803794994373e2c620be423c081f0da4469fdf
4 files changed