For example, there are 8 threads use an uint64 as shared memory and use different bytes in that area (without lock).
One thread can only access to a certain byte so no byte contention exist.
In this case, is it safe? And from performance perspective, is it better to use CAS to operate the whole integer rather than each bytes?