Welcome to Software Development on Codidact!

Will you help us build our independent community of developers helping developers? We're small and trying to grow. We welcome questions about all aspects of software development, from design to code to QA and more. Got questions? Got answers? Got code you'd like someone to review? Please join us.

Post History

60%

+1 −0

Q&A How does the strict aliasing rule enable or prevent compiler optimizations?

Aliasing is perhaps better understood as a property of assignments through pointers, not of the pointers themselves. First, let's consider the reasoning behind applying the optimization to the fir...

posted 7mo ago by Karl Knechtel‭

Answer

#1: Initial revision by

Karl Knechtel‭ · 2024-11-16T21:00:01Z (7 months ago)

Copy Link

Raw

Markdown

Aliasing is perhaps better understood as a property of *assignments* through pointers, not of the pointers themselves.

First, let's consider the reasoning behind applying the optimization to the first example.

In the first code block, the underlying storage of `b` is only assigned to via the `b` pointer. Since this assignment occurs through a `long *`, but `a` is an `int *`, the strict aliasing rule means that the compiler may assume that the assignment does not affect the underlying storage of `a`.

The fundamental idea behind "compiler optimizations enabled by undefined behaviour" is that the compiler may *act as though undefined behaviour never actually happens in any program* (even if it could statically prove otherwise for the current program).

Therefore, we may reason:

1. If we know that `a` and `b` refer to different locations in memory, then we can conclude that modifying `b`'s memory doesn't modify `a`'s memory. Thus, the value stored at `a` hasn't changed since it was copied into `t`; therefore `*a - t` is equivalent to `t - t`, which is `0` for all integers.

1. Suppose we don't know whether `a` and `b` refer to different locations in memory, but we do know that *if they refer to the same location, then the program has undefined behaviour*. Then we may proceed *as if we knew* they refer to different locations. Why? Because if they refer to different locations, then obviously the result will be correct; and if they refer to the same location, then it doesn't matter what the resulting code does. Either way, the code conforms to the standard.

1. Suppose we don't know whether `a` and `b` refer to different locations in memory, in general. If there is an incompatible pointer assignment between those memory regions, then it would be undefined behaviour for them to be the same location; therefore by the previous step we assume they are different locations. But *if there are only compatible pointer assignments, then we can make no assumptions*.

Now we can understand why this reasoning doesn't apply to the second example. *It is allowed for `a` and `b` to refer to the same memory location, as long as no assignment occurs to that underlying memory through those pointers*. In this code, the underlying storage of `b` is modified via type punning - the *only assignment occurs through compatible pointers*. (The "loophole" exposed by pointer-to-character types is simply that *they are compatible with all other pointer types*.)

In the second code example, if we have a call like

```
int main() {
long data = 1;
f(reinterpret_cast<int*>(&data), &data);
}
```
then we don't invoke undefined behaviour: `sizeof(long) >= sizeof(int)`, so there is enough valid data to make use of `a` within the function; and the function doesn't perform an incompatible pointer assignment, therefore it's acceptable that both pointers are to `data`'s memory.

Therefore, the code should compile successfully, *without* applying the optimization. Depending on the platform endianness (and on whether `long` is a larger data type than `int` on the platform), the type-punning may cause the `data` to be zeroed out, such that the retrieved value of `*a` may differ from the value stored in `t`, such that the result may validly be `-1`.

Whereas if `f` is implemented as the first code block example, the call invokes undefined behaviour. The optimization may be performed, and this would cause `0` to be returned regardless of the platform (even though it *should* return `-1` on some platforms). This is a consequence of the undefined behaviour, and one of many examples where it doesn't crash the program.

Communities

Post History