Welcome to Software Development on Codidact!

Will you help us build our independent community of developers helping developers? We're small and trying to grow. We welcome questions about all aspects of software development, from design to code to QA and more. Got questions? Got answers? Got code you'd like someone to review? Please join us.

Post History

66%

+2 −0

Q&A How does the strict aliasing rule enable or prevent compiler optimizations?

Pointer conversions and aliasing First of all, C historically allows all manner of crazy pointer conversions (whereas C++ is more restrictive) between compatible and non-compatible pointed-at type...

posted 7mo ago by Lundin‭ · edited 5mo ago by Lundin‭

Answer

#2: Post edited by

Lundin‭ · 2025-01-07T10:09:36Z (5 months ago)

Copy Link

Raw

Markdown

**Pointer conversions and aliasing**
First of all, C historically allows all manner of crazy pointer conversions (whereas C++ is more restrictive) between compatible and non-compatible pointed-at types, as well as to/from the generic `void*`. There are several reasons to allow that: type punning, generic programming etc. So within reason, for example a `float*` is allowed to point at an `int`, though things like correct alignment still have to be fulfilled.
But since we allow that, it becomes problematic to determine if a certain variable was updated at a certain point, as any pointer can in theory point at anything. A compiler does a lot of reasoning behind the lines about the internal type system, the types used by temporary expressions and so on. From a compiler's perspective, there's the term "pointer aliasing" (informal term). Two pointers can be said to alias if they may point at the same variable. Example:
```c
void func(int* x, int* y) // x and y may alias
{ ... }
int a;
int b;
func(&a, &a); // in this case x and y will point at the same variable
func(&a, &b); // in this case they point at different variables
```
Since `func` in the above example has external linkage and might sit in a different source file than the caller, there's no way for the compiler to know if `x` and `y` point at the same data - short of comparing their addresses, which would cause execution overhead. Therefore if we do `*x = 5;` inside the function, we'd have to reload `*y` when used because they alias and the write to `*x` could have changed `*y`.
---
**Strict pointer aliasing**
The rationale behind "strict pointer aliasing" (informal term), or just "strict aliasing", was to formalize the rules for how the compiler's internal type system is allowed to behave in these situations. In case we have a scenario such as
int aliased(float *f, int *i)
Then the compiler should reasonably be able to assume that if we modify `f` inside the function, we did not modify what `i` points at, because they aren't compatible types. If the compiler can't assume that, it would have to reload `i` from memory after writing to `f` or vice versa, because it can no longer assume that the pointed-at data wasn't changed. And that leads to inefficient code.
Therefore, rather than looking at the type of the pointer itself, rules were set up about the actual type of the pointed-at data. The term for this is _effective type_ (formal term). If we declare `int x;` then the effective type of `x` is `int`. More intricate rules for effective type apply in case of dynamic allocation etc but I won't go into that here.
Strict aliasing refers to the scenario where a certain effective type is accessed through a de-referenced pointer, so called "lvalue access". C23 6.5 §7:
> An object shall have its stored value accessed only by an lvalue expression that has one of
the following types:
> - a type compatible with the effective type of the object,
> - a qualified version of a type compatible with the effective type of the object,
> - a type that is the signed or unsigned type corresponding to the effective type of the
object,
> - a type that is the signed or unsigned type corresponding to a qualified version of the
effective type of the object,
> - an aggregate or union type that includes one of the aforementioned types among its
members (including, recursively, a member of a subaggregate or contained union), or
> - a character type.
---
**Example of a "stict aliasing violation"**
The following scenario is undefined behavior:
- We have some variable of effective type `int`.
- We point at it with an `int*` (maybe for passing it to a function like in your example).
- Then later we cast the `int*` to a `float*`. This is allowed in case `int` and `float` have the same alignment on the given system, which is common.
- If we later de-reference that `float*` pointer, we invoke undefined behavior.
Because of this there is no telling if your example invokes undefined behavior or not, because we do not know the effective types of the pointed-at variables, which is most commonly found by checking their declarations. On the caller side in your example.
---
**Exceptions to strict aliasing**
> I also heard that character types are a loophole.
Yes, since they are one of the valid exceptions in the list I quoted from the standard. This is to allow us to inspect the raw binary contents of any type by iterating across it byte by byte, using a character type. This is guaranteed well-defined behavior (6.3.2.3 §7) and then the whole pointed at item is to be temporarily regarded as a character array.
```c
// This is fine
int some_var;
unsigned char* uptr = (unsigned char*) &some_var; // ok conversion
...
do_something(uptr[i]); // ok to de-reference and iterate across
```
But that does not mean we can go the other way around!
```c
// This is undefined behavior! (UB)
char str[] = "Hello";
int* iptr = (int*)str; // this might be a misaligned access, if so UB
do_something(*iptr); // this is a strict aliasing violation, always UB
```
---
**Optimizations and historical problems with the gcc compiler**
> I was told that this has implications for compiler optimizations.
For good and bad. With the release of C99, the rules about strict aliasing was clarified somewhat although they've always been there in one way or another. But because C99 brought up the matter, in the early 2000s this caught the attention of the gcc compiler team specifically. They started to realize that if the standard was read literally, they could start making assumptions about if a variable was modified or not.
If a `float*` is modified then the lvalue of an `int` cannot have been modified for example, because that would be a strict aliasing violation. And so there is no need to reload some `int` into memory after modifying a `float` through a pointer, making the code more efficient.
That was the original attention of strict aliasing, but what was problematic is that gcc started to do this in all manner of scenarios and for other types too, which weren't so obviously non-compatible. Like accessing an `int` through a `short` pointer.
For example there could be many reasons why we'd want to go through a 32 bit integer 16 bits at a time, or a 64 bit integer 32 bits at a time. Or do word-based copying of aligned data since that's faster. Etc etc - there are lots of scenarios in hardware-related or driver/library implementation when you might want to do things like this.
But now gcc treated all such code as undefined behavior and optimized accordingly. Worse yet they shipped this as a default setting. This caused C programs to crash left and right in the early 2000s. Linux in particular since it relies heavily on gcc, but now the default settings would break everything (and causing another memorable, childish [Torvalds tantrum](https://www.yodaiken.com/2018/06/07/torvalds-on-aliasing/)). But also lots of embedded systems got problems, where gcc was starting to gain popularity thanks to the new ARM cores.
Ever since then, gcc has struggled with recovering from all the problems it caused back then, undoing some of the problematic optimizations or making them optional. But gcc did take a reputation blow and became infamous for following the standard to the letter rather than trying to generate the code that the programmer had intended. It is a problem with that compiler specifically, since no other compiler made the same call to aggressively optimize in case of strict aliasing violations.
Yes, the gcc team was indeed reading the standard correctly. The examples I showed are undefined behavior indeed. However, most compilers strive to be tools useful to programmers in the real world, so they wouldn't start to generate all manner of implicit subtle misbehavior because of it. Rather - they would implement a non-standard compiler extension treating something like `*(short*) &my_int` as deterministic code.
A professional C or C++ programmer does need to be aware of strict aliasing when doing wild & crazy pointer conversions, but even more so misaligned access. Strict aliasing is an artificial concept of the C standard, but misalignment is a real physical problem based on how CPUs and memories are designed internally.

**Pointer conversions and aliasing**
First of all, C historically allows all manner of crazy pointer conversions (whereas C++ is more restrictive) between compatible and non-compatible pointed-at types, as well as to/from the generic `void*`. There are several reasons to allow that: type punning, generic programming etc. So within reason, for example a `float*` is allowed to point at an `int`, though things like correct alignment still have to be fulfilled.
But since we allow that, it becomes problematic to determine if a certain variable was updated at a certain point, as any pointer can in theory point at anything. A compiler does a lot of reasoning behind the lines about the internal type system, the types used by temporary expressions and so on. From a compiler's perspective, there's the term "pointer aliasing" (informal term). Two pointers can be said to alias if they may point at the same variable. Example:
```c
void func(int* x, int* y) // x and y may alias
{ ... }
int a;
int b;
func(&a, &a); // in this case x and y will point at the same variable
func(&a, &b); // in this case they point at different variables
```
Since `func` in the above example has external linkage and might sit in a different source file than the caller, there's no way for the compiler to know if `x` and `y` point at the same data - short of comparing their addresses, which would cause execution overhead. Therefore if we do `*x = 5;` inside the function, we'd have to reload `*y` when used because they alias and the write to `*x` could have changed `*y`.
---
**Strict pointer aliasing**
The rationale behind "strict pointer aliasing" (informal term), or just "strict aliasing", was to formalize the rules for how the compiler's internal type system is allowed to behave in these situations. In case we have a scenario such as
int aliased(float *f, int *i)
Then the compiler should reasonably be able to assume that if we modify `*f` inside the function, we did not modify what `i` points at, because they aren't compatible types. If the compiler can't assume that, it would have to reload `*i` from memory after writing to `*f` or vice versa, because it can no longer assume that the pointed-at data wasn't changed. And that leads to inefficient code.
Therefore, rather than looking at the type of the pointer itself, rules were set up about the actual type of the pointed-at data. The term for this is _effective type_ (formal term). If we declare `int x;` then the effective type of `x` is `int`. More intricate rules for effective type apply in case of dynamic allocation etc but I won't go into that here.
Strict aliasing refers to the scenario where a certain effective type is accessed through a de-referenced pointer, so called "lvalue access". C23 6.5 §7:
> An object shall have its stored value accessed only by an lvalue expression that has one of
the following types:
> - a type compatible with the effective type of the object,
> - a qualified version of a type compatible with the effective type of the object,
> - a type that is the signed or unsigned type corresponding to the effective type of the
object,
> - a type that is the signed or unsigned type corresponding to a qualified version of the
effective type of the object,
> - an aggregate or union type that includes one of the aforementioned types among its
members (including, recursively, a member of a subaggregate or contained union), or
> - a character type.
---
**Example of a "stict aliasing violation"**
The following scenario is undefined behavior:
- We have some variable of effective type `int`.
- We point at it with an `int*` (maybe for passing it to a function like in your example).
- Then later we cast the `int*` to a `float*`. This is allowed in case `int` and `float` have the same alignment on the given system, which is common.
- If we later de-reference that `float*` pointer, we invoke undefined behavior.
Because of this there is no telling if your example invokes undefined behavior or not, because we do not know the effective types of the pointed-at variables, which is most commonly found by checking their declarations. On the caller side in your example.
---
**Exceptions to strict aliasing**
> I also heard that character types are a loophole.
Yes, since they are one of the valid exceptions in the list I quoted from the standard. This is to allow us to inspect the raw binary contents of any type by iterating across it byte by byte, using a character type. This is guaranteed well-defined behavior (6.3.2.3 §7) and then the whole pointed at item is to be temporarily regarded as a character array.
```c
// This is fine
int some_var;
unsigned char* uptr = (unsigned char*) &some_var; // ok conversion
...
do_something(uptr[i]); // ok to de-reference and iterate across
```
But that does not mean we can go the other way around!
```c
// This is undefined behavior! (UB)
char str[] = "Hello";
int* iptr = (int*)str; // this might be a misaligned access, if so UB
do_something(*iptr); // this is a strict aliasing violation, always UB
```
---
**Optimizations and historical problems with the gcc compiler**
> I was told that this has implications for compiler optimizations.
For good and bad. With the release of C99, the rules about strict aliasing was clarified somewhat although they've always been there in one way or another. But because C99 brought up the matter, in the early 2000s this caught the attention of the gcc compiler team specifically. They started to realize that if the standard was read literally, they could start making assumptions about if a variable was modified or not.
If a `float*` is modified then the lvalue of an `int` cannot have been modified for example, because that would be a strict aliasing violation. And so there is no need to reload some `int` into memory after modifying a `float` through a pointer, making the code more efficient.
That was the original attention of strict aliasing, but what was problematic is that gcc started to do this in all manner of scenarios and for other types too, which weren't so obviously non-compatible. Like accessing an `int` through a `short` pointer.
For example there could be many reasons why we'd want to go through a 32 bit integer 16 bits at a time, or a 64 bit integer 32 bits at a time. Or do word-based copying of aligned data since that's faster. Etc etc - there are lots of scenarios in hardware-related or driver/library implementation when you might want to do things like this.
But now gcc treated all such code as undefined behavior and optimized accordingly. Worse yet they shipped this as a default setting. This caused C programs to crash left and right in the early 2000s. Linux in particular since it relies heavily on gcc, but now the default settings would break everything (and causing another memorable, childish [Torvalds tantrum](https://www.yodaiken.com/2018/06/07/torvalds-on-aliasing/)). But also lots of embedded systems got problems, where gcc was starting to gain popularity thanks to the new ARM cores.
Ever since then, gcc has struggled with recovering from all the problems it caused back then, undoing some of the problematic optimizations or making them optional. But gcc did take a reputation blow and became infamous for following the standard to the letter rather than trying to generate the code that the programmer had intended. It is a problem with that compiler specifically, since no other compiler made the same call to aggressively optimize in case of strict aliasing violations.
Yes, the gcc team was indeed reading the standard correctly. The examples I showed are undefined behavior indeed. However, most compilers strive to be tools useful to programmers in the real world, so they wouldn't start to generate all manner of implicit subtle misbehavior because of it. Rather - they would implement a non-standard compiler extension treating something like `*(short*) &my_int` as deterministic code.
A professional C or C++ programmer does need to be aware of strict aliasing when doing wild & crazy pointer conversions, but even more so misaligned access. Strict aliasing is an artificial concept of the C standard, but misalignment is a real physical problem based on how CPUs and memories are designed internally.

#1: Initial revision by

Lundin‭ · 2024-11-17T13:59:44Z (7 months ago)

Copy Link

Raw

Markdown

**Pointer conversions and aliasing**

First of all, C historically allows all manner of crazy pointer conversions (whereas C++ is more restrictive) between compatible and non-compatible pointed-at types, as well as to/from the generic `void*`. There are several reasons to allow that: type punning, generic programming etc. So within reason, for example a `float*` is allowed to point at an `int`, though things like correct alignment still have to be fulfilled. 

But since we allow that, it becomes problematic to determine if a certain variable was updated at a certain point, as any pointer can in theory point at anything. A compiler does a lot of reasoning behind the lines about the internal type system, the types used by temporary expressions and so on. From a compiler's perspective, there's the term "pointer aliasing" (informal term). Two pointers can be said to alias if they may point at the same variable. Example:

```c
void func(int* x, int* y) // x and y may alias
{ ... }

int a;
int b;
func(&a, &a); // in this case x and y will point at the same variable
func(&a, &b); // in this case they point at different variables
```

Since `func` in the above example has external linkage and might sit in a different source file than the caller, there's no way for the compiler to know if `x` and `y` point at the same data - short of comparing their addresses, which would cause execution overhead. Therefore if we do `*x = 5;` inside the function, we'd have to reload `*y` when used because they alias and the write to `*x` could have changed `*y`.

---

**Strict pointer aliasing**

The rationale behind "strict pointer aliasing" (informal term), or just "strict aliasing", was to formalize the rules for how the compiler's internal type system is allowed to behave in these situations. In case we have a scenario such as

    int aliased(float *f, int *i)

Then the compiler should reasonably be able to assume that if we modify `f` inside the function, we did not modify what `i` points at, because they aren't compatible types. If the compiler can't assume that, it would have to reload `i` from memory after writing to `f` or vice versa, because it can no longer assume that the pointed-at data wasn't changed. And that leads to inefficient code.

Therefore, rather than looking at the type of the pointer itself, rules were set up about the actual type of the pointed-at data. The term for this is _effective type_ (formal term). If we declare `int x;` then the effective type of `x` is `int`. More intricate rules for effective type apply in case of dynamic allocation etc but I won't go into that here.

Strict aliasing refers to the scenario where a certain effective type is accessed through a de-referenced pointer, so called "lvalue access". C23 6.5 §7:

> An object shall have its stored value accessed only by an lvalue expression that has one of
the following types:  
> - a type compatible with the effective type of the object,  
> - a qualified version of a type compatible with the effective type of the object,  
> - a type that is the signed or unsigned type corresponding to the effective type of the
object,  
> - a type that is the signed or unsigned type corresponding to a qualified version of the
effective type of the object,  
> - an aggregate or union type that includes one of the aforementioned types among its
members (including, recursively, a member of a subaggregate or contained union), or  
> - a character type.

---

**Example of a "stict aliasing violation"**

The following scenario is undefined behavior:

- We have some variable of effective type `int`. 
- We point at it with an `int*` (maybe for passing it to a function like in your example).
- Then later we cast the `int*` to a `float*`. This is allowed in case `int` and `float` have the same alignment on the given system, which is common.
- If we later de-reference that `float*` pointer, we invoke undefined behavior. 

Because of this there is no telling if your example invokes undefined behavior or not, because we do not know the effective types of the pointed-at variables, which is most commonly found by checking their declarations. On the caller side in your example.

---

**Exceptions to strict aliasing**

> I also heard that character types are a loophole.

Yes, since they are one of the valid exceptions in the list I quoted from the standard. This is to allow us to inspect the raw binary contents of any type by iterating across it byte by byte, using a character type. This is guaranteed well-defined behavior (6.3.2.3 §7) and then the whole pointed at item is to be temporarily regarded as a character array.

```c
// This is fine
int some_var;
unsigned char* uptr = (unsigned char*) &some_var; // ok conversion
...
do_something(uptr[i]); // ok to de-reference and iterate across
```

But that does not mean we can go the other way around!

```c
// This is undefined behavior! (UB)
char str[] = "Hello";
int* iptr = (int*)str; // this might be a misaligned access, if so UB
do_something(*iptr); // this is a strict aliasing violation, always UB
```

---

**Optimizations and historical problems with the gcc compiler**

> I was told that this has implications for compiler optimizations.

For good and bad. With the release of C99, the rules about strict aliasing was clarified somewhat although they've always been there in one way or another. But because C99 brought up the matter, in the early 2000s this caught the attention of the gcc compiler team specifically. They started to realize that if the standard was read literally, they could start making assumptions about if a variable was modified or not. 

If a `float*` is modified then the lvalue of an `int` cannot have been modified for example, because that would be a strict aliasing violation. And so there is no need to reload some `int` into memory after modifying a `float` through a pointer, making the code more efficient.

That was the original attention of strict aliasing, but what was problematic is that gcc started to do this in all manner of scenarios and for other types too, which weren't so obviously non-compatible. Like accessing an `int` through a `short` pointer. 

For example there could be many reasons why we'd want to go through a 32 bit integer 16 bits at a time, or a 64 bit integer 32 bits at a time. Or do word-based copying of aligned data since that's faster. Etc etc - there are lots of scenarios in hardware-related or driver/library implementation when you might want to do things like this.

But now gcc treated all such code as undefined behavior and optimized accordingly. Worse yet they shipped this as a default setting. This caused C programs to crash left and right in the early 2000s. Linux in particular since it relies heavily on gcc, but now the default settings would break everything (and causing another memorable, childish [Torvalds tantrum](https://www.yodaiken.com/2018/06/07/torvalds-on-aliasing/)). But also lots of embedded systems got problems, where gcc was starting to gain popularity thanks to the new ARM cores.

Ever since then, gcc has struggled with recovering from all the problems it caused back then, undoing some of the problematic optimizations or making them optional. But gcc did take a reputation blow and became infamous for following the standard to the letter rather than trying to generate the code that the programmer had intended. It is a problem with that compiler specifically, since no other compiler made the same call to aggressively optimize in case of strict aliasing violations.

Yes, the gcc team was indeed reading the standard correctly. The examples I showed are undefined behavior indeed. However, most compilers strive to be tools useful to programmers in the real world, so they wouldn't start to generate all manner of implicit subtle misbehavior because of it. Rather - they would implement a non-standard compiler extension treating something like `*(short*) &my_int` as deterministic code.

A professional C or C++ programmer does need to be aware of strict aliasing when doing wild & crazy pointer conversions, but even more so misaligned access. Strict aliasing is an artificial concept of the C standard, but misalignment is a real physical problem based on how CPUs and memories are designed internally.

Communities

Post History