Why are the fast integer types faster than the other integer types?

小开

Best answer

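Imagine a CPU that performs only 64-bit arithmetic operations. Now imagine how you would implement an unsigned 8-bit addition on such a CPU. It would necessarily take more than one operation to get the right result. On such a CPU, 64-bit operations are faster than operations on other integer widths. In this situation, all of the Xint_fastY_t types might presumably be aliases of the 64-bit type.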
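To make the "more than one operation" point concrete, here is a minimal sketch of what such a CPU would effectively have to do for an unsigned 8-bit addition. The function names are made up for illustration, and the operands are assumed to already be in the range 0..255.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical helper: adds two unsigned 8-bit values on a machine whose
   only arithmetic is 64-bit. Operands are assumed to be in 0..255. */
static uint64_t add_u8_with_64bit_alu(uint64_t a, uint64_t b)
{
    uint64_t sum = a + b;  /* step 1: the native 64-bit add */
    return sum & 0xFF;     /* step 2: extra mask so the result wraps modulo 2^8 */
}

/* Small self-check: 200 + 100 = 300, and 300 mod 256 = 44. */
static void check_add_u8(void)
{
    assert(add_u8_with_64bit_alu(200, 100) == 44);
    assert(add_u8_with_64bit_alu(1, 2) == 3);
}
```

A plain 64-bit addition needs only the first step, which is why the wide type would be the "fast" one on such a machine.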

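If a CPU supports fast operations on narrow integer types, so that a wider type is no faster than a narrower one, then Xint_fastY_t will not (should not) be an alias of a type wider than is necessary to represent all Y bits.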

Out of curiosity, I checked the sizes on a particular implementation (GNU, Linux) on some architectures. These are not the same across all implementations on the same architecture:

┌────╥───────────────────────────────────────────────────────────┐
│ Y  ║   sizeof(Xint_fastY_t) * CHAR_BIT                         │
│    ╟────────┬─────┬───────┬─────┬────────┬──────┬────────┬─────┤
│    ║ x86-64 │ x86 │ ARM64 │ ARM │ MIPS64 │ MIPS │ MSP430 │ AVR │
╞════╬════════╪═════╪═══════╪═════╪════════╪══════╪════════╪═════╡
│ 8  ║ 8      │ 8   │ 8     │ 32  │ 8      │ 8    │ 16     │ 8   │
│ 16 ║ 64     │ 32  │ 64    │ 32  │ 64     │ 32   │ 16     │ 16  │
│ 32 ║ 64     │ 32  │ 64    │ 32  │ 64     │ 32   │ 32     │ 32  │
│ 64 ║ 64     │ 64  │ 64    │ 64  │ 64     │ 64   │ 64     │ 64  │
└────╨────────┴─────┴───────┴─────┴────────┴──────┴────────┴─────┘
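If you want to see the corresponding row values for your own toolchain, something along these lines should do. It is only a sketch: `BITS_OF` and `print_fast_widths` are names I made up, and the numbers it prints will depend on your compiler and C library.

```c
#include <assert.h>
#include <limits.h>
#include <stdint.h>
#include <stdio.h>

/* Storage size in bits, the same quantity the table reports. */
#define BITS_OF(T) (sizeof(T) * CHAR_BIT)

/* The standard only promises "at least Y bits"; anything beyond that is
   the implementation's choice. */
static_assert(BITS_OF(int_fast8_t)  >= 8,  "int_fast8_t too narrow");
static_assert(BITS_OF(int_fast16_t) >= 16, "int_fast16_t too narrow");
static_assert(BITS_OF(int_fast32_t) >= 32, "int_fast32_t too narrow");
static_assert(BITS_OF(int_fast64_t) >= 64, "int_fast64_t too narrow");

/* Hypothetical helper: prints signed / unsigned widths for each Y. */
static void print_fast_widths(void)
{
    printf("Y=8 : %zu / %zu\n", BITS_OF(int_fast8_t),  BITS_OF(uint_fast8_t));
    printf("Y=16: %zu / %zu\n", BITS_OF(int_fast16_t), BITS_OF(uint_fast16_t));
    printf("Y=32: %zu / %zu\n", BITS_OF(int_fast32_t), BITS_OF(uint_fast32_t));
    printf("Y=64: %zu / %zu\n", BITS_OF(int_fast64_t), BITS_OF(uint_fast64_t));
}
```

Call `print_fast_widths()` from `main` and compare the output with the column for your architecture.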

Note that although operations on the larger types may be faster, such types also take more space in cache, and thus using them doesn't necessarily yield better performance. Furthermore, one cannot always trust that the implementation has made the right choice in the first place. As always, measuring is required for optimal results.

小开

They aren't, at least not reliably.

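The fast types are simply typedefs for regular types; how they are defined is up to the implementation. They must be at least the size requested, but they can be larger.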

It is true that on some architectures some integer types have better performance than others. For example, early ARM implementations had memory access instructions for 32-bit words and for unsigned bytes, but they did not have instructions for half-words or signed bytes. The half-word and signed-byte instructions were added later, but they still have less flexible addressing options, because they had to be shoehorned into the spare encoding space. Furthermore, all the actual data processing instructions on ARM work on words, so in some cases it may be necessary to mask off smaller values after calculation to give correct results.
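As a rough illustration of the extra work, here is how a signed byte can be recovered when only a zero-extending byte load is available. This is a sketch in portable C, not the actual instruction sequence a compiler would emit, and `sign_extend_byte` is a name I invented.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical helper: sign-extends a byte that was loaded zero-extended
   into a 32-bit word, as a core without a signed-byte load has to do. */
static int32_t sign_extend_byte(uint32_t loaded)
{
    uint32_t low = loaded & 0xFFu;          /* keep only the low byte */
    return (int32_t)(low ^ 0x80u) - 0x80;   /* flip the sign bit, then subtract it */
}

/* Small self-check of the boundary cases. */
static void check_sign_extend(void)
{
    assert(sign_extend_byte(0x7Fu) == 127);
    assert(sign_extend_byte(0x80u) == -128);
    assert(sign_extend_byte(0xFFu) == -1);
}
```

With a word-sized type none of this fix-up is needed, which is the sense in which the wider type can be faster.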

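However, there is also the competing concern of cache pressure. Even if it takes more instructions to load, store, or process a smaller value, it may still perform better if it reduces the number of cache misses.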

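The definitions of the types on many common platforms do not seem to have been thought through. In particular, modern 64-bit platforms tend to have good support for 32-bit integers, yet the "fast" types are often unnecessarily 64-bit on these platforms.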

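Furthermore, types in C become part of the platform's ABI. So even if a platform vendor discovers that they made dumb choices, it is difficult to change those dumb choices later.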

Ignore the "fast" types. If you are really concerned about integer performance, benchmark your code with all the available sizes.
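A very crude sketch of what such a comparison might look like is below. Everything in it (the array size, the summing workload, the function names) is an assumption chosen for illustration. A real benchmark should use your actual workload, repeat the measurement many times, and be built with your real optimization flags.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <stdio.h>
#include <time.h>

#define ELEMENT_COUNT ((size_t)1000000)  /* arbitrary size for the sketch */

/* The same values stored at three different element widths. */
static uint8_t  data8[ELEMENT_COUNT];
static uint32_t data32[ELEMENT_COUNT];
static uint64_t data64[ELEMENT_COUNT];

static void fill_arrays(void)
{
    for (size_t i = 0; i < ELEMENT_COUNT; i++) {
        uint8_t v = (uint8_t)(i * 31u);  /* any value that fits in 8 bits */
        data8[i] = v;
        data32[i] = v;
        data64[i] = v;
    }
}

static uint64_t sum8(void)
{
    uint64_t s = 0;
    for (size_t i = 0; i < ELEMENT_COUNT; i++) s += data8[i];
    return s;
}

static uint64_t sum32(void)
{
    uint64_t s = 0;
    for (size_t i = 0; i < ELEMENT_COUNT; i++) s += data32[i];
    return s;
}

static uint64_t sum64(void)
{
    uint64_t s = 0;
    for (size_t i = 0; i < ELEMENT_COUNT; i++) s += data64[i];
    return s;
}

/* Times a single call with clock(); returns CPU seconds. */
static double time_sum(uint64_t (*fn)(void), uint64_t *result)
{
    clock_t start = clock();
    *result = fn();
    return (double)(clock() - start) / CLOCKS_PER_SEC;
}

static void run_benchmark(void)
{
    uint64_t r8, r32, r64;
    fill_arrays();
    double t8  = time_sum(sum8,  &r8);
    double t32 = time_sum(sum32, &r32);
    double t64 = time_sum(sum64, &r64);
    assert(r8 == r32 && r32 == r64);  /* all widths must agree on the sum */
    printf("8-bit: %f s, 32-bit: %f s, 64-bit: %f s\n", t8, t32, t64);
}
```

The narrow array touches one eighth of the memory of the 64-bit one, so this particular workload also exercises the cache-pressure effect mentioned above.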

小开

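Fast types are not faster than all other integer types. They are in fact identical to some "normal" integer type (they are just an alias for that type), namely whichever type happens to be the fastest for holding a value of at least that many bits.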

Which integer type each fast type is an alias for is simply platform-dependent.
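One way to see the aliasing directly is C11's `_Generic`, which selects on the underlying type and therefore cannot tell a typedef from the type it names. The macro and function names below are made up for this sketch, and only the standard signed integer types are listed, so an implementation using an extended type would report "other".

```c
#include <assert.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical macro: maps an expression to the name of its standard
   signed integer type. Typedefs are transparent to _Generic. */
#define INT_TYPE_NAME(x) _Generic((x), \
    signed char: "signed char",        \
    short:       "short",              \
    int:         "int",                \
    long:        "long",               \
    long long:   "long long",          \
    default:     "other")

/* Reports which ordinary type int_fast32_t is an alias of here. */
static const char *fast32_alias_name(void)
{
    return INT_TYPE_NAME((int_fast32_t)0);
}

/* Small self-check that the macro names plain types correctly. */
static void check_alias_names(void)
{
    assert(strcmp(INT_TYPE_NAME((int)0), "int") == 0);
    assert(strcmp(INT_TYPE_NAME((long)0), "long") == 0);
}
```

Printing `fast32_alias_name()` on two different platforms is a quick way to confirm that the answer really does vary.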