C语言中的位域、字节序、比特序、大小端

1.比特序 / 位序 / bitnumbering / bit endianness

我们知道一个字节有8位，也就是8个比特位。从第0位到第7位共8位。比特序就是用来描述比特位在字节中的存放顺序的。通过阅读网页http://en.wikipedia.org/wiki/Bit_numbering的内容，关于比特序我们得到下面的结论：

（1）比特序分为两种：LSB 0 位序和MSB 0 位序。

LSB是指 leastsignificant bit，MSB是指 most significantbit。

LSB 0 位序是指：字节的第0位存放数据的leastsignificant bit，即我们的数据的最低位存放在字节的第0位。

MSB 0 位序是指：字节的第0位存放数据的most significantbit，即我们的数据的最高位存放在字节的第0位。

所以说对于代码：char *ch = 0x96; // 0x96 = 1001 0110

指针ch到底指向哪里呢？不难知道，如果是LSB 0 位序则显然指针ch指向最右边的也是最低位的0.

而如果是MSB 0 位序则显然指针ch指向最左边的也是最高位的1.

LSB 0: A container for 8-bit binary numberwith the highlighted leastsignificant bit assigned the bit number 0

MSB 0:A container for 8-bit binary numberwith the highlighted most significantbit assignedthe bit number 0

（2）小端cpu通常采用的是LSB 0 位序，但是大端cpu却有可能采用LSB0 位序也有可能采用的是MSB0 位序

(Little-endian cpususually employ "LSB 0" bit numbering,however both bit numberingconventions can be seen in big-endianmachines. )

（3）推荐的标准是MSB0 位序。

(The recommended style for Request forComments documents is "MSB 0" bit numbering.)

（4）Bit numbering is usually transparent tothe software.

2.大小端和字节序 http://en.wikipedia.org/wiki/Endianess

In computing,the term endian or endianness refersto the ordering of individually addressable sub-components within therepresentation of a larger data item as stored in external memory (or,sometimes,as sent on a serial connection). Each sub-component in therepresentation has a unique degree of significance,like the place value of digitsin a decimal number. These sub-components are typically 16- or 32-bit words,8-bit bytes,or even bits. Endianness isa difference in data representation at the hardware level and may or may not betransparent at higher levels,depending on factors such as the type of highlevel language used.

计算机中，术语“端”是指：在内存中的一个较大的数据，它是由各个可以被单独寻址的部分组成，这些组成部分在该数据中是以怎样的顺序存放的呢？而这个问题涉及到“端”的概念，cpu是大端还是小端决定了这些组成部分的存放顺序。

这些组成部分可能是16或32位的字、8位的字节、甚至是比特位。

The most commoncases refer to how bytes are ordered within a single 16-, 32-,or 64-bit word。

我们通常碰到的情况是：字节是以怎样的顺序存放在一个16、32、64位的数据中。

（当我们要存取一个16、32、64位数据的某一组成部分，也就是某一个或几个字节时，就要特别注意机器的“大小端”）

A big-endian machinestores the most significant byte first,and a little-endian machinestores the least significant byte first.

Quick Reference - Byte Machine Example
Endian	First Byte (lowest address)	Middle Bytes	Last Byte (highest address)	Summary
big	most significant	...	least significant	Similar to a number written on paper (in Arabic numerals)
little	least significant	...	most significant	Arithmetic calculation order (see carry propagation)

Examples ofstoring the value 0A0B0C0Dh in memory

Big-endian

Atomic elementsize 8-bit,address increment 1-byte (octet)

increasing addresses →
...	0Ah	0Bh	0Ch	0Dh	...

The most significantbyte (MSB)value,which is 0Ah in our example,is stored at the memory locationwith the lowest address,the next byte value in significance, 0Bh,isstored at the following memory location and so on. This is akin toLeft-to-Right reading in hexadecimal order.

Atomic elementsize 16-bit

increasing addresses →
...	0A0Bh	0C0Dh	...

The most significant atomic elementstores Now the value 0A0Bh,followed by 0C0Dh.

Little-endian

Atomic elementsize 8-bit,address increment 1-byte (octet)

increasing addresses →
...	0Dh	0Ch	0Bh	0Ah	...

The leastsignificant byte (LSB) value, 0Dh,is at the lowestaddress. The other bytes follow in increasing order of significance.

Atomic elementsize 16-bit

increasing addresses →
...	0C0Dh	0A0Bh	...

The least significant 16-bit unit storesthe value 0C0Dh,immediately followed by 0A0Bh. Notethat 0C0Dh and 0A0Bh represent integers,not bit layouts(see bit numbering).

很显然“小端”机器符合“高高低低”的原则。及高位字节或字存放在高地址，低位字节或字存放在低地址。

另外“小端”机器中，数据在cpu的寄存器和内存中的存放顺序是一致的。

Byte addresses increasing from right toleft

在我们写: 0xFF86 时，很明显地址是从右向左递增的。也就是低位写在右边，高位写在左边。

但是当我们写字符串时：char *str = "Hello World!"，却是低位的字符写在左边，高位的字符写在了右边。

With 8-bitatomic elements:

← increasing addresses
...	0Ah	0Bh	0Ch	0Dh	...

The leastsignificant byte (LSB) value,isat the lowest address. The other bytes follow in increasing order ofsignificance.（这个明显符合我们的习惯）

With 16-bit atomic elements:

← increasing addresses
...	0A0Bh	0C0Dh	...

The least significant 16-bit unit storesthe value 0C0Dh,immediately followed by 0A0Bh.

The display of text is reversed from thenormal display of languages such as English that read from left to right. Forexample,the word "XRAY" displayed in this manner,with eachcharacter stored in an 8-bit atomic element:

← increasing addresses
...	"Y"	"A"	"R"	"X"	...

（可以看到和我们手写的顺序是相反的，这一点特别要注意！）

If pairs of characters are stored in16-bit atomic elements (using 8 bits per character),it Could look evenstranger:

← increasing addresses
...	"AY"	"XR"	...

C语言中的位域、字节序、比特序、大小端

相关推荐