[置顶] C语言中各种数据类型的长度 sizeof char, short, int, long, long long
2013-07-14 22:59
656 查看
这些数据类型的sizeof具体长度依赖于编译器和操作系统(32-bit or 64-bit)
C99链接: http://www.open-std.org/JTC1/SC22/WG14/www/docs/n1256.pdf
C90 standard requires that
C99 standard requires that
short int
int
long int
long long int
64-bit 操作系统中,事实标准为LP64, int - 4字节, long, pointer 是8字节
在linux操作系统中,参见头文件 int-ll64.h
数据来源:http://docs.oracle.com/cd/E19620-01/805-3024/lp64-1/index.html
Table F-1 C Data Type Sizes
In addition to the data model changes, some system-derived types, such as size_t, have been expanded to be 64-bit quantities when
compiled in the 64-bit environment.
数据来源: http://publib.boulder.ibm.com/infocenter/zvm/v6r2/index.jsp?topic=%2Fcom.ibm.zos.r12.cbcpx01%2Fdatatypesize64.htm
同上
也就是没用到long 类型,用了char, short, int, long long 就够了。
http://publib.boulder.ibm.com/infocenter/zvm/v6r2/index.jsp?topic=%2Fcom.ibm.zos.r12.cbcpx01%2Fcbcpg1b0228.htm
http://www.unix.org/whitepapers/64bit.html
http://software.intel.com/en-us/articles/data-alignment-when-migrating-to-64-bit-intel-architecture
https://en.wikipedia.org/wiki/Data_structure_alignment
http://csweapon.diandian.com/post/2011-08-26/4372667
自然对齐
64-bit operating environment
Align 8-bit data at any address
Align 16-bit data to be contained within an aligned four-byte word
Align 32-bit data so that its base address is a multiple of four
Align 64-bit data so that its base address is a multiple of eight
Align 80-bit data so that its base address is a multiple of sixteen
Align 128-bit data so that its base address is a multiple of sixteen
Comparison of data structure member lengths produced from the same code
1: 首先,参见c99标准
标准中没有定义这些数据类型的长度,而是定义了这些数据类型能表达的大小范围的最小极限。C99链接: http://www.open-std.org/JTC1/SC22/WG14/www/docs/n1256.pdf
The C++ standard does not specify the size of integral types in bytes, but it specifies minimum ranges they must be able to hold. You can infer minimum size in bits from the required range and the value of CHAR_BIT macro, that defines the number of bits in a byte (in all but the most obscure platforms it's 8). One additional constraint for char is that its size is always 1 byte, or CHAR_BIT bits (hence the name). Minimum ranges required by the standard (page 22) are: signed char: -127 to 127 (note, not -128 to 127; this accommodates 1's-complement platforms) unsigned char: 0 to 255 "plain" char: -127 to 127 or 0 to 255 (depends on default char signedness) signed short: -32767 to 32767 unsigned short: 0 to 65535 signed int: -32767 to 32767 unsigned int: 0 to 65535 signed long: -2147483647 to 2147483647 unsigned long: 0 to 4294967295 signed long long: -9223372036854775807 to 9223372036854775807 unsigned long long: 0 to 18446744073709551615 A C++ (or C) implementation can define the size of a type in bytes sizeof(type) to any value, as long as the expression sizeof(type) * CHAR_BIT evaluates to the number of bits enough to contain required ranges, and the ordering of type is still valid (e.g. sizeof(int) <= sizeof(long)). The actual implementation-specific ranges can be found in <limits.h> header in C, or <climits> in C++ (or even better, templated std::numeric_limits in <limits> header).
2: 数据类型长度需要符合2个标准
一个是数据类型能描述的范围,一个是数据类型表达范围之间的顺序C90 standard requires that
sizeof(short)<=sizeof(int)<=sizeof(long)
C99 standard requires that
sizeof(short)<=sizeof(int)<=sizeof(long)<sizeof(longlong)
3: 5种标准数据类型和他们的衍生类型
signed charshort int
int
long int
long long int
There are five standard signed integer types : signed char, short int, int, long int, and long long int. In this list, each type provides at least as much storage as those preceding it in the list. For each of the standard signed integer types, there exists a corresponding (but different) standard unsigned integer type: unsigned char, unsigned short int, unsigned int, unsigned long int, and unsigned long long int, each of which occupies the same amount of storage and has the same alignment requirements.
The C++ Standard says it like this : 3.9.1, §2 : There are five signed integer types : "signed char", "short int", "int", "long int", and "long long int". In this list, each type provides at least as much storage as those preceding it in the list. Plain ints have the natural size suggested by the architecture of the execution environment (44); the other signed integer types are provided to meet special needs. (44) that is, large enough to contain any value in the range of INT_MIN and INT_MAX, as defined in the header <climits>. The conclusion : it depends on which architecture you're working on. Any other assumption is false.
4: 实践中的事实标准
32-bit 操作系统中,事实标准为 ILP32, int, long, pointer 都是4字节64-bit 操作系统中,事实标准为LP64, int - 4字节, long, pointer 是8字节
在linux操作系统中,参见头文件 int-ll64.h
For 32-bit systems, the 'de facto' standard is ILP32 - that is, int, long and pointer are all 32-bit quantities. For 64-bit systems, the primary Unix 'de facto' standard is LP64 - long and pointer are 64-bit (but int is 32-bit). The Windows 64-bit standard is LLP64 - long long and pointer are 64-bit (but long and int are both 32-bit). At one time, some Unix systems used an ILP64 organization. None of these de facto standards is legislated by the C standard (ISO/IEC 9899:1999), but all are permitted by it.
5: 数据模型 LP64和ILP32
[b]数据来源: http://en.wikipedia.org/wiki/64-bit#64-bit_data_models[/b]Data model | short (integer) | int | long (integer) | long long | pointers/ size_t | Sample operating systems |
---|---|---|---|---|---|---|
LLP64/ IL32P64 | 16 | 32 | 32 | 64 | 64 | Microsoft Windows (X64/IA-64) |
LP64/ I32LP64 | 16 | 32 | 64 | 64 | 64 | Most Unix and Unix-like systems, e.g. Solaris, Linux, BSD, and OS X; z/OS |
Table F-1 C Data Type Sizes
C Type | ILP32 | LP64 |
---|---|---|
char | 8 | 8 |
short | 16 | 16 |
int | 32 | 32 |
long | 32 | 64 |
long long | 64 | 64 |
pointer | 32 | 64 |
compiled in the 64-bit environment.
数据来源: http://publib.boulder.ibm.com/infocenter/zvm/v6r2/index.jsp?topic=%2Fcom.ibm.zos.r12.cbcpx01%2Fdatatypesize64.htm
同上
6: linux 中的实际使用
#ifndef __ASSEMBLY__ /* * __xx is ok: it doesn't pollute the POSIX namespace. Use these in the * header files exported to user space */ typedef __signed__ char __s8; typedef unsigned char __u8; typedef __signed__ short __s16; typedef unsigned short __u16; typedef __signed__ int __s32; typedef unsigned int __u32; #ifdef __GNUC__ __extension__ typedef __signed__ long long __s64; __extension__ typedef unsigned long long __u64; #else typedef __signed__ long long __s64; typedef unsigned long long __u64; #endif
也就是没用到long 类型,用了char, short, int, long long 就够了。
7. printf
http://www.gnu.org/software/libc/manual/html_mono/libc.html#Integer-Conversions 12.12.4 Integer Conversions This section describes the options for the ‘%d’, ‘%i’, ‘%o’, ‘%u’, ‘%x’, and ‘%X’ conversion specifications. These conversions print integers in various formats. The ‘%d’ and ‘%i’ conversion specifications both print an int argument as a signed decimal number; while ‘%o’, ‘%u’, and ‘%x’ print the argument as an unsigned octal, decimal, or hexadecimal number (respectively). The ‘%X’ conversion specification is just like ‘%x’ except that it uses the characters ‘ABCDEF’ as digits instead of ‘abcdef’. ‘l’ Specifies that the argument is a long int or unsigned long int, as appropriate. Two ‘l’ characters is like the ‘L’ modifier, below. If used with ‘%c’ or ‘%s’ the corresponding parameter is considered as a wide character or wide character string respectively. This use of ‘l’ was introduced in Amendment 1 to ISO C90. ‘L’ ‘ll’ ‘q’ Specifies that the argument is a long long int. (This type is an extension supported by the GNU C compiler. On systems that don't support extra-long integers, this is the same as long int.) The ‘q’ modifier is another name for the same thing, which comes from 4.4 BSD; a long long int is sometimes called a “quad” int.
8: 字节对齐
http://publib.boulder.ibm.com/infocenter/zvm/v6r2/index.jsp?topic=%2Fcom.ibm.zos.r12.cbcpx01%2Fcbcpg1b0233.htmhttp://publib.boulder.ibm.com/infocenter/zvm/v6r2/index.jsp?topic=%2Fcom.ibm.zos.r12.cbcpx01%2Fcbcpg1b0228.htm
http://www.unix.org/whitepapers/64bit.html
http://software.intel.com/en-us/articles/data-alignment-when-migrating-to-64-bit-intel-architecture
https://en.wikipedia.org/wiki/Data_structure_alignment
http://csweapon.diandian.com/post/2011-08-26/4372667
自然对齐
64-bit operating environment
Align 8-bit data at any address
Align 16-bit data to be contained within an aligned four-byte word
Align 32-bit data so that its base address is a multiple of four
Align 64-bit data so that its base address is a multiple of eight
Align 80-bit data so that its base address is a multiple of sixteen
Align 128-bit data so that its base address is a multiple of sixteen
|
Source: | #include <stdio.h> int main(void) { struct li{ long la; int ia; } li; struct lii{ long la; int ia; int ib; } lii; struct ili{ int ia; long la; int ib; } ili; printf("length li = %d\n",sizeof(li)); printf("length lii = %d\n",sizeof(lii)); printf("length ili = %d\n",sizeof(ili)); } |
ILP32 member lengths: | length li = 8 length lii = 12 length ili = 12 |
LP64 member lengths: | length li = 16 length lii = 16 length ili = 24 |
[b]插曲:[/b]
[b]我为什么写这篇文章。前不久去[b]某公司面试,boss问我int在64-bit OS上是多少,我说是4字节。然后他说是8字节,我表示又学到了很多知识。我其实以前就看过关于这个数据模型的一些帖子,只是没这么仔细。于是今天整理了一下。[/b][/b]相关文章推荐
- C语言中各种数据类型的长度 sizeof char, short, int, long, long long
- c语言基本数据类型short、int、long、char、float、double
- c语言基本数据类型short、int、long、char、float、double
- c语言基本数据类型short、int、long、char、float、double
- c语言基本数据类型short、int、long、char、float、double
- c语言基本数据类型short、int、long、char、float、double
- c语言基本数据类型short、int、long、char、float、double大小及命名规则
- c语言基本数据类型(short、int、long、char、float、double)
- c语言基本数据类型short、int、long、char、float、double
- 在C语言中,double、long、unsigned、int、char类型数据所占字节数
- C的无符号数据类型int,short,byte 转为 Java对应的数据long,char,short
- 各种int,double,char等类型数据长度
- 在C语言中,double、long、unsigned、int、char类型数据所占字节数
- 在C语言中,double、long、unsigned、int、char类型数据所占字节数
- 在C语言中,double、long、unsigned、int、char类型数据所占字节数
- 数据类型 -- char , unsigned char = BYTE, short, unsigned short = WORD, long, unsigned long = DWORD, int , unginged int = UINT
- 在C语言中,double、long、unsigned、int、char类型数据所占字节数
- 在C语言中,double、long、unsigned、int、char类型数据所占字节数
- C/C++——求下面数据类型的最大值和最小值: char, short, int, long, float, double, long double和numeric_limits使用
- 在C语言中,double、long、unsigned、int、char类型数据所占字节数