C- Size of structure when bit field is used and how it is stored in memory

Question

void main()
{
  struct bitfield
  {
    signed int a :3;
    unsigned int b :13;
    unsigned int c :1;
  };

  struct bitfield bit1 = { 2, 14, 1 };
  clrscr();
  printf("%d", sizeof(bit1));
  getch();
}

Why is size 4 bytes here? And how exactly these elements are stored in memory?

possible duplicate of [structure padding and structure packing](http://stackoverflow.com/questions/4306186/structure-padding-and-structure-packing) — Chad, Aug 17 '13 at 02:52
Please don't use `void main()`. Neither the C standard nor Microsoft sanctions such usage, despite numerous books suggesting otherwise. — Jonathan Leffler, Aug 17 '13 at 03:01
The 'possible duplicate' on structure padding and packing does not deal with bit fields, and bit fields have their own sets of rules that are sufficiently different to make that answer not appropriate as the duplicate. There's probably some other duplicate, but it isn't that one. — Jonathan Leffler, Aug 17 '13 at 03:04
@JonathanLeffler +1. However, I believe that on some compilers targetting embedded systems (which are the only place people use bitfields these days, `main()` doesn't return a value - though I forget where I read this. — , Aug 17 '13 at 03:06
@ldrumm: It is a sweeping statement (though one with a kernel of truth to it) to say that bit fields are only used in embedded systems. If you're coding for an embedded system, you'd better say so to avoid accusations of mis-defining `main()`. In the absence of such identification, it is a (depressingly common) mistake. — Jonathan Leffler, Aug 17 '13 at 03:09
@JonathanLeffler, an insignificant problem that has no bearing on the type of coding asked about on SO. A sense of scale is required. When someone asks about seeing "pigs that fly" it is important to NOT argue about where they got their wings. — JackCColeman, Aug 17 '13 at 07:15

score 6 · Answer 1 · edited Jun 20 '20 at 09:12

Almost every aspect of bit fields is implementation defined. Even the signedness of a 'plain int' bit field is implementation defined; it may be signed or unsigned. The layout of the fields — whether they go from most significant bit to least significant bit in the containing 'unit' (the term used in the standard) or from least to most significant is implementation defined. The size of the largest permissible bit field; when a bit field is stored in a new unit; all these are implementation defined.

For example, on Mac OS X 10.8.4 using GCC 4.8.1, it is possible to demonstrate that the struct bitfield in the question is laid out with a occupying the 3 least significant bits (bits 0-2), b occupying the next 13 bits (3-15), and c occupying the next 1 bit (16):

#include <stdio.h>

static void print_info(int v);

int main(void)
{
    int values[] =
    {
        0x55555555, 0xAAAAAAAA, 0x87654321, 0xFEDCBA98,
        0xFEDCBA90, 0xFEDCBA91, 0xFEDCBA92, 0xFEDCBA93,
        0xFEDCBA94, 0xFEDCBA95, 0xFEDCBA96, 0xFEDCBA97,
        0xFEDCBA98, 0xFEDCBAA0, 0xFEDCBAA8, 0x0000BAA0,
        0x0001BAA0, 0x00000008, 0x00000010, 0x00000018,
        0x0000FFF0, 0x0000FFF8,
    };

    for (size_t i = 0; i < sizeof(values)/sizeof(values[0]); i++)
        print_info(values[i]);
    return 0;
}

static void print_info(int v)
{
    union
    {
        unsigned int x;
        struct bitfield
        {
            signed int   a:3;
            unsigned int b:13;
            unsigned int c:1;
        } y;
    } u;
    u.x = v;
    printf("0x%.8X => %2d 0x%.4X %1X\n", u.x, u.y.a, u.y.b, u.y.c);
}

Sample output:

0x55555555 => -3 0x0AAA 1
0xAAAAAAAA =>  2 0x1555 0
0x87654321 =>  1 0x0864 1
0xFEDCBA98 =>  0 0x1753 0
0xFEDCBA90 =>  0 0x1752 0
0xFEDCBA91 =>  1 0x1752 0
0xFEDCBA92 =>  2 0x1752 0
0xFEDCBA93 =>  3 0x1752 0
0xFEDCBA94 => -4 0x1752 0
0xFEDCBA95 => -3 0x1752 0
0xFEDCBA96 => -2 0x1752 0
0xFEDCBA97 => -1 0x1752 0
0xFEDCBA98 =>  0 0x1753 0
0xFEDCBAA0 =>  0 0x1754 0
0xFEDCBAA8 =>  0 0x1755 0
0x0000BAA0 =>  0 0x1754 0
0x0001BAA0 =>  0 0x1754 1
0x00000008 =>  0 0x0001 0
0x00000010 =>  0 0x0002 0
0x00000018 =>  0 0x0003 0
0x0000FFF0 =>  0 0x1FFE 0
0x0000FFF8 =>  0 0x1FFF 0

The test values are not chosen completely at random. From the test values 0xFEDCBA90 to 0xFECBA97, we can see that the least significant 3 bits contain a. From the test values 0x0000BAA0 and 0x0001BAA0, we can see that the 17th bit (or bit 16) contains c. And from the test values 0x00000008 to 0x0000FFF8, we can see that bits 3-15 contain b.

It must, however, be pointed out that the code is debatably portable in theory; since the code writes to u.x and then reads u.x and u.y.a, u.y.b and u.y.c, it is not accessing the member of the union last written to, which is strictly undefined behaviour. In practice, it 'always' works (I've not heard of a system where it doesn't work — it is unlikely but not technically impossible that there is a system where it doesn't work).

This layout is not the only possible layout by any stretch of the imagination. However, I don't have access to compilers or systems that demonstrate alternative layouts.

In ISO/IEC 9899:2011, section §6.7.2.1 Structure and union specifiers says:

¶11 An implementation may allocate any addressable storage unit large enough to hold a bitfield. If enough space remains, a bit-field that immediately follows another bit-field in a structure shall be packed into adjacent bits of the same unit. If insufficient space remains, whether a bit-field that does not fit is put into the next unit or overlaps adjacent units is implementation-defined. The order of allocation of bit-fields within a unit (high-order to low-order or low-order to high-order) is implementation-defined. The alignment of the addressable storage unit is unspecified.

¶12 A bit-field declaration with no declarator, but only a colon and a width, indicates an unnamed bit-field.¹²⁶⁾ As a special case, a bit-field structure member with a width of 0 indicates that no further bit-field is to be packed into the unit in which the previous bitfield, if any, was placed.

¹²⁶⁾ An unnamed bit-field structure member is useful for padding to conform to externally imposed layouts.

A slight variant of the structure in the question is:

struct exegesis
{
    signed int   a:3;
    unsigned int  :0;
    unsigned int b:13;
    unsigned int  :0;
    unsigned int c:1;
};

This structure has a size of 12 (on the same compiler/platform as before). The storage unit for bit fields on this platform is 4 bytes, so the anonymous zero-width fields start a new storage unit. a is stored in the least significant 3 bits of the first 4-byte unit; b in the least significant 13 bits of the second 4-byte unit; and c in the least significant bit of the third 4-byte unit. As noted in the quote from the standard, you can have anonymous bit fields that are larger than 0 too.

Yu Hao · Answer 2 · 2013-08-17T03:06:42.837

3

How bit-field is stored in memory is implementation dependent.

One possible reason is, the bit-field which has 17 bits, should hold at least 3 bytes, but the compiler choose to pad it to 4 bytes.

Again, almost everything about bit-field is implementation dependent, including the size of them and memory layout.

Last, don't use void main, always stay with int main

edited Aug 17 '13 at 03:06

answered Aug 17 '13 at 03:01

Yu Hao

119,891
44
235
294

The data for `a` and `b` might well be in the same 16-bit unit; but the 17th bit for `c` has to go somewhere. Although the structure could be made 3 bytes long, the compiler chooses to make it 4 bytes. Since it is an implementation-defined issue, that is perfectly valid behaviour. – Jonathan Leffler Aug 17 '13 at 03:03

score 0 · Answer 3 · answered Aug 17 '13 at 07:25

how the bit field is stored is architeture dependent means for little endian left to right and for big enian right to left. the storage in the memory happens with the word size.1 byte is 1 word.so in our example its 17 bit as first two 8 bits are going to be stored in first two word.the next bit will be stored in the next word .the size should be 3 byte but the compiler does some padding so finally the size is becoming 4 byte.

for the storage in the memory you can refer this link http://en.wikipedia.org/wiki/Endianness

C- Size of structure when bit field is used and how it is stored in memory

3 Answers3

Linked