Browsing the Linux kernel sources, I found a piece of code where a block of statements surrounded by parentheses is treated as an expression, à la Lisp (or ML); that is, an expression whose value is the value of the last statement.

For example:

int a = ({
    int i;
    int t = 1;
    for (i = 2; i<5; i++) {
        t*=i;
    }
    t;
});

I've been looking at the ANSI C grammar trying to figure out how this piece of code would fit in the parse tree, but I haven't been successful.

So, does anybody know if this behaviour is mandated by the standard or is just a peculiarity of GCC?

Update: I've tried with the flag -pedantic and the compiler now gives me a warning:

warning: ISO C forbids braced-groups within expressions
Praetorian
fortran

2 Answers

This is not standard C. It is a GCC extension called statement expressions. The GCC manual has the complete list of C extensions. This is one of the many GCC extensions used in the Linux kernel, and it seems that clang supports it too, although the clang documentation does not name it explicitly.

As you observed, the value of the last expression serves as the value of the whole construct. The documentation says:

The last thing in the compound statement should be an expression followed by a semicolon; the value of this subexpression serves as the value of the entire construct. (If you use some other kind of statement last within the braces, the construct has type void, and thus effectively no value.)
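A minimal sketch of both cases from that passage, assuming GCC or clang (it will not compile as strict ISO C). The function names `product_2_to_4` and `count_to` are made up for illustration:

```c
/* Statement expressions are a GNU extension: build with gcc or clang. */

/* Hypothetical helper: the construct ends in the expression `t;`,
   so the whole ({ ... }) has type int and the value of t. */
int product_2_to_4(void)
{
    int a = ({
        int i;
        int t = 1;
        for (i = 2; i < 5; i++) {
            t *= i;
        }
        t;
    });
    return a;
}

/* Hypothetical helper: the construct ends in a for statement,
   so it has type void and can only be used for its side effects. */
int count_to(int n)
{
    int counter = 0;
    ({
        int i;
        for (i = 0; i < n; i++) {
            counter++;
        }
    });
    return counter;
}
```

Trying to assign the second construct to a variable should be rejected by the compiler, since a void expression has no value to assign.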

One of the main benefits would be to make safe macros that would avoid multiple evaluations of arguments with side effects. The example given uses this unsafe macro:

#define max(a,b) ((a) > (b) ? (a) : (b))

which evaluates either a or b twice and can be rewritten to eliminate this problem using statement expressions as follows:

#define maxint(a,b) \
   ({int _a = (a), _b = (b); _a > _b ? _a : _b; }) 
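To see the difference concretely, here is a small sketch, assuming GCC or clang. The counter `calls` and the helpers `noisy`, `calls_with_unsafe_max` and `calls_with_maxint` are hypothetical names used only for this illustration:

```c
/* Statement expressions are a GNU extension: build with gcc or clang. */

#define max(a,b) ((a) > (b) ? (a) : (b))

#define maxint(a,b) \
   ({int _a = (a), _b = (b); _a > _b ? _a : _b; })

/* Counts how many times an argument expression is evaluated. */
static int calls;

static int noisy(int v)
{
    calls++;        /* the side effect we want to happen once per argument */
    return v;
}

/* Plain macro: the larger argument is evaluated a second time. */
int calls_with_unsafe_max(void)
{
    calls = 0;
    (void)max(noisy(1), noisy(2));
    return calls;
}

/* Statement expression: each argument is evaluated exactly once. */
int calls_with_maxint(void)
{
    calls = 0;
    (void)maxint(noisy(1), noisy(2));
    return calls;
}
```

With the plain macro, `noisy` runs three times (twice for the comparison, once more for the selected branch); with `maxint` it runs twice, once per argument.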

Note the need to use int explicitly, which can be fixed using another GCC extension, typeof:

#define max(a,b) \
   ({ typeof (a) _a = (a), _b = (b); _a > _b ? _a : _b; }) 

Note that clang also supports typeof.
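One caveat with the typeof version above: both temporaries are declared with the type of a, so an argument b of a different type is converted first. A sketch of a variant that gives each temporary its own type, assuming GCC or clang; the names `max_same`, `max_each`, `same_type_result` and `each_type_result` are made up here, and `__typeof__` is GCC's alternate spelling of typeof that also works in strict ISO modes:

```c
/* Statement expressions and __typeof__ are GNU extensions:
   build with gcc or clang. */

/* As in the answer: _b takes the type of a. */
#define max_same(a,b) \
   ({ __typeof__ (a) _a = (a), _b = (b); _a > _b ? _a : _b; })

/* Variant: each temporary takes the type of its own argument. */
#define max_each(a,b) \
   ({ __typeof__ (a) _a = (a); __typeof__ (b) _b = (b); _a > _b ? _a : _b; })

/* Hypothetical helper: b is converted to int (2.5 becomes 2) before comparing. */
double same_type_result(void)
{
    double d = 2.5;
    return max_same(1, d);
}

/* Hypothetical helper: the comparison is done on 1 and 2.5 as given. */
double each_type_result(void)
{
    double d = 2.5;
    return max_each(1, d);
}
```

Here `same_type_result()` yields 2.0 because the double is truncated when stored in an int temporary, while `each_type_result()` yields 2.5.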

John Kugelman
Shafik Yaghmour
  • Is it possible to write a macro to be safe like that without statement expressions? – Flimm Sep 25 '14 at 14:38
  • @Flimm I don't think so, but I would not consider myself a macro expert. I avoid them as much as I can, although there are some cases where they are hard to avoid, such as rolling your own assert. – Shafik Yaghmour Sep 26 '14 at 01:24
  • @Flimm That depends on what you want your macro to do; in some cases it's certainly possible to write a safe macro. For the complete `max` functionality you'll fall short in some way no matter what you do in standard C. The problem is that you either need to know the type of the arguments or you have to evaluate one of them twice (of which the latter is the problem with the macro approach above). – skyking May 03 '17 at 13:17
  • Hmpf, the last one should be modified to properly support heterogeneous arguments. – Deduplicator Aug 04 '18 at 20:51

It's called "braced-group within expression".

It's not allowed by ANSI/ISO C or by C++, but gcc supports it.

laalto