Implement generic swap macro in C

Question

Possible Duplicate:
is there an equivalent of std::swap() in c

Hi folks,

I was attempting a problem to write a generic swap macro in C and my macro looks like this:

#define swap(x,y) { x = x + y; y = x - y; x = x - y; }

It works fine for integers and floats but I am unsure if there is any catch in it. What if by generic macro they mean swapping pointers, characters etc ? Can anyone help me with writing a generic macro for swapping every input ?

Thanks

It won't work because of integer overflow issues. Paul R's answer looks like what you want. — nmichaels, Oct 20 '10 at 21:21
This will not work for all floats. Imagine `float x = 1e9; float y = 1.0f;`. Also, addition is not defined for pointers. — Oliver Charlesworth, Oct 20 '10 at 21:21
And why don't you just the normal code with a temporary variable? I'm pretty sure that the temp variable code is at least as fast if not faster. — CodesInChaos, Oct 20 '10 at 21:40
@CodeInChaos: Because, since it's a macro and not an actual template, he doesn't know what type to make the temporary variable. — Puppy, Oct 21 '10 at 08:33
`#define MIN(A,B) ({typeof(A) *_a = &(A); \ typeof(B) *_b = &(B);\ typeof(A) _c = *_b;\ *_b = *_a;\ *_a = _c;\ })` — andy, Aug 17 '13 at 13:20
While this might indeed be closely related to http://stackoverflow.com/questions/2637856/is-there-an-equivalent-of-stdswap-in-c, I personally don't think it qualifies as a duplicate - I've voted to reopen, especially as the answers and discussion so far are very insightful. — Greg S, Nov 12 '13 at 09:55

adamk · Accepted Answer · 2010-10-21T08:28:12.563

88

This works well only with integers.

For floats it will fail (e.g. try running it with a very large float and a very small one).

I would suggest something as follows:

#define swap(x,y) do \ 
   { unsigned char swap_temp[sizeof(x) == sizeof(y) ? (signed)sizeof(x) : -1]; \
     memcpy(swap_temp,&y,sizeof(x)); \
     memcpy(&y,&x,       sizeof(x)); \
     memcpy(&x,swap_temp,sizeof(x)); \
    } while(0)

memcpy is pretty optimized when the amount to copy is known at compilation time. Also, there's no need to manually pass a type name or use compiler specific extensions.

edited Oct 21 '10 at 08:28

answered Oct 20 '10 at 21:22

adamk

45,184
7
50
57

4

+1 for the only correct answer than does not require passing the type as an argument to the macro. – R.. GitHub STOP HELPING ICE Oct 20 '10 at 21:26
9

Why not make that `swap_temp[sizeof(x) == sizeof(y) ? sizeof(x) : -1]`. You'll get a generic buffer and compile-time check that they're the same. – GManNickG Oct 20 '10 at 21:26
@Oli: Presumably from the assumption that it will be enough for any type one would use such a macro with. – Matti Virkkunen Oct 20 '10 at 21:28
1

@Matti: that's what I originally meant, but it's actually not true (e.g .structs) – adamk Oct 20 '10 at 21:30
1

By the way, is there a way with C99 compound literals to perform this operation all as a single expression with no temporary variable, exploiting return values of `memcpy`? – R.. GitHub STOP HELPING ICE Oct 20 '10 at 21:32
@adamk: That's why I called it an assumption. – Matti Virkkunen Oct 20 '10 at 21:43
20

This will silently fail if my variable name is `swap_temp`. – JaredPar Oct 20 '10 at 21:45
@JaredPar: true - this can be fixed though using the token pasting technique in my solution. – Paul R Oct 20 '10 at 22:00
@Jared: That's only an issue because we're using macro's. There's no way around it, AFAIK, except to say "don't be stupid and use the name `swap_temp`". – GManNickG Oct 20 '10 at 22:33
1

@GMan, See @Paul`s solution. It has flaws (can't use dereferenced pointers) but it will either 1) work or 2) fail to compile. I would prefer a less functioning solution that doesn't silently fail. – JaredPar Oct 20 '10 at 22:34
@Jared: That's pretty annoying, though. I'd rather decorate the name like `_swap_temp_do_not_use_this_name_` and call it good. – GManNickG Oct 20 '10 at 22:39
@R.., Couldn't temporary `void *` variables be used to store `&x` and `&y` (evaluating `x` and `y` only once each)? That seems less of a hack to me. EDIT: Actually, I misread your comment; my suggestion seems to be going *against* what you had in mind. ;P – strager Oct 20 '10 at 22:41
@GMan agreed it's annoying but C/C++ is confusing enough without introducing macros that can silently fail. – JaredPar Oct 20 '10 at 22:42
@Jared: Do you *really* think it's going to be a problem with such a decorated name? I added an answer regardless, attempting to solve it. – GManNickG Oct 20 '10 at 22:53
2

@GMan, Yes and here's a real scenario where I've seen it happen (not specifically with swap but with other macros that ended up with conflicts). Developer A: writes the solution with the name you specified. Developer B: sees Developers A solution and says "wow, great temp name" and re-uses it. Developer C uses B's macro in combination with A's. Boom. – JaredPar Oct 20 '10 at 23:07
2

@Jared: Then Developer B is the moron, here. Why on Earth, implementing some *other* thing, would I use the name `_swap_temp_do_not_use_this_name_ ` as a "great temp name"; it has `swap_temp` in it! Blaming Developer A for "allowing it to silently fail" is about as backwards as it gets. It's not his job to protect against Machiavelli, only Murphy. – GManNickG Oct 20 '10 at 23:12
1

@GMan, I still blame developer A. They presented a solution which has a flaw and a solution but chose to check-in the flaw. The amount of time we've debated this in comments is much more than it would take to actually fix the problem. – JaredPar Oct 20 '10 at 23:17
@Jared: What's your fix? I seriously don't see a problem. The problem is your team had someone that stupid on it, to be frank. – GManNickG Oct 20 '10 at 23:22
1

@GMan, nitpick: not my team, bug I was tracking down in another's code base. For the fix I would choose 1) not a macro 2) your or Paul's solution. Corner cases like this, when hit are absolute nightmares to track down. IMHO it's well worth the extra few minutes to eliminate the silent failure. – JaredPar Oct 20 '10 at 23:25
2

@adamK: as it is now, the expression of the array bound may silently promote the `-1` to `SIZE_MAX`. You'd have to cast the second `sizeof(x)` to a signed type to make this trick work. – Jens Gustedt Oct 21 '10 at 07:49
@JaredPar: C/C++ is not this confusing. It's a C specific flaw. C and C++ are different languages, if you can't see how this kind of problem never happens in C++, you need to read another book on it. – Puppy Oct 21 '10 at 08:35
1

@R.: With compound literals yes, but not with the returns of `memcpy` (at least I didn't find one), but with an auxiliary `inline` function. Such a thing easily produces optimized code; see my answer (which took up the idea of GMan). – Jens Gustedt Oct 21 '10 at 12:16
@DeadMG, sorry you are incorrect. C++ like C has macros and hence can have the exact same flaw. – JaredPar Oct 21 '10 at 16:15
1

@JaredPar: Except that you don't need or want a macro to do this in C++, thus instantly avoiding the problem. Infact, there is such a thing as std::swap(), which has none of these issues. – Puppy Oct 21 '10 at 21:26
@DeadMG, the discussion is on how to properly implement a swap macro and my arguments hold with the language being C or C++. The discussion is not "what is the best swap routine". If it was then yes `std::swap` is clearly a winner. – JaredPar Oct 21 '10 at 21:29
@JaredPar: A flaw in macros is only a flaw in the language if you actually use macros. Flaws in macros like this are supserseded by the template system. Trying to suggest that C++ fails because a specific macro can't be implemented easily but it's functionality could be very easily duplicated in a far superior fashion with another language feature is plain wrong. It's like complaining about doing addition in binary by hand and ignoring the in-built arithmetic datatypes. You've taken a limitation of one system that was explicitly designed around in another system and called it a language limit. – Puppy Oct 21 '10 at 22:18
@DeadMG, no. I've taken a language feature available in 2 related languages and pointed out a flaw in the usage of that feature which is applicable to both languages. Again, this is not a discussion on what is the best approach to problem X, it is a discussion of how to best implement a solution with feature Y. – JaredPar Oct 21 '10 at 22:19
5

The question was not "how to implement a swap macro", it's "how to implement a swap macro in C". In C++, you don't implement a swap macro. It's that simple. This question does not concern or relate to C++. Talking about C/C++, especially in this specific context when C++ takes a totally different approach to the same problem, is incredibly wrong. – Puppy Oct 21 '10 at 22:20
Why is the do{}while() loop in the code? Won`t a simple block execute? – Ilian Zapryanov Mar 11 '14 at 05:09
@adamk Can you please explain why the sizeof function works here but not if I write this is as general function instead of a macro. – Sohaib Feb 07 '15 at 17:39
This silently misbehaves if the arguments have side-effects, e.g. `swap(arr[n++], var);` – M.M Nov 05 '17 at 21:05
The suggested macro runs into problems if one of the arguments is a pointer (e.g. an argument passed into the function invoking the macro) and the other is a local array of the same size as the array pointed at by the pointer. Or if both arguments are pointers and swapping the data that they're pointing at is the intended operation, not swapping the pointer value. These aren't insuperable problems with suitable alternative macros — they just require care in the use of the correct macro. – Jonathan Leffler May 28 '18 at 05:43

Paul R · Answer 2 · 2015-03-22T08:16:30.573

70

You can do something like this:

#define SWAP(x, y, T) do { T SWAP = x; x = y; y = SWAP; } while (0)

which you would then invoke like this:

SWAP(a, b, int);

or:

SWAP(x, y, float);

If you are happy to use gcc-specific extensions then you can improve on this like so:

#define SWAP(x, y) do { typeof(x) SWAP = x; x = y; y = SWAP; } while (0)

and then it would just be:

SWAP(a, b);

or:

SWAP(x, y);

This works for most types, including pointers.

Here is a test program:

#include <stdio.h>

#define SWAP(x, y) do { typeof(x) SWAP = x; x = y; y = SWAP; } while (0)

int main(void)
{
    int a = 1, b = 2;
    float x = 1.0f, y = 2.0f;
    int *pa = &a;
    int *pb = &b;

    printf("BEFORE:\n");
    printf("a = %d, b = %d\n", a, b);
    printf("x = %f, y = %f\n", x, y);
    printf("pa = %p, pb = %p\n", pa, pb);

    SWAP(a, b);     // swap ints
    SWAP(x, y);     // swap floats
    SWAP(pa, pb);   // swap pointers

    printf("AFTER:\n");
    printf("a = %d, b = %d\n", a, b);
    printf("x = %f, y = %f\n", x, y);
    printf("pa = %p, pb = %p\n", pa, pb);

    return 0;
}

edited Mar 22 '15 at 08:16

answered Oct 20 '10 at 21:18

Paul R

208,748
37
389
560

1

Fast, clear, correct. +1 – nmichaels Oct 20 '10 at 21:22
1

I'm curious; why the `do{}while(0)` construct? To combat macro expansion issues? – You Oct 20 '10 at 21:30
12

@You: it's a canonical form for macros which addresses the problem of an invalid `;` in an if/else construct. If you take away the `do`/`while (0)` then an expression such as `if (foo) SWAP(x, y); else SWAP(a, b);` will generate an error, as it expands to `if (foo) { ... }; else { ... };`, and the `};` immediately before the `else` will generate a compile error. – Paul R Oct 20 '10 at 21:32
1

`temp##x##y` is going to cause problems for otherwise valid expressions such as `SWAP(a->b, c.d);` or `SWAP(a[0], a[1]);` – ideasman42 Mar 22 '15 at 08:00
@ideasman42: good point - I need to think of a better way to generate a relatively unique name for the temporary - any ideas ? – Paul R Mar 22 '15 at 08:04
2

@Paul R, Call the variable `SWAP`, confusing but works :) – ideasman42 Mar 22 '15 at 08:12
@ideasman42: now *that* is fiendishy clever - I like it - I'll update the answer to use this instead of the token pasting kludge if you don't mind. – Paul R Mar 22 '15 at 08:13
1

@Paul R, jokes aside, shadowing variables is allowed in C. so even if you get a naming collision, its not going to fail since its in a nested block, unless you explicitly enable `-Werror=shadow` or similar. – ideasman42 Mar 22 '15 at 08:17
4

@ideasman42: yes, it wasn't so much shadowing that this kludge was trying to address though - the problem is that if one of the parameters is called say "temp" and the local temporary variable is also called "temp" then it breaks, hence the need to use a name which is as close to unique as possible. (This could even happen with a temporary named "SWAP" of course, but it's highly unlikely that anyone would have a variable named "SWAP" in this context.) – Paul R Mar 22 '15 at 08:20
1

This version misbehaves for `SWAP( arr[n++], var );` Which is maybe acceptable if you expect the reader to understand the convention that ALL CAPS means macros, and macro arguments should not have side-effects; however there are other solutions available that solve this problem – M.M Nov 05 '17 at 21:02
@M.M: yes, fair point - it can probably be improved but I think there will always be corner cases with macros like this. Better not to use them in the first place. – Paul R Nov 05 '17 at 21:10

Jens Gustedt · Answer 3 · 2010-10-21T13:07:18.173

GMan started this attempt, to code this in combination of an inline function and a macro. This solution supposes that you have modern C compiler that supports C99, since it uses a compound literal:

inline void swap_detail(void* p1, void* p2, void* tmp, size_t pSize)
{
   memcpy(tmp, p1, pSize);
   memcpy(p1, p2, pSize);
   memcpy(p2 , tmp, pSize);
}
#define SWAP(a, b) swap_detail(&(a), &(b), (char[(sizeof(a) == sizeof(b)) ? (ptrdiff_t)sizeof(a) : -1]){0}, sizeof(a))

This has the following properties:

It evaluates each of a and b only once.
It has a compile time check for the correct sizes.
It has no naming issue with a hidden variable.
The size of the temporary variable is computed at compile time, so the compound literal is not a dynamic array.

The cast (ptrdiff_t) is needed such that the -1 is not silently promoted to SIZE_MAX.

This solution still suffers from two drawbacks:

It is not type safe. It only checks for the sizes of the types, not their semantics. If the types differ, say a double of size 8 and a uint64_t, you are in trouble.
The expressions must allow the & operator to be applicable. Thus it will not work on variables that are declared with the register storage class.

I like how terse it is, I didn't know you could do that with arrays. — GManNickG, Oct 20 '10 at 23:26
`sizeof *(1 ? &(a) : &(b))` would be a simpler and more type-safe option for the third argument to `swap_detail` . [Credit here](https://stackoverflow.com/a/20380289/1505939) — M.M, Nov 05 '17 at 21:00

Mr. Llama · Answer 4 · 2010-10-22T02:00:26.073

9

Simply put: you cannot make a generic swap macro in C, at least, not without some risk or headache. (See other posts. Explanation lies below.)

Macros are nice, but the problem you'll run into with actual code would be data type issues (as you've stated). Furthermore, macros are "dumb" in a way. For example:

With your example macro #define swap(x,y) { x = x + y; y = x - y; x = x - y; }, swap(++x, y) turns into { ++x = ++x + y; y = ++x - y; ++x = ++x - y;}.

If you ran int x = 0, y = 0; swap(++x, y); you'd get x=2, y=3 instead of x=0, y=0. On top of this, if any of the temp variables in your macro appear in your code, you could run into some annoying errors.

The functionality you're looking for were introduced in C++ as templates. The closest you can get in C is using an inline function for each data type imaginable or a decently complex macro (see previous macro issues and previous posts).

Here's what that the solution using templates in C++ looks like:

template<typename T>
inline void swap(T &x, T &y)
{
    T tmp = x;
    x = y; y = tmp;
}

In C you'd need something like:

inline void swap_int(int *x, int *y) { /* Code */ }
inline void swap_char(char *x, char *y) { /* Code */ }
// etc.

or (as mentioned a few times) a fairly complex macro which has potential to be hazardous.

edited Oct 22 '10 at 02:00

answered Oct 20 '10 at 22:30

Mr. Llama

20,202
2
62
115

5

+1 for "not without some risk or headache" – ysap Oct 21 '10 at 01:06
1

No, your so-said C solution at the end isn't one, references are not part of C. – Jens Gustedt Oct 21 '10 at 07:43
Whoops! Thanks for pointing that out. Meant for those to be pointers. Will tweak it. – Mr. Llama Oct 21 '10 at 20:23
1

The C solution is still not C. You can't have multiple definitions of a function with different parameters defined. Maybe `swap_int()`, `swap_char()`, etc.? – tomlogic Oct 22 '10 at 00:49
Man, they really didn't teach us this kind of stuff in school. Thanks for the tips guys. – Mr. Llama Oct 22 '10 at 02:00

score 0 · Answer 5 · answered Oct 20 '10 at 21:19

0

This does not necessarily work fine for int according to the standand. Imagine the case where x and y were INT_MAX and INT_MAX-1 respectively. The first addition statement would result in a signed overflow which is undefined by the standard.

A more reliable implementation of the swap macro for int would be the XOR swap algorithm

http://en.wikipedia.org/wiki/XOR_swap_algorithm

answered Oct 20 '10 at 21:19

JaredPar

733,204
149
1,241
1,454

5

-1 purely on the basis that the XOR swap algorithm is never the correct answer... – Oliver Charlesworth Oct 20 '10 at 21:20
1

@Oli, that is in fact a terrible reason to downvote. It is a completely valid answer to the problem of swapping integers. – JaredPar Oct 20 '10 at 21:21
@JaredPar: I agree it works, but it doesn't solve the OP's problem of a generic swap macro, and using XOR to swap is a really grim idea, unless you're writing ultra-optimised assembler on particular platforms. The other answers that suggest using temp variables are far more appropriate... – Oliver Charlesworth Oct 20 '10 at 21:23
@Oli, did I say it would solve the OP's problem? No. I pointed out that even the solution for `int` was incorrect (OP asserted otherwise) and gave a solution for the problem of `int` (and explicitly stated it was for `int`). Other answers being more complete (they all have their issues) is not a reason to downvote a different valid answer. It's a reason to upvote theres – JaredPar Oct 20 '10 at 21:25
1

@Oli: Or ultra-slow assembler. Using a temporary is faster on modern CPU's. – GManNickG Oct 20 '10 at 21:27
@GMan: Yes, it only helps on simple CPUs (but not necessarily non-modern). – Oliver Charlesworth Oct 20 '10 at 21:28
@JaredPar: Then this would be better suited as a comment, rather than an answer, IMHO. – Oliver Charlesworth Oct 20 '10 at 21:29
1

@GMan, @Oli the temporary solutions provided in other answers are flawed. They all fail silently in the case where the temporary has the same name as the decided upon name in the macro. – JaredPar Oct 20 '10 at 21:47
@JaredPar: good point - I've now updated my macro to avoid this problem. – Paul R Oct 20 '10 at 21:53
1

@Paul, the new code avoids this problem but only works if variables are used. It's not possible values which result from expressionsn (pointer dereference for example). The plus side though is it forces a compilation error instead of silently compiling and failing. – JaredPar Oct 20 '10 at 21:59
1

@JaredPar: another good point - just goes to reinforce the collective wisdom as to why macros should be avoided. ;-) – Paul R Oct 20 '10 at 22:02

score -2 · Answer 6 · answered Oct 21 '10 at 07:55

-2

Seriously, how many swaps do you have to do in your code that it is worth all the headaches coming up here in this thread with the given solutions? I mean, this is not a 10 line complicated and error prone code structure, it is a well-known idiom with one temporary variable and three simple assignments. Write it where you need it, even in one line if you want to save space:

function foo ()
{
    int a, b, tmp;
    ...
    tmp = a; a = b; b = tmp;
    ....
}

Or use a "local" macro where a and b are more complex.

#define SWAP(t,a,b) ( (t) = (a), (a) = (b), (b) = (t) )

function foo (pointer)
{
    int tmp;
    ...
    SWAP(tmp, pointer->structure.array[count+1], pointer->structure.array[count+2]);
    ...
}

#undef SWAP

answered Oct 21 '10 at 07:55

Secure

4,268
1
18
16

1

@Secure: sure just your snipsets perfectly demostrate the dangers. First of all it only assumes that the three types in question are assignment compatible. You easily will have loss of information if the types differ by accident. For your first snipset you should at least go with C99 and declare `tmp` there where you use it. This would at least document what type you expect. The second snipset is even worse, since it doesn't even permit the declaration at the same place. It shows how error prone this is. – Jens Gustedt Oct 21 '10 at 12:06
@Jens: When you use an argument, then try it for both sides. If assignment compatibility is an issue at the local scope I've used here, how **critical** and **dangerous** will it become at the global scope used in the other solutions? None of them does a type compatibility check for x and y except for the sizeof, as far as I've seen, swapping pointers with ints without a warning. Your solution included. – Secure Oct 21 '10 at 12:38
@Secure: you are right that all these solutions suffer for types that have the same size but different semantics. But at least they check for size compatibility, yours doesn't. The problem I had in mind was a quite simple one, where e.g `a` is `size_t` and `b` is `int`. This may work for years until you encounter a buggy case. I'll add a remark for that to my solution. – Jens Gustedt Oct 21 '10 at 13:00
@Jens: Well, these are three simple direct assignments `a = b;`, no complex calculations with type conversions and all that involved. Writing bugs just happens, but seriously, if you can't get **this** right on a regular basis, then you won't be able to write any working program beyond the complexity of Hello World, IMHO. You're talking about a general problem here, not specific to swap. How will you ever be able to guarantee the type correctness of arguments given to a function? The very same problem, but much better hidden than in the direct assignment. – Secure Oct 21 '10 at 13:16
@Secure: I personally find function calls much more readable. Passing on wrongly typed arguments usually clearly shows up in a debugger, e.g. But it seem that we just disagree on that point. Be it so. – Jens Gustedt Oct 21 '10 at 13:28
@Jens: As I've said, when using an argument then try it for both sides. ;) The wrong tmp type doesn't show up in a debugger? – Secure Oct 21 '10 at 13:48
@Secure: yeah look at both sides ;-) Sure the `tmp` shows up in a debugger, if you suspect it to be the culprit. Call stacks with the arguments that the functions receive appear more directly, without asking for them. I'd call that the advantage of a modular design... But your mileage seems to be different from mine. – Jens Gustedt Oct 21 '10 at 14:54

Implement generic swap macro in C

6 Answers6

Linked

Related