Making swap faster, easier to use and exception-safe

Question

I could not sleep last night and started thinking about std::swap. Here is the familiar C++98 version:

template <typename T>
void swap(T& a, T& b)
{
    T c(a);
    a = b;
    b = c;
}

If a user-defined class Foo uses external ressources, this is inefficient. The common idiom is to provide a method void Foo::swap(Foo& other) and a specialization of std::swap<Foo>. Note that this does not work with class templates since you cannot partially specialize a function template, and overloading names in the std namespace is illegal. The solution is to write a template function in one's own namespace and rely on argument dependent lookup to find it. This depends critically on the client to follow the "using std::swap idiom" instead of calling std::swap directly. Very brittle.

In C++0x, if Foo has a user-defined move constructor and a move assignment operator, providing a custom swap method and a std::swap<Foo> specialization has little to no performance benefit, because the C++0x version of std::swap uses efficient moves instead of copies:

#include <utility>

template <typename T>
void swap(T& a, T& b)
{
    T c(std::move(a));
    a = std::move(b);
    b = std::move(c);
}

Not having to fiddle with swap anymore already takes a lot of burden away from the programmer. Current compilers do not generate move constructors and move assignment operators automatically yet, but as far as I know, this will change. The only problem left then is exception-safety, because in general, move operations are allowed to throw, and this opens up a whole can of worms. The question "What exactly is the state of a moved-from object?" complicates things further.

Then I was thinking, what exactly are the semantics of std::swap in C++0x if everything goes fine? What is the state of the objects before and after the swap? Typically, swapping via move operations does not touch external resources, only the "flat" object representations themselves.

So why not simply write a swap template that does exactly that: swap the object representations?

#include <cstring>

template <typename T>
void swap(T& a, T& b)
{
    unsigned char c[sizeof(T)];

    memcpy( c, &a, sizeof(T));
    memcpy(&a, &b, sizeof(T));
    memcpy(&b,  c, sizeof(T));
}

This is as efficient as it gets: it simply blasts through raw memory. It does not require any intervention from the user: no special swap methods or move operations have to be defined. This means that it even works in C++98 (which does not have rvalue references, mind you). But even more importantly, we can now forget about the exception-safety issues, because memcpy never throws.

I can see two potential problems with this approach:

First, not all objects are meant to be swapped. If a class designer hides the copy constructor or the copy assignment operator, trying to swap objects of the class should fail at compile-time. We can simply introduce some dead code that checks whether copying and assignment are legal on the type:

template <typename T>
void swap(T& a, T& b)
{
    if (false)    // dead code, never executed
    {
        T c(a);   // copy-constructible?
        a = b;    // assignable?
    }

    unsigned char c[sizeof(T)];

    std::memcpy( c, &a, sizeof(T));
    std::memcpy(&a, &b, sizeof(T));
    std::memcpy(&b,  c, sizeof(T));
}

Any decent compiler can trivially get rid of the dead code. (There are probably better ways to check the "swap conformance", but that is not the point. What matters is that it's possible).

Second, some types might perform "unusual" actions in the copy constructor and copy assignment operator. For example, they might notify observers of their change. I deem this a minor issue, because such kinds of objects probably should not have provided copy operations in the first place.

Please let me know what you think of this approach to swapping. Would it work in practice? Would you use it? Can you identify library types where this would break? Do you see additional problems? Discuss!

Most present use cases for `std::swap` will have better solutions using move semantics anyway. — aschepler, Feb 02 '11 at 14:23
Yes move semantics and move constructors. See this, http://stackoverflow.com/questions/4820643/understanding-stdswap-what-is-the-purpose-of-tr1-remove-reference — Michael Smith, Feb 02 '11 at 14:40
consider `swap(*polymorphicPtr1,*polymorphicPtr2)`... your swap function would swap the vtable of both objects aswell... wich will cause havoc if someone calls a virtual function ofter the call to swap. — smerlin, Feb 02 '11 at 14:41
@smerlin: But polymorphic objects *really* should not have copy constructors or assignment operators, right? — fredoverflow, Feb 02 '11 at 14:51
@FredOverflow: Yep, but the question is: Can you write a static assertion to check if T is a polymorphic type or not. Because otherwise someone WILL use it with polymorphic types, despite it is not a good habit to have assignment-operators for polymorphic types (imo: copy-constructors are okay sometimes). — smerlin, Feb 02 '11 at 15:16
@smerlin: Actually, yes :) C++0x provides a `std::is_polymorphic` type trait. — fredoverflow, Feb 02 '11 at 15:20

sbi · Accepted Answer · 2011-08-24T07:16:58.783

20

So why not simply write a swap template that does exactly that: swap the object representations*?

There's many ways in which an object, once being constructed, can break when you copy the bytes it resides in. In fact, one could come up with a seemingly endless number of cases where this would not do the right thing - even though in practice it might work in 98% of all cases.

That's because the underlying problem to all this is that, other than in C, in C++ we must not treat objects as if they are mere raw bytes. That's why we have construction and destruction, after all: to turn raw storage into objects and objects back into raw storage. Once a constructor has run, the memory where the object resides is more than only raw storage. If you treat it as if it weren't, you will break some types.

However, essentially, moving objects shouldn't perform that much worse than your idea, because, once you start to recursively inline the calls to std::move(), you usually ultimately arrive at where built-ins are moved. (And if there's more to moving for some types, you'd better not fiddle with the memory of those yourself!) Granted, moving memory en bloc is usually faster than single moves (and it's unlikely that a compiler might find out that it could optimize the individual moves to one all-encompassing std::memcpy()), but that's the price we pay for the abstraction opaque objects offer us. And it's quite small, especially when you compare it to the copying we used to do.

You could, however, have an optimized swap() using std::memcpy() for aggregate types.

edited Aug 24 '11 at 07:16

answered Feb 02 '11 at 14:18

sbi

219,715
46
258
445

s/praxis/practice/ ;-) Also, it would be nice to add a link to the POD/aggregate FAQ. – fredoverflow Feb 02 '11 at 14:58
1

@Fred, my dictionary says that "praxis" is a perfectly valid English word. Isn't it? English is not my native language, but I'm curious. – Sergei Tachenov Feb 02 '11 at 15:05
@Sergey: Google has 27.700.000 search results for "in practice", but only 214.000 for "in praxis"... – fredoverflow Feb 02 '11 at 15:07
1

are you sure about the absence of coalescence of adjacent moves into a single one ? It seems like a trivial optimization. (I suppose it could get thrown off by the presence of "not moved" data like virtual table pointers) – Matthieu M. Feb 02 '11 at 15:21
Matthieu: No, I'm not sure. Note that I wrote "it's unlikely". – sbi Feb 02 '11 at 23:20
@sbi: I realize what you're saying, but I'm also curious -- you come up with an *actual example* of where `memcpy`'ing an object breaks it? The *only* instance I know of is for types that hold pointers to themselves... which I don't think I've ever seen in practice yet. – user541686 Jul 24 '12 at 17:11
@Mehrdad: I'd find bit-wise copying suspicious every time a class has a copy ctor/assignment op. Of course, swapping doesn't require as much attention as actual copying, but it still leaves enough cases. For an actual example, consider objects that have their address registered somewhere, and where copying etc. updates that registry. (Basically, every case where the object is referred to by itself or some other object.) – sbi Jul 24 '12 at 20:09
@sbi: We were actually discussing that [here](http://stackoverflow.com/a/11638327/541686); feel free to post there. I realize that *if* there is a cyclic dependency *and* the objects would normally update the external references with a normal `swap`, *then* there would be trouble... but in practice the only situation we could think of was iterators, which would get invalidated anyway. See that post for the discussion. So in practice, it seems to me like if you know you don't have auto-updating cyclic dependencies (which aren't really common), then you should be fine? – user541686 Jul 24 '12 at 20:26

Sergei Tachenov · Answer 2 · 2011-02-02T15:03:29.243

20

This will break class instances that have pointers to their own members. For example:

class SomeClassWithBuffer {
  private:
    enum {
      BUFSIZE = 4096,
    };
    char buffer[BUFSIZE];
    char *currentPos; // meant to point to the current position in the buffer
  public:
    SomeClassWithBuffer();
    SomeClassWithBuffer(const SomeClassWithBuffer &that);
};

SomeClassWithBuffer::SomeClassWithBuffer():
  currentPos(buffer)
{
}

SomeClassWithBuffer::SomeClassWithBuffer(const SomeClassWithBuffer &that)
{
  memcpy(buffer, that.buffer, BUFSIZE);
  currentPos = buffer + (that.currentPos - that.buffer);
}

Now, if you just do memcpy(), where would currentPos point? To the old location, obviously. This will lead to very funny bugs where each instance actually uses another's buffer.

edited Feb 02 '11 at 15:03

answered Feb 02 '11 at 14:20

Sergei Tachenov

24,345
8
57
73

1

Honestly, making a `Reader` object copy-constructible and assignable seems like a design error to me. – fredoverflow Feb 02 '11 at 14:45
1

@Fred, it is just an abstract example. I should probably have named it "SomeClassWithBuffer" instead, but that's irrelevant. – Sergei Tachenov Feb 02 '11 at 14:59
@Matthieu, that issue was mentioned by the OP, so I didn't mention that. There are probably more issues that it seems at first, too. – Sergei Tachenov Feb 02 '11 at 15:01
Also the buffer the OP suggests may not be correctly aligned for `T`. – Motti Feb 02 '11 at 15:23
@Motti, I think this is not a problem, since it is only for temporary storage, and T is treated as an array of bytes which is perfectly valid for any type. – Sergei Tachenov Feb 02 '11 at 15:27
@FredOverflow: It's not your position to judge that. It's perfectly legal within the language, and you're not doing anyone any benefits by doing it that way. – Puppy Feb 02 '11 at 16:30
6

Upvoted: Pertinent trivia: Every node-based std::container in both libstdc++ and in libc++ (http://libcxx.llvm.org/) has the design Sergey illustrates (and would thus break with a memcpy-swap). This is the "embedded end node" optimization and is one of the more important optimizations for a node-based container. This enables both a noexcept default constructor and a noexcept move constructor. Imho it doesn't get much more important than that. Naturally these containers can and do create their own swap overloads. But the points is: Sergey's design is not rare. – Howard Hinnant May 20 '11 at 17:44

Fred Nurk · Answer 3 · 2011-02-02T14:32:13.250

Some types can be swapped but cannot be copied. Unique smart pointers are probably the best example. Checking for copyability and assignability is wrong.

If T isn't a POD type, using memcpy to copy/move is undefined behavior.

The common idiom is to provide a method void Foo::swap(Foo& other) and a specialization of std::swap<Foo>. Note that this does not work with class templates, …

A better idiom is a non-member swap and requiring users to call swap unqualified, so ADL applies. This also works with templates:

struct NonTemplate {};
void swap(NonTemplate&, NonTemplate&);

template<class T>
struct Template {
  friend void swap(Template &a, Template &b) {
    using std::swap;
#define S(N) swap(a.N, b.N);
    S(each)
    S(data)
    S(member)
#undef S
  }
};

The key is the using declaration for std::swap as a fallback. The friendship for Template's swap is nice for simplifying the definition; the swap for NonTemplate might also be a friend, but that's an implementation detail.

Puppy · Answer 4 · 2011-02-02T16:28:01.127

I deem this a minor issue, because such kinds of objects probably should not have provided copy operations in the first place.

That is, quite simply, a load of wrong. Classes that notify observers and classes that shouldn't be copied are completely unrelated. How about shared_ptr? It obviously should be copyable, but it also obviously notifies an observer- the reference count. Now it's true that in this case, the reference count is the same after the swap, but that's definitely not true for all types and it's especially not true if multi-threading is involved, it's not true in the case of a regular copy instead of a swap, etc. This is especially wrong for classes that can be moved or swapped but not copied.

because in general, move operations are allowed to throw

They are most assuredly not. It is virtually impossible to guarantee strong exception safety in pretty much any circumstance involving moves when the move might throw. The C++0x definition of the Standard library, from memory, explicitly states any type usable in any Standard container must not throw when moving.

This is as efficient as it gets

That is also wrong. You're assuming that the move of any object is purely it's member variables- but it might not be all of them. I might have an implementation-based cache and I might decide that within my class, I should not move this cache. As an implementation detail it is entirely within my rights not to move any member variables that I deem are not necessary to be moved. You, however, want to move all of them.

Now, it's true that your sample code should be valid for a lot of classes. However, it's extremely very definitely not valid for many classes that are completely and totally legitimate, and more importantly, it's going to compile down to that operation anyway if the operation can be reduced to that. This is breaking perfectly good classes for absolutely no benefit.

+1 for pointing out: "... The C++0x definition of the Standard library, from memory, explicitly states any type usable in any Standard container must not throw when moving. ...." — Martin Ba, Feb 03 '11 at 07:42
and I would add another +1 for "... and more importantly, it's going to compile down to that operation anyway ..." — Martin Ba, Feb 03 '11 at 07:42

score 1 · Answer 5 · answered Feb 02 '11 at 14:50

your swap version will cause havoc if someone uses it with polymorphic types.

consider:

Base *b_ptr = new Base();    // Base and Derived contain definitions
Base *d_ptr = new Derived(); // of a virtual function called vfunc()
yourmemcpyswap( *b_ptr, *d_ptr );
b_ptr->vfunc(); //now calls Derived::vfunc, while it should call Base::vfunc
d_ptr->vfunc(); //now calls Base::vfunc while it should call Derived::vfunc
//...

this is wrong, because now b contains the vtable of the Derived type, so Derived::vfunc is invoked on a object which isnt of type Derived.

The normal std::swap only swaps the data members of Base, so this is OK with std::swap

Swapping only the data members of `Base` can break the invariants of the `Derived` object. That's one reason why it does not make much sense to have an assignment operator for polymorphic objects. Note that Bjarne Stroustrup considers it a historical accident that the assignment operator is provided for every user-defined class by default. — fredoverflow, Feb 02 '11 at 14:54

Making swap faster, easier to use and exception-safe

5 Answers5

Linked