Some programmers argue that it is better to pass IEnumerable<T>
parameters over passing implementations like List<T>
, and one of the popular reasons to do this is that the API is immediately usable in more scenarios because more collections will be compatible with IEnumerable<T>
than any other specific implementation, e.g. List<T>
.
The more I dive into multi-threaded development scenarios, the more I am starting to think that IEnumerable<T>
is not the correct type either and I will try to explain why below.
Have you ever received the following exception while enumerating a collection?
Collection was modified; enumeration operation may not execute. (an InvalidOperationException)
Basically what causes this is, you are given the collection on one thread, while someone else modifies the collection on another thread.
To circumvent the problem, I then developed a habit to take a snapshot of the collection before I enumerate it, by converting the collection to an array inside the method, like this:
static void LookAtCollection(IEnumerable<int> collection)
{
foreach (int Value in collection.ToArray() /* take a snapshot by converting to an array */)
{
Thread.Sleep(ITEM_DELAY_MS);
}
}
My question is this. Wouldn't it be better to code towards arrays instead of enumerables as a general rule, even if it means that callers are now forced to convert their collections to arrays when using your API?
Is this not cleaner and more bullet-proof? (the parameter type is an array instead)
static void LookAtCollection(int[] collection)
{
foreach (int Value in collection)
{
Thread.Sleep(ITEM_DELAY_MS);
}
}
The array meets all the requirements of being enumerable and of a fixed length, and the caller is aware of the fact that you will be operating on a snapshot of the collection at that point in time, which can save more bugs.
The only better alternative I can find right now is the IReadOnlyCollection which will then be even more bullet proof because the collection is then also readonly in terms of item-content.
EDIT:
@DanielPryden provided a link to a very nice article "Arrays considered somewhat harmful". And the comments made by the writer "I rarely need a collection which has the rather contradictory properties of being completely mutable, and at the same time, fixed in size" and "In almost every case, there is a better tool to use than an array." kind of convinced me that arrays are not as close to the silver bullet as I had hoped for, and I agree with the risks and loopholes. I think now the IReadOnlyCollection<T>
interface is a better alternative than both the array and the IEnumerable<T>
, but it kind of leaves me with the question now: Does the callee enforce it by having a IReadOnlyCollection<T>
parameter type in the method declaration? Or should the caller still be allowed to decide what implementation of IEnumerable<T>
he passes into the method that looks at the collection?
Thanks for all the answers. I learned a lot from these responses.