Given the following merge sort algorithm:

mergesort(A, p, r)

    if (p >= r) return    // constant amount of time
    int q = (p+r)/2       // constant amount of time

    mergesort(A, p, q)      // these two calls will decide the
    mergesort(A, q+1, r)    // O(log n) factor inside O(n log n), right?

    merge(A, p, q, r)       // let's dwell further


merge(A, p, q, r) {

    n1 = q-p+1         // constant time
    n2 = r-q           // constant time

    // Let L[1..n1+1] and R[1..n2+1] be new arrays    // idk, let's say constant

    for i = 1 to n1: L[i] = A[p+i-1]
    for j = 1 to n2: R[j] = A[q+j]   // shouldn't this take time varying on size of array?
                                     // also extra space too?

    i = 1, j = 1       // constant time

    for k = p to r     // everything below this will contribute to O(n)
                       // inside O(n log n), amirite?
        if L[i] <= R[j]
            A[k] = L[i]
            i++
        else
            A[k] = R[j]
            j++

How come we estimate O(n log n) time complexity for it, keeping in mind that there are left and right arrays being created to be merged back?

And how come the space complexity is only O(n) if extra space is being used? Won't both of them be increased by n, because filling up an array takes O(n) and L[] and R[] are created at each recursion step?

Jay Kominek
hg_git

2 Answers

4

I suggest you reason about this by drawing a tree on paper: first write down your whole array:

2 4 7 1 3 4 6 2 3 7 ...

Then write what the recursion causes it to be split in below it:

 2 4 7 1 3 4 6 2 3 7 ...

          |
2 4 7 1 3   4 6 2 3 7 ...

    |            |
2 4   7 1 3  4 6   2 3 7

And so on with each piece.

Then, count how many rows you've written. This will be close to the base 2 logarithm of the number of elements you started with (O(log n)).

Now, how much work is being done for each row? It's O(n). Merging two arrays of lengths n1, n2 will take O(n1 + n2), even if you have to allocate space for them (and you don't in a proper implementation). Since each row in the recursion tree has n array elements, it follows that the work done for each row is O(n) and therefore the entire algorithm is O(n log n).

And how come the space complexity is only O(n) if extra space is being used? Won't both of them be increased by n, because filling up an array takes O(n) and L[] and R[] are created at each recursion step?

This is more interesting. If you do indeed create new L, R arrays at each recursion step, then the space complexity will be O(n log n). But you don't do that. You create one extra array of size n at the beginning (think of it as a global variable), and then you store the result of each merge into it.

You only pass around things that help you identify the subarrays, such as their sizes and the indexes they begin at. Then you access them using the original array and store the result of the merge in the globally allocated array, resulting in O(n) extra space:

global_temp = array of size equal to the array you're sorting

merge(a, p, q, r){

    i = p
    j = q              // constant time
    k = p              // write position in global_temp

    while i < q and j <= r // or a similar condition
        if A[i] <= A[j]
            global_temp[k++] = A[i]
            i++
        else
            global_temp[k++] = A[j]
            j++

     // TODO: copy remaining elements into global_temp
     // TODO: copy global_temp into A[p..r]
IVlad
  • Yup, I understood the size, create it at the start and pass by reference so that new space isn't allocated each time. But what about filling new arrays Left[] and Right[] at each recursion step. Won't they add up to running time complexity? I'm sorry for any trouble, I'm just curious. – hg_git May 11 '15 at 14:46
  • @hg_git - I've added pseudocode that doesn't even involve filling them up :). But even if you did fill them up, it wouldn't increase the complexity, because it would just be another for loop. It would increase the constant factors, but those are ignored by big-oh. Even if you allocated them, it wouldn't increase the big-oh for time (it would for space). – IVlad May 11 '15 at 14:49
  • 1
    Even though we create new L, R arrays at each recursion step, the space complexity will be O(n) instead of O(n*lgn). – llinvokerl Sep 08 '16 at 12:37
1

Your question is unclear, but perhaps you are confused by the extra space you need.

Obviously, on the first pass (and every pass) you read the entire data, and merge each partition into one twice as big.

Let's just focus on 8 elements.

8 7 6 5 4 3 2 1

In the first pass, the size of each partition is 1, and you are merging them to size=2. So you read the 8 and the 7, and merge them into a partition:

7 8     5 6     3 4     1 2

The next stage is to merge groups of 2 into groups of 4. Obviously, you have to read every element. So both of these passes take O(n) operations. The number of times to double is log2(n), which is why this algorithm is O(n log n).

In order to merge, you need extra room. You could recycle it. But the worst case is when you merge two n/2 partitions into n (the last time). The easy way to envision that is to allocate a buffer big enough to copy the entire data into. That would be O(n) storage.

5 6 7 8        1 2 3 4
i              j
EMPTY Buffer int buf[8]
k = 0
buf[k++] = (orig[j] < orig[i]) ? orig[j++] : orig[i++]
Dov
  • I'm asking about the merge operation where we create 2 sub-arrays left[] and right[] and fill them to merge them back. Doesn't that add extra space and time complexity? – hg_git May 11 '15 at 14:39
  • You merge two partitions into one. I'm not sure why you are talking about left and right. Given that you currently have left and right, you need one partition as big as both. Copying from two partitions of size = k is O(2k). For example, if you copy 2 partitions of size = 2 you will end with one partition of size = 4. You can then reuse the memory that was the array for the other side. You do not need to keep allocating. Think of it this way: at each level you copy the complete array into another location. Then you can use the array as the temp for the next time. – Dov May 11 '15 at 14:42
  • I've made changes and introduced comments inside pseudo code. Can you check :) – hg_git May 11 '15 at 14:43