How does ThreadLocalRandom.nextLong(long n) work?

Question

This is the code I don't understand:

// Divide n by two until small enough for nextInt. On each
// iteration (at most 31 of them but usually much less),

What? With my trivial simulation for a randomly chosen n I get up to 32 iterations and 31 is the average.

// randomly choose both whether to include high bit in result
// (offset) and whether to continue with the lower vs upper
// half (which makes a difference only if odd).

This makes sense.

long offset = 0;
while (n >= Integer.MAX_VALUE) {
    int bits = next(2);

Get two bits for the two decisions, makes sense.

    long half = n >>> 1;
    long nextn = ((bits & 2) == 0) ? half : n - half;

Here, n is n/2.0 rounded up or down, nice.

    if ((bits & 1) == 0) offset += n - nextn;
    n = nextn;

I'm lost.

}
return offset + nextInt((int) n);

I can see that it generates a properly sized number, but it looks pretty complicated and rather slow and I definitely can't see why the result should be uniformly distributed.¹

¹It can't be really uniformly distributed as the state is 48 bits only, so it can generate no more than 2**48 different numbers. The vast majority of longs can't be generated, but an experiment showing it would probably take years.

rolfl · Accepted Answer · 2013-11-04T12:45:49.730

I think you're somewhat mistaken.... let me try to describe the algorithm the way I see it:

First, assume that nextInt(big) (nextInt, not nextLong) is correctly able to generate a well distributed range of values between 0 (inclusive) and big (exclusive).

This nextInt(int) function is used as part of nextLong(long)

So, the algorithm works by looping until the value is less than Integer.MAX_INT, at which point it uses nextInt(int). The more interesting thing is what it does before that...

Mathematically, if we take half a number, subtract half of it, and then subtract half of the half, and then half of the half of the half, etc. and we do that enough times it will tend toward zero. Similarly, if you take half of a number, and add it to half of the half of the half, etc. it will tend toward the original number.

What the algorithm does here is it takes half of the number. By doing integer division, if the number is odd, then there's a 'big' half and a 'small' half. The algorithm 'randomly' chooses one of those halves (either big or small).

Then, it randomly chooses to add that half, or not add that half to the output.

It keeps halving the number and (possibly) adding the half until the half is less than Integer.MAX_INT. At that point it simply computes the nextInt(half) value and adds that to the result.

Assuming the initial long limit was much greater than Integer.MAX_VALUE then the final result will get all the benefit of nextInt(int) for a large int value which is at least 32 bits of state, as well as 2 bits of state for all the higher bits above Integer.MAX_VALUE.

The larger the orignal limit is (closer it is to Long.MAX_VALUE), the more times the loop will iterate. At worst case it will go 32 times, but for smaller limits it will go through fewer times. At worst case, for very large limits you will get 64 bits of randomness used for the loops, and then whatever is needed for the nextInt(half) too.

EDIT: WalkThrough added - Working out the number of outcomes is harder, but, all values of long from 0 to Long.MAX_VALUE - 1 are possible outcomes. As 'proof' using nextLong(0x4000000000000000) is a neat example to start from because all the halving processes will be even and it has bit 63 set.

Because bit 63 is set (the most significant bit available because bit64 would make the number negative, which is illegal) it means that we will halve the value 32 times before the value is <= Integer.MAX_VALUE (which is 0x000000007fffffff - and our half will be 0x0000000004000000 when we get there....). Because halving and bit-shifting are the same process it holds that there are as many halves to do as the difference between the highest bit set and bit 31. 63 - 31 is 32, so we halve things 32 times, and thus we do 32 loops in the while loop. The initial start value of 0x4000000000000000 means that as we halve the value, there will only be one bit set in the half, and it will 'walk' down the value - shifting 1 to the right each time through the loop.

Because I chose the initial value carefully, it is apparent that in the while loop the logic is essentially deciding whether to set each bit or not. It takes half of the input value (which is 0x2000000000000000) and decides whether to add that to the result or not. Let's assume for the sake of argument that all our loops decide to add the half to the offset, in which case, we start with an offset of 0x0000000000000000, and then each time through the loop we add a half, which means each time we add:

0x2000000000000000
0x1000000000000000
0x0800000000000000
0x0400000000000000
.......
0x0000000100000000
0x0000000080000000
0x0000000040000000  (this is added because it is the last 'half' calculated)

At this point our loop has run 32 times, it has 'chosen' to add the value 32 times, and thus has at least 32 states in the value (64 if you count the big/little half decision). The actual offset is now 0x3fffffffc0000000 (all bits from 62 down to 31 are set).

Then, we call nextInt(0x40000000) which, as luck would have it, produces the result 0x3fffffff, using 31 bits of state to get there. We add this value to our offset and get the result:

0x3fffffffffffffff

With a 'perfect' distribution of nextInt(0x40000000) results we would have had a perfect coverage of the values 0x7fffffffc0000000 through 0x3fffffffffffffff with no gaps. With perfect randomness in our while loop, our high bits would have been a perfect distribution of 0x0000000000000000 through 0x3fffffffc0000000 Combined there is full coverage from 0 through to our limit (exclusive) of 0x4000000000000000

With the 32 bits of state from the high bits, and the (assumed) minimum 31 bits of state from nextInt(0x40000000) we have 63bits of state (more if you count the big/little half decision), and full coverage.

The last paragraph seems to oppose my claim about the uniform distribution. You're saying "64 bits of randomness", but there are only 48 bits of state. Whatever deterministic method you call, you can't get more then `2**48` different outcomes. — maaartinus, Nov 04 '13 at 07:13
Added a walk-through showing how all values for a large input are available — rolfl, Nov 04 '13 at 12:25
I see, but you're assuming that all the decisions are independent, which is true for subclasses like `SecureRandom`, but impossible for `Random` itself. Any deterministic computation starting from a known state must return the same result - that's by definition. Starting from one of 1000 states can return up to 1000 different states, right? Starting from `2**48` states.... you see? See also [this "Inverse function of Random function" question](http://stackoverflow.com/a/15237585/581205). — maaartinus, Nov 07 '13 at 04:31
I'm accepting your answer, it's nice and very detailed, but I'd be glad if you could somehow incorporate my last comment. — maaartinus, Nov 07 '13 at 04:33
Are you still concerned about whether all 2^63 values in the range 0..Long.MAX_VALUE can be generated, or are you satisfied with that part of the answer? If your concern is about the 48-bit seed in the Random class, then you should remember that there are 2^48 different possible pseudo-random sequences.... and the 48-bit seed does not affect the statistical distribution, range, or coverage of any one sequence. — rolfl, Nov 07 '13 at 05:10
I'm not concerned... I'm perfectly **sure** they *can not*. But this wasn't my question and your answer was good w.r.t. what I asked. Is betting allowed on this site? — maaartinus, Nov 07 '13 at 05:42

How does ThreadLocalRandom.nextLong(long n) work?

1 Answers1