More efficient simluation of 2 dice rolls - Python

Question

I wrote a program that records how many times 2 fair dice need to be rolled to match the probabilities for each result that we should expect.

I think it works but I'm wondering if there's a more resource friendly way to solve this problem.

 import random

 expected = [0.0, 0.0, 0.028, 0.056, 0.083, 
             0.111, 0.139, 0.167, 0.139, 0.111,
             0.083, 0.056, 0.028]

 results = [0.0] * 13  # store our empirical results here

 emp_percent = [0.0] * 13  # results / by count

 count = 0.0  # how many times have we rolled the dice? 

 while True:
     r = random.randrange(1,7) + random.randrange(1,7)  # roll our die
     count += 1 
     results[r] += 1
     emp_percent = results[:]

     for i in range(len(emp_percent)):
         emp_percent[i] /= count
         emp_percent[i] = round(emp_percent[i], 3)

     if emp_percent == expected:
         break

print(count)
print(emp_percent)

might have more luck over at http://codereview.stackexchange.com/ — pancho018, Sep 30 '15 at 17:55
This is a better fit for [Code Review](http://codereview.stackexchange.com/help/on-topic) — HPierce, Sep 30 '15 at 17:57
*"I think it works"* - then test it until you're sure one way or the other. If it's broken, give us a [mcve] with full error traceback; if not, go to [codereview.se]. At the very least add a tolerance to `emp_percent == expected`, comparing floats can go wrong easily. — jonrsharpe, Sep 30 '15 at 17:59
I believe that your premise is inherently flawed. There is no guarantee that you will **ever** match all 11 probabilities at once; in fact, the various properties of statistics predict enough chaos that you will circle around the desired values for a **very** long time before stumbling on **exactly** the expected distribution. — Prune, Sep 30 '15 at 18:08

score 1 · Accepted Answer · edited May 23 '17 at 10:27

There are several problems here.

Firstly, there is no guarantee that this will ever terminate, nor is it particularly likely to terminate in a reasonable amount of time. Ignoring floating point arithmetic issues, this should only terminate when your numbers are distributed exactly right. But the law of large numbers does not guarantee this will ever happen. The law of large numbers works like this:

Your initial results are (by random chance) almost certainly biased one way or another.
Eventually, the trials not yet performed will greatly outnumber your initial trials, and the lack of bias in those later trials will outweigh your initial bias.

Notice that the initial bias is never counterbalanced. Rather, it is dwarfed by the rest of the results. This means the bias tends to zero, but it does not guarantee the bias actually vanishes in a finite number of trials. Indeed, it specifically predicts that progressively smaller amounts of bias will continue to exist indefinitely. So it would be entirely possible that this algorithm never terminates, because there's always that tiny bit of bias still hanging around, statistically insignificant, but still very much there.

That's bad enough, but you're also working with floating point, which has its own issues; in particular, floating point arithmetic violates lots of conventional rules of math because the computer keeps doing intermediate rounding to ensure the numbers continue to fit into memory, even if they are repeating (in base 2) or irrational. The fact that you are rounding the empirical percents to three decimal places doesn't actually fix this, because not all terminating decimals (base 10) are terminating binary values (base 2), so you may still find mismatches between your empirical and expected values. Instead of doing this:

if emp_percent == expected:
    break

...you might try this (in Python 3.5+ only):

if all(map(math.is_close, emp_percent, expected)):
    break

This solves both problems at once. By default, math.is_close() requires the values to be within (about) 9 decimal places of one another, so it inserts the necessary give for this algorithm to actually have a chance of working. Note that it does require special handling for comparisons involving zero, so you may need to tweak this code for your use case, like this:

is_close = functools.partial(math.is_close, abs_tol=1e-9)
if all(map(is_close, emp_percent, expected)):
    break

math.is_close() also removes the need to round your empiricals, since it can do this approximation for you:

is_close = functools.partial(math.is_close, rel_tol=1e-3, abs_tol=1e-5)
if all(map(is_close, emp_percent, expected)):
    break

If you really don't want these approximations, you will have to give up floating point and work with fractions exclusively. They produce exact results when divided by one another. However, you still have the problem that your algorithm is unlikely to terminate quickly (or perhaps at all), for the reasons discussed above.

Good point about the law of large numbers. In fact you can say more -- the probability that the sample distribution will match the empirical distribution exactly tends to zero as the sample size tends to infinity. It is fairly common that you will get 2 heads on 4 flips of a fair coin. It is vanishingly unlikely that you will get 500,000 heads on 1,000,000 flips. — John Coleman, Sep 30 '15 at 18:59
@JohnColeman: While that certainly seems intuitive to me, I've learned [the hard way](https://en.wikipedia.org/wiki/Monty_Hall_problem) not to trust intuition in matters of probability. Remember, OP isn't just doing 1M flips and checking for 500k heads. They are checking for the number of heads *after each flip* and exiting as soon as the numbers match up. The increased number of trials may (or may not) raise the probability enough to matter. — Kevin, Sep 30 '15 at 19:02
Good point, though it is consistent with what I said. The probability drops towards zero but on the other hand you have more and more trials. Eventually it will happen (and happen infinitely often) but the expected additional waiting time for the first such occurrence (given that it hasn't happened yet) grows rather than shrinks with time. For a six-sided die, perfectly representative samples must be even rarer. My hunch is that if it doesn't happen early it isn't feasible to wait for it to happen. For the sum of a pair of dice -- it is even worse. — John Coleman, Sep 30 '15 at 20:14

John Coleman · Answer 2 · 2015-10-01T11:00:43.133

Rather than trying to match floating point numbers -- you could try to match expected values for each possible sum. This is equivalent to what you are trying to do since (observed number)/(number of trials) == (theoretical probability) if and only if the observed number equals the expected number. These will always be an integer exactly when the number of rolls is a multiple of 36. Hence, if the number of rolls is not a multiple of 36 then it is impossible for your observations to equal expectations exactly.

To get the expected values, note that the numerators that appear in the exact probabilities of the various sums (1,2,3,4,5,6,5,4,3,2,1 for the sums 2,3,..., 12 respectively) are the expected values for the sums if the dice are rolled 36 times. If the dice are rolled 36i times then multiply these numerators by i to get the expected values of the sums. The following code simulates repeatedly rolling a pair of fair dice 36 times, accumulating the total counts and then comparing them with the expected counts. If there is a perfect match, the number of trials (where a trial is 36 rolls) needed to get the match is returned. If this doesn't happen by max_trials, a vector showing the discrepancy between the final counts and final expected value is given:

import random

def roll36(counts):
    for i in range(36):
        r1 = random.randint(1,6)
        r2 = random.randint(1,6)
        counts[r1+r2 - 2] += 1

def match_expected(max_trials):
    counts = [0]*11
    numerators = [1,2,3,4,5,6,5,4,3,2,1]
    for i in range(1, max_trials+1):
        roll36(counts)
        expected = [i*j for j in numerators]
        if counts == expected:
            return i
    #else:
    return [c-e for c,e in zip(counts,expected)]

Here is some typical output:

>>> match_expected(1000000)
[-750, 84, 705, -286, 5783, -3504, -1208, 1460, 543, -1646, -1181]

Not only have the exact expected values never been observed in 36 million simulated rolls of a pair of fair dice, in the final state the discrepancies between observations and expectations have become quite large (in absolute value -- the relative discrepancies are approaching zero, as the law of large numbers predicts). This approach is unlikely to ever yield a perfect match. A variation that would work (while still focusing on expected numbers) would be to iterate until the observations pass a chi-squared goodness of fit test when compared with the theoretical distribution. In that case there would no longer be any reason to focus on multiples of 36.

More efficient simluation of 2 dice rolls - Python

2 Answers2