Algorithm to identify a unique free polyomino (or polyomino hash)

Question

In short: How to hash a free polyomino?

This could be generalized into: How to efficiently hash an arbitrary collection of 2D integer coordinates, where a set contains unique pairs of non-negative integers, and a set is considered unique if and only if no translation, rotation, or flip can map it identically to another set?

For impatient readers, please note I'm fully aware of a brute force approach. I'm looking for a better way -- or a very convincing proof that no other way can exist.

I'm working on some different algorithms to generate random polyominos. I want to test their output to determine how random they are -- i.e. are certain instances of a given order generated more frequently than others. Visually, it is very easy to identify different orientations of a free polyomino, for example the following Wikipedia illustration shows all 8 orientations of the "F" pentomino (Source):

The F pentimino

How would one put a number on this polyomino - that is, hash a free polyomino? I don't want to depend on a prepolulated list of "named" polyominos. Broadly agreed-upon names only exists for orders 4 and 5, anyway.

This is not necessarily equavalent to enumerating all free (or one-sided, or fixed) polyominos of a given order. I only want to count the number of times a given configuration appears. If a generating algorithm never produces a certain polyomino it will simply not be counted.

The basic logic of the counting is:

testcount = 10000 // Arbitrary
order = 6         // Create hexominos in this test
hashcounts = new hashtable
for i = 1 to testcount
    poly = GenerateRandomPolyomino(order)
    hash = PolyHash(poly)
    if hashcounts.contains(hash) then  
        hashcounts[hash]++
    else
        hashcounts[hash] = 1

What I'm looking for is an efficient PolyHash algorithm. The input polyominos are simply defined as a set of coordinates. One orientation of the T tetronimo could be, for example:

[[1,0], [0,1], [1,1], [2,1]]:

 |012
-+---
0| X
1|XXX

You can assume that that input polyomino will already be normalized to be aligned against the X and Y axes and have only positive coordinates. Formally, each set:

Will have at least 1 coordinate where the x value is 0
Will have at least 1 coordinate where the y value is 0
Will not have any coordinates where x < 0 or y < 0

I'm really looking for novel algorithms that avoid the increasing number of integer operations required by a general brute force approach, described below.

Brute force

A brute force solution suggested here and here consists of hashing each set as an unsigned integer using each coordinate as a binary flag, and taking the minimum hash of all possible rotations (and in my case flips), where each rotation / flip must also be translated to the origin. This results in a total of 23 set operations for each input set to get the "free" hash:

Rotate (6x)
Flip (1x)
Translate (7x)
Hash (8x)
Find minimum of computed hashes (1x)

Where the sequence of operations to obtain each hash is:

Hash
Rotate, Translate, Hash
Rotate, Translate, Hash
Rotate, Translate, Hash
Flip, Translate, Hash
Rotate, Translate, Hash
Rotate, Translate, Hash
Rotate, Translate, Hash

Are you sure there are 8 combintaions for F? Are you sure there aren't 4 for F and 4 for F'? — corsiKa, Aug 17 '12 at 17:22
@corsiKa. Should be 8, it is isomorphic with respect to reflection. — cmh, Aug 17 '12 at 17:24
@corsiKa: That's just a difference in nomenclature. You are defining `F` as one of the _one-sided_ orientations (chiralities) and `F'` as the other. Both chiralities are instances of the same _free_ polyomino. — Joshua Honig, Aug 17 '12 at 17:25
@jmh_gr I'm saying that if I had on a table an F piece, I couldn't turn it into an F' without picking it up and flipping it over. I wouldn't have considered them equal, but that's probably because I'm thinking more of the game-rules of dominos than I am the mathematical aspect of the objects. — corsiKa, Aug 17 '12 at 17:31
@corsiKa : They are definitely both valid mathematical ways of defining the object. They even have special names: 'free' and 'one-sided'. In my question I just stated that I'm looking to hash free polyominos. __THAT SAID__: if you have a solution that efficiently hashes one-sided sets, please do post it! It would still be 4x better than the brute force method! — Joshua Honig, Aug 17 '12 at 17:41
"avoid the exponentially increasing number of integer operations" -- It seems that the number of operations would be proportional to the tile size, or am I missing something? — Tom Sirgedas, Aug 17 '12 at 17:59
I doubt a faster approach exists than brute force. But, consider checking if a polyomino exists in the hash table using only "step 1". When you add an entry to your hash table, use add all 8 orientation. This will use more memory, but should be faster for most cases, I think. — Tom Sirgedas, Aug 17 '12 at 18:03
@TomSirgedas That's a good point. I guess I was thinking the size of the hash value itself must increase exponentially with the order, since a "complete" grid of n-order would have `n*n` squares and thus require `n*n` bits to uniquely represent. Further thoughts exceed appropriate comment length, but suffice to say a 144-bit hash for a 12-polyomino (really only 76 are required...) does not require exponentially more _calculations_... question edited accordingly. — Joshua Honig, Aug 17 '12 at 18:12
@jmh_gr Encoding the polyomino idea: choose which hash-table to use based on bounding box dimensions. Then, 12-omino uses 42 bits at most (7x6). OR, encode DFS traversal: 33 bits for any 12-omino (33=(12-1)*3) (scales nicely). I'll spell this out in an answer if you're interested. — Tom Sirgedas, Aug 17 '12 at 19:46
How do you chose a DFS root which is invariant under rotation, reflection etc? — cmh, Aug 17 '12 at 19:50
@cmh: I don't, I'm just suggesting a different encoding ("hash") to use for the brute force method. — Tom Sirgedas, Aug 17 '12 at 19:56

Joshua Honig · Accepted Answer · 2012-08-20T15:43:34.960

Well, I came up with a completely different approach. (Also thanks to corsiKa for some helpful insights!) Rather than hashing / encoding the squares, encode the path around them. The path consists of a sequence of 'turns' (including no turn) to perform before drawing each unit segment. I think an algorithm for getting the path from the coordinates of the squares is outside the scope of this question.

This does something very important: it destroys all location and orientation information, which we don't need. It is also very easy to get the path of the flipped object: you do so by simply reversing the order of the elements. Storage is compact because each element requires only 2 bits.

It does introduce one additional constraint: the polyomino must not have fully enclosed holes. (Formally, it must be simply connected.) Most discussions of polyominos consider a hole to exist even if it is sealed only by two touching corners, as this prevents tiling with any other non-trivial polyomino. Tracing the edges is not hindered by touching corners (as in the single heptomino with a hole), but it cannot leap from one outer loop to an inner one as in the complete ring-shaped octomino:

enter image description here

It also produces one additional challenge: finding the minumum ordering of the encoded path loop. This is because any rotation of the path (in the sense of string rotation) is a valid encoding. To always get the same encoding we have to find the minimal (or maximal) rotation of the path instructions. Thankfully this problem has already been solved: see for example http://en.wikipedia.org/wiki/Lexicographically_minimal_string_rotation.

Example:

If we arbitrarily assign the following values to the move operations:

No turn: 1
Turn right: 2
Turn left: 3

Here is the F pentomino traced clockwise:

enter image description here

An arbitrary initial encoding for the F pentomino is (starting at the bottom right corner):

2,2,3,1,2,2,3,2,2,3,2,1

The resulting minimum rotation of the encoding is

1,2,2,3,1,2,2,3,2,2,3,2

With 12 elements, this loop can be packed into 24 bits if two bits are used per instruction or only 19 bits if instructions are encoded as powers of three. Even with the 2-bit element encoding can easily fit that in a single unsigned 32 bit integer 0x6B6BAE:

   1- 2- 2- 3- 1- 2- 2- 3- 2- 2- 3- 2
= 01-10-10-11-01-10-10-11-10-10-11-10
= 00000000011010110110101110101110
= 0x006B6BAE

The base-3 encoding with the start of the loop in the most significant powers of 3 is 0x5795F:

    1*3^11 + 2*3^10 + 2*3^9 + 3*3^8 + 1*3^7 + 2*3^6 
  + 2*3^5  + 3*3^4  + 2*3^3 + 2*3^2 + 3*3^1 + 2*3^0
= 0x0005795F

The maximum number of vertexes in the path around a polyomino of order n is 2n + 2. For 2-bit encoding the number of bits is twice the number of moves, so the maximum bits needed is 4n + 4. For base-3 encoding it's:

Base 3 Encoded max bits

Where the "gallows" is the ceiling function. Accordingly any polyomino up to order 9 can be encoded in a single 32 bit integer. Knowing this you can choose your platform-specific data structure accordingly for the fastest hash comparison given the maximum order of the polyominos you'll be hashing.

you can condense it even more - starting with the top right corner, you have `2,1,2,2,3,1,2,2,3,2,2,3` - that is to say you only need 1 entry for each coordinate. A `1` only needs to be used in the situation where you have three colinear elements, as on the top and the bottom left side. But I do like this approach. — corsiKa, Aug 18 '12 at 23:16
@corsiKa That's a really good point. You only need record what to do _at each vertex_ since the move in between is a given. — Joshua Honig, Aug 19 '12 at 00:42
I also think this is going to fail for the flipped versions. But I think you can get around this by testing whether or not the first non-1 value is a 2. If it isn't a 2, swap all 2 and 3 values. This should effectively flip the poly's encoding string to the 'other side'. — corsiKa, Aug 19 '12 at 03:25
Also, if you're willing to div/mod instead of bitshift, you can fit 40 pairs instead of 31 using 4 bits. You can always take the min-length encoding, and continue to rotate it until a non-1 is hit. The sign bit can represent 2 or 3, depending. Of course, you could also just leave it in a string format and let whatever string framework you use handle the hashcodes and equality functionality. It's not a significant cost, really. — corsiKa, Aug 19 '12 at 04:19
@corsiKa I was thinking that too, as with only 3 possibile values there is 25% wasted space in a 2-bit number. However, I was pondering the use of a fourth value to encode something like "pick up the 'pen'" for encoding polyominoes with holes... — Joshua Honig, Aug 19 '12 at 18:46
@corsiKa I revised my answer to reflect your insights about condensing the encoding. — Joshua Honig, Aug 19 '12 at 19:54

corsiKa · Answer 2 · 2012-08-17T20:20:27.397

You can reduce it down to 8 hash operations without the need to flip, rotate, or re-translate.

Note that this algorithm assumes you are operating with coordinates relative to itself. That is to say it's not in the wild.

Instead of applying operations that flip, rotate, and translate, instead simply change the order in which you hash.

For instance, let us take the F pent above. In the simple example, let us presume the hash operation was something like this:

int hashPolySingle(Poly p)
    int hash = 0
    for x = 0 to p.width
        fory = 0 to p.height
            hash = hash * 31 + p.contains(x,y) ? 1 : 0
    hashPolySingle = hash

int hashPoly(Poly p)
    int hash = hashPolySingle(p)
    p.rotateClockwise() // assume it translates inside
    hash = hash * 31 + hashPolySingle(p)
    // keep rotating for all 4 oritentations
    p.flip()
    // hash those 4

Instead of applying the function to all 8 different orientations of the poly, I would apply 8 different hash functions to 1 poly.

int hashPolySingle(Poly p, bool flip, int corner)
    int hash = 0
    int xstart, xstop, ystart, ystop
    bool yfirst
    switch(corner)
        case 1: xstart = 0
                xstop = p.width
                ystart = 0
                ystop = p.height
                yfirst = false
                break
        case 2: xstart = p.width
                xstop = 0
                ystart = 0
                ystop = p.height
                yfirst = true
                break
        case 3: xstart = p.width
                xstop = 0
                ystart = p.height
                ystop = 0
                yfirst = false
                break
        case 4: xstart = 0
                xstop = p.width
                ystart = p.height
                ystop = 0
                yfirst = true
                break
        default: error()
    if(flip) swap(xstart, xstop)
    if(flip) swap(ystart, ystop)

    if(yfirst)
        for y = ystart to ystop
            for x = xstart to xstop
                hash = hash * 31 + p.contains(x,y) ? 1 : 0
    else
        for x = xstart to xstop
            for y = ystart to ystop
                hash = hash * 31 + p.contains(x,y) ? 1 : 0
    hashPolySingle = hash

Which is then called in the 8 different ways. You could also encapsulate hashPolySingle in for loop around the corner, and around the flip or not. All the same.

int hashPoly(Poly p)
    // approach from each of the 4 corners
    int hash = hashPolySingle(p, false, 1)
    hash = hash * 31 + hashPolySingle(p, false, 2)
    hash = hash * 31 + hashPolySingle(p, false, 3)
    hash = hash * 31 + hashPolySingle(p, false, 4)
    // flip it
    hash = hash * 31 + hashPolySingle(p, true, 1)
    hash = hash * 31 + hashPolySingle(p, true, 2)
    hash = hash * 31 + hashPolySingle(p, true, 3)
    hash = hash * 31 + hashPolySingle(p, true, 4)
    hashPoly = hash

In this way, you're implicitly rotating the poly from each direction, but you're not actually performing the rotation and translation. It performs the 8 hashes, which seem to be entirely necessary in order to accurately hash all 8 orientations, but wastes no passes over the poly that are not actually doing hashes. This seems to me to be the most elegant solution.

Note that there may be a better hashPolySingle() algorithm to use. Mine uses a Cartesian exhaustion algorithm that is on the order of O(n^2). Its worst case scenario is an L shape, which would cause there to be an N/2 * (N-1)/2 sized square for only N elements, or an efficiency of 1:(N-1)/4, compared to an I shape which would be 1:1. It may also be that the inherent invariant imposed by the architecture would actually make it less efficient than the naive algorithm.

My suspicion is that the above concern can be alleviated by simulating the Cartesian exhaustion by converting the set of nodes into an bi-directional graph that can be traversed, causing the nodes to be hit in the same order as my much more naive hashing algorithm, ignoring the empty spaces. This will bring the algorithm down to O(n) as the graph should be able to be constructed in O(n) time. Because I haven't done this, I can't say for sure, which is why I say it's only a suspicion, but there should be a way to do it.

@cmh The question uses 8 hashes, 7 translates, 6 rotates and a flip. My answer uses only 8 hashes, with none of the extra translates, rotates, or flips. — corsiKa, Aug 17 '12 at 18:54
Ok so you've removed the ancillary operations. Neat approach. — cmh, Aug 17 '12 at 18:56
If you're interested, I posted a completely different approach to solving this. — Joshua Honig, Aug 18 '12 at 19:55

score 3 · Answer 3 · answered Sep 01 '12 at 00:35

Here's my DFS (depth first search) explained:

Start with the top-most cell (left-most as a tiebreaker). Mark it as visited. Every time you visit a cell, check all four directions for unvisited neighbors. Always check the four directions in this order: up, left, down, right.

Example

enter image description here

In this example, up and left fail, but down succeeds. So far our output is 001, and we recursively search the "down" cell.

We mark our new current cell as visited (and we'll finish searching the original cell when we finish searching this cell). Here, up=0, left=1.

We search the left-most cell and there are no unvisted neighbors (up=0, left=0, down=0, right=0). Our total output so far is 001010000.

enter image description here

We continue our search of the second cell. down=0, right=1. We search the cell to the right.

enter image description here

up=0, left=0, down=1. Search the down cell: all 0s. Total output so far is 001010000010010000. Then, we return from the down cell...

enter image description here

right=0, return. return. (Now, we are at the starting cell.) right=0. Done!

So, the total output is 20 (N*4) bits: 00101000001001000000.

Encoding improvement

But, we can save some bits.

The last visited cell will always encode 0000 for its four directions. So, don't encode the last visited cell to save 4 bits.

Another improvement: if you reached a cell by moving left, don't check that cells right-side. So, we only need 3 bits per cell, except 4 bits for the first cell, and 0 for the last cell.

The first cell will never have an up, or left neighbor, so omit these bits. So the first cell takes 2 bits.

So, with these improvements, we use only N*3-4 bits (e.g. 5 cells -> 11 bits; 9 cells -> 23 bits).

If you really want, you can compact a little more by noting that exactly N-1 bits will be "1".

Caveat

Yes, you'll need to encode all 8 rotations/flips of the polyomino and choose the least to get a canonical encoding.

I suspect this will still be faster than the outline approach. Also, holes in the polyomino shouldn't be a problem.

score 1 · Answer 4 · answered Aug 17 '12 at 17:29

1

I worked on the same problem recently. I solved the problem fairly simply by (1) generate a unique ID for a polyomino, such that each identical poly would have the same UID. For example, find the bounding box, normalize the corner of the bounding box, and collect the set of non-empty cells. (2) generate all possible permutations by rotating (and flipping, if appropriate) a polyomino, and look for duplicates.

The advantage of this brute approach, other than it's simplicity, is that it still works if the polys are distinguishable in some other way, for example if some of them are colored or numbered.

answered Aug 17 '12 at 17:29

ddyer

1,792
19
26

So you're saying there isn't a more elegant solution than brute forcing? – corsiKa Aug 17 '12 at 17:34
1

Hi ddyer - I appreciate the input. However, I have already described this brute force approach in my question. My question is : is there a better way? As an analogy, I can calculate a triangular number by brute force by adding up 1..n, but I can get it much faster from n*(n+1)/2. – Joshua Honig Aug 17 '12 at 17:48
Correct, I'm not aware of a better way, especially with the added condition in my second paragraph. – ddyer Aug 17 '12 at 17:57

score 1 · Answer 5 · answered Aug 17 '12 at 20:39

1

You can set up something like a trie to uniquely identify (and not just hash) your polyomino. Take your normalized polyomino and set up a binary search tree, where the root branches on whether (0,0) is has a set pixel, the next level branches on whether (0,1) has a set pixel, and so on. When you look up a polyomino, simply normalize it and then walk the tree. If you find it in the trie, then you're done. If not, assign that polyomino a unique id (just increment a counter), generate all 8 possible rotations and flips, then add those 8 to the trie.

On a trie miss, you'll have to generate all the rotations and reflections. But on a trie hit it should cost less (O(k^2) for k-polyominos).

To make lookups even more efficient, you could use a couple bits at a time and use a wider tree instead of a binary tree.

answered Aug 17 '12 at 20:39

Keith Randall

22,985
2
35
54

Can you expand a little more on how you would use a trie to encode a 2D structure? While a search for "2D trie" returns some very dense CS papers, I'm most familiar with using a trie to collapse a one-dimensional object (the classic example being a 'word' - a list of letters) – Joshua Honig Aug 17 '12 at 20:56
I think all the "hash" functions mentioned so far actually do fully encode the polyomino. "Take your normalized polyomino..." -- this seems unnecessary if all 8 reflection/flips of all found polyominos are stored in the trie. – Tom Sirgedas Aug 17 '12 at 21:03
I think my DFS suggestion pairs well with this approach. Instead of branching on k*k pixels, branch on the 3*(k-1) edges that a DFS would traverse. Though, I'm not sure if a hash table or trie would be better. – Tom Sirgedas Aug 17 '12 at 21:09
@TomSirgedas: you still need to normalize the polyomino by translating it as the OP describes. – Keith Randall Aug 17 '12 at 22:27
@jmh_gr: You can visit the pixels in any order you like - the 2d-ness is irrelevant. So you could do `(0,0),...,(0,k-1),(1,0),...,(1,k-1),...,(k-1,k-1)`. I might use a zig-zag pattern, as most of the interesting stuff would be in the upper left (low indexes). – Keith Randall Aug 17 '12 at 22:29

score 0 · Answer 6 · answered Aug 17 '12 at 17:28

0

A valid hash function, if you're really afraid of hash collisions, is to make a hash function x + order * y for coordinates and then loop trough all the coordinates of a piece, adding (order ^ i) * hash(coord[i]) to the piece hash. That way, you can guarantee you won't get any hash collisions.

answered Aug 17 '12 at 17:28

Pieter Bos

1,554
1
12
20

This doesn't guarantee a unique hash when you rotate/reflect. – cmh Aug 17 '12 at 17:30
@cmh this is only an implementation of a hash function of an arbitrary polyomino, you just have to brute-force all rotations and flips. – Pieter Bos Aug 17 '12 at 17:32
Exactly, the asker knows the brute force solution and is asking for something more elegant. – cmh Aug 17 '12 at 17:32
That's a good point for uniqueness. My primary concern, however, is in reducing the number of calculations required for brute-force calculating and comparing the hash for 8 different orientations of a given set. – Joshua Honig Aug 17 '12 at 17:35

Algorithm to identify a unique free polyomino (or polyomino hash)

6 Answers6

Example

Encoding improvement

Caveat

Linked