I'm working on a new version of git-stats
a tool to make some graphs based on Git commits, authors etc.
In the current version it accepts identical commit ids, namespacing the project names:
{
"some-project-url" { "hash1": "date", ... }
"some-project-url-fork" { "hash1": "date", ..., "commit-in-fork-id": "date" }
}
I want to remove the requirement of storing the project url, that means not have identical hashes.
Now I'm thinking if this is a good move.
When multiple projects are imported and each commit is stored once, what is the probability to have two identical ids?
Actually, in the real life, when does it happen to have two identical ids (in two different projects)?