The closest existing question I have found is this or this
I would like to write a function or class that accepts a string and then based on whatever criteria can be programmed into it will return the probability that it is a real human name. At the moment I would expect it to be heavily biased toward English or European names or English transliterations of other names. (for example, "bob", "bob smith", and "smith" should all return 1.0 and "sfgoisxdzzg" should return something like .001 or even .0000001)
Does anyone know if this is already done / being done? (even if in another language) My first thought was that I'd have to do some sort of machine learning script. My problem with that is my complete ignorance of any machine learning theory.
So, the second part of my question is this: Is machine learning a viable option for tackling this problem? If so, what resources should I start with to learn how to do it? IF not, can you point me in the right direction?