As an educational project in machine learning, I was thinking of creating a voice identification system from scratch. It should be able to identify a speaker from their voice after having been trained on samples of that voice.
What approach should I take in tackling this challenge? Specifically, how would such a system work at a high level?
Any advice would be appreciated :)