I need to remove duplicate paragraphs from a text that contains many paragraphs.
I use java.security.MessageDigest to calculate each paragraph's MD5 hash value, and then add these hash values to a Set.
If add() returns false, the hash is already in the Set, which means the latest paragraph is a duplicate.
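For reference, the approach described above could be sketched roughly as follows. This is a minimal illustration, not my exact code: the class name ParagraphDedup and the method removeDuplicates are hypothetical, and it assumes the paragraphs are already split into a List<String> and encoded as UTF-8. Note that Set.add() returns true only when the element was not already present, and that a raw byte[] does not compare by content in a HashSet, so the sketch wraps each digest in a ByteBuffer.

```java
import java.nio.ByteBuffer;
import java.nio.charset.StandardCharsets;
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;
import java.util.ArrayList;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

public class ParagraphDedup {

    // Hypothetical helper: returns the paragraphs in their original order,
    // keeping only the first occurrence of each distinct MD5 hash.
    public static List<String> removeDuplicates(List<String> paragraphs) {
        MessageDigest md5;
        try {
            md5 = MessageDigest.getInstance("MD5");
        } catch (NoSuchAlgorithmException e) {
            throw new IllegalStateException("MD5 is not available", e);
        }

        // ByteBuffer implements content-based equals()/hashCode(),
        // unlike byte[], which compares by identity.
        Set<ByteBuffer> seen = new HashSet<>();
        List<String> unique = new ArrayList<>();

        for (String paragraph : paragraphs) {
            // digest(byte[]) hashes the input and resets the digest for reuse.
            byte[] hash = md5.digest(paragraph.getBytes(StandardCharsets.UTF_8));

            // add() returns true when the hash was not seen before,
            // and false when it is already in the set (a duplicate).
            if (seen.add(ByteBuffer.wrap(hash))) {
                unique.add(paragraph);
            }
        }
        return unique;
    }
}
```

With this sketch, two different paragraphs that happened to share the same MD5 value would be treated as duplicates, which is the kind of risk I am asking about.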
Is there any risk in this approach?
Apart from String.equals(), is there any other way to do it?