I have a set of text files in a particular domain. I need to rank the files based on some metric.
Please help me out with a few metrics that can be used to rank my text files (term frequency, size, frequency of use, etc..). I would then like to use text mining techniques to rank the files based on one of these techniques.