I'm doing something like detect website languages. It's about 1k websites so I can't do it 1 by 1. I think to do so I need to get website html code by python and get all the texts, do analyze. Is it correct, has anyone did something like this it would be very helpful :)
Asked
Active
Viewed 78 times
-4
-
SO is not a code writing service. Kindly show your attempts and any errors you might face. – sgp Jun 09 '15 at 08:56
-
http://stackoverflow.com/questions/4605062/detecting-whether-or-not-text-is-english-in-bulk this might help – Ajay Jun 09 '15 at 08:59
-
Do you mean the code, or the content? What about sites that support multiple languages/i18n? – jonrsharpe Jun 09 '15 at 09:09