i wish to design an application that can detect plagiarism ( web based ) in all formats using Google search engine API. what are the resources that i would be requiring for such an application. Basically a user can upload a file and that file would be checked against various formats on the web . i have read that there are web crawlers but how exactly do we use them. Is Google search appliance API the correct approach?
Also I have not used a Google API before so what would be the best way to start?
Thanks a lot