6

I want to develop a plagiarism checker that compares several source code files, but I couldn't find any proper source code or even a resource to get an idea about it. I have checked Boss2, which was useless for this: its authors claim to use the Sherlock module for detecting plagiarism, but no such tool seems to be included in Boss2.

If any open-source detection tool is available for checking source code, please let me know. Regards

Vahid Hashemi
  • These guys offer a commercial product that includes that capability, FWIW: http://en.wikipedia.org/wiki/Black_Duck_Software – Paul Jackson Dec 25 '11 at 04:32
  • https://github.com/architshukla/Plagiarism-Checker – Pandya May 07 '18 at 09:27
  • For those who end up here, I have also listed the available FOSS plagiarism checkers [here](https://www.reddit.com/r/opensource/comments/8xgldr/any_freeopensource_software_for_plagiarism/) – Foad S. Farimani Feb 10 '19 at 11:47
  • Add your comment as an answer please – Vahid Hashemi Feb 10 '19 at 17:30
  • This master thesis provides an overview of tools and methods in section 2.2: https://dspace.library.uu.nl/bitstream/handle/1874/355678/source-code-plagiarism.pdf?sequence=2&isAllowed=y – S. Huber Dec 03 '19 at 16:12

3 Answers

5

I'm aware of open-source plagiarism detectors for text (e.g., WCopyFind), but not code.

I couldn't find... even a resource to get an idea about it.

The authors of the excellent closed-source tool MOSS have published a helpful paper about the technology.
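
As far as I know, the paper in question is "Winnowing: Local Algorithms for Document Fingerprinting" (Schleimer, Wilkerson, and Aiken, SIGMOD 2003). To give a rough idea of the approach, here is a minimal Python sketch of k-gram hashing followed by winnowing. It is only an illustration, not MOSS's actual implementation: the function names, the whitespace-only normalization, the SHA-1 based hash, the default `k` and `window` values, and the Jaccard-style score are all my own assumptions.

```python
import hashlib
import re


def normalize(source: str) -> str:
    """Lowercase and drop all whitespace so pure formatting changes do not matter."""
    return re.sub(r"\s+", "", source.lower())


def kgram_hashes(text: str, k: int) -> list:
    """Hash every substring of length k (a k-gram) to a 32-bit integer."""
    return [
        int.from_bytes(hashlib.sha1(text[i:i + k].encode("utf-8")).digest()[:4], "big")
        for i in range(len(text) - k + 1)
    ]


def winnow(hashes: list, window: int) -> set:
    """Select the minimum hash of each window (rightmost on ties) as (hash, position)."""
    selected = set()
    for start in range(len(hashes) - window + 1):
        chunk = hashes[start:start + window]
        smallest = min(chunk)
        # Position of the rightmost occurrence of the minimum inside this window.
        offset = window - 1 - chunk[::-1].index(smallest)
        selected.add((smallest, start + offset))
    return selected


def fingerprint(source: str, k: int = 5, window: int = 4) -> set:
    """Return the set of selected hash values for one document."""
    hashes = kgram_hashes(normalize(source), k)
    return {h for h, _ in winnow(hashes, window)}


def similarity(a: str, b: str, k: int = 5, window: int = 4) -> float:
    """Jaccard overlap of two fingerprint sets (an illustrative score, not MOSS's)."""
    fa, fb = fingerprint(a, k, window), fingerprint(b, k, window)
    if not fa or not fb:
        # Inputs too short to produce any fingerprint are treated as dissimilar.
        return 0.0
    return len(fa & fb) / len(fa | fb)
```

Calling `similarity(file_a_text, file_b_text)` gives a score between 0 and 1. A real checker would tokenize the code first (so that renamed variables still match) and would report the matching regions rather than just a number.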

Norman Ramsey
2

I know the question is old, but I landed here from a Google search.

Sherlock is an open source plagiarism detector. Sherlock's home page is here

Kevin Tindall
0

I wrote SimiCheck, and you are welcome to use it. If you are interested in an API, I could probably write one very quickly. I wrote the original algorithm as part of the CrowdGrader peer-grading tool, but then I decided to make the comparison tools available independently. SimiCheck can handle code, Word (.docx), html, pdf, text, ..., as well as .zip, .tar, .gz, .tgz, and some more formats, and can deal with variable renaming, code moves, code across multiple files, etc.

Luca