Questions tagged [plagiarism-detection]

74 questions
21
votes
1 answer

How do I check source-code-homework for plagiarism?

How do I tell that two source-codes (independent of their language C,Java,Lisp...) have strong indications that they could be plagiarism of each other? Background: I going to give my first seminar on computer languages. We have prepared small…
math
  • 8,514
  • 10
  • 53
  • 61
17
votes
6 answers

Similar code detector

I'm search for a tool that could compare source codes for similarity. We have a very trivial system right now that has huge amount of false positives and the real positives can easily get buried in them. My requirements are: reasonably small amount…
Šimon Tóth
  • 35,456
  • 20
  • 106
  • 151
13
votes
3 answers

Should I commit all my computer science homework assignments to GitHub?

After reading a community wiki on Quora, I decided it would be good to start experimenting with GitHub. I thought, "What a better way to experiment than with introductory computer science homework?" However this practice opens up my solutions to the…
Elliott
  • 360
  • 1
  • 4
  • 15
9
votes
5 answers

Can Git detect if two source files are essentially copies of each others?

Sorry if this is off-topic, but here is your chance to reduce the amount of "homework" questions on this site :-) I'm teaching a class of C programming where the students work on a small library of numeric routines in C. This year, the source files…
lindelof
  • 34,556
  • 31
  • 99
  • 140
9
votes
4 answers

Checking for code plagiarism with JavaScript

I was wondering how I can detect code plagiarism with Javascript. I want to test assignment submissions for homework I'm going to hand out. I've looked at using MOSS, but—from what I've heard—it's pretty poor for anything other than C.…
Alex Ciminian
  • 11,398
  • 15
  • 60
  • 94
8
votes
1 answer

Plagiarism detection - winnowing algorithm - fingerprints clash

I write application for plagiarism detection in big text files. After reading many articles about it i decided to use Winnowing algorithm (with Karp-Rabin rolling hash function), but i have some problems with it. Data: I have two simple text files -…
Blood
  • 4,126
  • 3
  • 27
  • 37
6
votes
3 answers

is there any working/real open source Plagiarism checker available?

I want to develop a plagiarism checker for checking several source codes but I couldn't find any proper source code or even a resource to get an idea about it. I have checked the Boss2 which is useless. they claim that they use Sherlock module for…
Vahid Hashemi
  • 5,182
  • 10
  • 58
  • 88
6
votes
4 answers

Variable renaming for plagiarism detection for C/C++

I have a couple of simple C++ homeworks and I know the students shared code. These are smart students and they know how to cheat moss. I'm looking for a tool that can rename variables based on their types (first variable of type int will be int1,…
perreal
  • 94,503
  • 21
  • 155
  • 181
6
votes
5 answers

Copyright content API

I am looking to see if there is an automatic "copyright content" API that we can use. I know atrributor have a paid service, but I'm wondering if their is something that effectively does a google search for a portion of the content, to check if…
Lizard
  • 43,732
  • 39
  • 106
  • 167
5
votes
1 answer

Replacing a word by synonyms in Haskell

I was going through this plagiarism detector and trying to write a program in Haskell which will read a file and replace some of its words with synonyms. Is there any dictionary available for this purpose in Haskell? Also, if you have any input…
keep_learning
  • 1,057
  • 5
  • 14
5
votes
3 answers

Plagiarism Analyzer (compared against Web Content)

Hi everyone all over the world, Background I am a final year student of Computer Science. I've proposed my Final Double Module Project which is a Plagiarism Analyzer, using Java and MySQL. The Plagiarism Analyzer will: Scan all the paragraphs of…
Mr CooL
  • 1,529
  • 8
  • 23
  • 27
5
votes
0 answers

Fuzzy Matching a paragraph within a large text

I need to do a pretty complex matching of phrases. I have large bodies of text in files which exceed 1000 words each. The phrases I am searching for (searchphrase) are like this: Investment does not mean: i. Claims to money that arise solely…
shoi
  • 167
  • 1
  • 3
  • 7
4
votes
5 answers

How do I extract significant text content from a LaTeX document

I need to extract text-only content from my thesis document written in LaTeX for an automated anti-plagiarism check. I know only about the "draft" option and it's not enough. I am supposed to omit: images, tables and other…
odiroot
  • 53
  • 1
  • 5
3
votes
2 answers

How to obfuscate C++ variables and functions

I'm trying to do some algorithm comparison for plagiarism. I've found many TEXT comparison for plagiarism. But in an algorithm it's very different. Let's say that some algorithm uses an huge number of variables, functions and user defined…
IPValverde
  • 2,019
  • 2
  • 21
  • 38
3
votes
1 answer

Hashes generated by Rabin Karp Rolling Hash not reflecting on the Text

Note: Lots of Possible duplicates, but nothing seems to be solving my problem. I am working on a Plagiarism detection based on MOSS. After successfully implementing a filter which strips out all the necessary details(comments,punctuations etc) I…
Nitish Upreti
  • 6,312
  • 9
  • 50
  • 92
1
2 3 4 5