sábado, 30 de junio de 2012

Evaluation of literary pastilles with the help plagiarism detection algorithms

This is was my final project for my master's first year in computer science - artificial intelligence degree. I developed a system, which evaluates a literary pastille against several books using the implementation and programming of plagiarism detection algorithms as Bag of words, Longest Common Substring, and Textual Detection Footprints.

The system is developed in python (Tkinter, Matplotlib, Os, Numpy), and consists in a system, which received as entry :
  - A collection of parameters.
  - One document to evaluate.
  - Several books in txt format for being evaluated with the document.

The system consists in a choice between the three differents algorithms. Each one of them algorithms give a solution with several statistics graphics regarding the property of plagiarism, and after, the system gave  in detail the more similar book with the paragraphs where it founds the plagiarism patron.

Paris - France, February – June 2012