LAZY students take note – lifting an article off the internet, translating it into another language and presenting it as your own work won't necessarily go unnoticed. It used to be really tough to spot this kind of plagiarism, thanks to creativity on the part of online translators. Not any more.
A team led by Alberto Barron-Cedeno at the Polytechnic University of Catalonia, Spain, used a number of statistical methods to analyse suspicious-looking documents. One involved breaking each text down into fragments that were five sentences long and looking for elements of words that were similar in two languages.
Another method used a bilingual dictionary to automatically check how many words in each text were the same. The documents could also be translated into a language with a common root to make the analysis easier.
The results surprised even them: their technique showed "remarkable performance" not only in identifying entire documents that had been copied – but in spotting tracts that made use of excessive paraphrasing, too (Knowledge Based Systems, doi.org/nqc). If a document is flagged by the system as being similar to another, then human experts can take a closer look.
This article appeared in print under the headline "Cheating is cheating – in any language"
- New Scientist
- Not just a website!
- Subscribe to New Scientist and get:
- New Scientist magazine delivered every week
- Unlimited online access to articles from over 500 back issues
- Subscribe Now and Save
If you would like to reuse any content from New Scientist, either in print or online, please contact the syndication department first for permission. New Scientist does not own rights to photos, but there are a variety of licensing options available for use of articles and graphics we own the copyright to.
Have your say
Only subscribers may leave comments on this article. Please log in.
Only personal subscribers may leave comments on this article
All comments should respect the New Scientist House Rules. If you think a particular comment breaks these rules then please use the "Report" link in that comment to report it to us.
If you are having a technical problem posting a comment, please contact technical support.