Detection of Paraphrastic Cases of Mono-lingual and Cross-lingual Plagiarism
Abstract
External plagiarism detection is a distinct retrieval task in which the algorithm must provide evidence of plagiarism, if any exists, for a suspicious section by searching the pool of available source documents. This paper focuses on the role of paraphrasing in plagiarism detection, from both monolingual and cross-lingual perspectives. To investigate the challenges in detection, we further analyse the performance of a Vector Space Model (VSM)-based external plagiarism detection system on the PAN-PC-2011 corpus.
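As a rough illustration of what VSM-based retrieval looks like in this setting, the sketch below ranks source documents against a suspicious passage using TF-IDF weights and cosine similarity. It is a minimal sketch under stated assumptions, not the system evaluated in the paper: the tokenizer, the smoothed IDF formula, and the function names (tokenize, tfidf_vectors, cosine, rank_sources) are all hypothetical choices made for this example.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and keep runs of ASCII letters and digits (assumed tokenizer)."""
    return re.findall(r"[a-z0-9]+", text.lower())


def tfidf_vectors(docs):
    """Build one sparse TF-IDF vector (a dict of term -> weight) per document."""
    tokenized = [tokenize(d) for d in docs]
    n_docs = len(tokenized)
    # Document frequency: in how many documents each term appears.
    df = Counter(term for tokens in tokenized for term in set(tokens))
    # Smoothed IDF (an assumption of this sketch) keeps every weight positive.
    idf = {t: math.log((1 + n_docs) / (1 + c)) + 1.0 for t, c in df.items()}
    return [
        {t: tf * idf[t] for t, tf in Counter(tokens).items()}
        for tokens in tokenized
    ]


def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def rank_sources(suspicious, sources):
    """Return (source_index, similarity) pairs, most similar source first."""
    vectors = tfidf_vectors(sources + [suspicious])
    query = vectors[-1]
    scores = [(i, cosine(query, vec)) for i, vec in enumerate(vectors[:-1])]
    return sorted(scores, key=lambda pair: pair[1], reverse=True)


if __name__ == "__main__":
    sources = [
        "the cat sat on the mat",
        "stock markets fell sharply today",
    ]
    for index, score in rank_sources("a cat was sitting on a mat", sources):
        print(index, round(score, 3))
```

Because the vectors are built from surface word forms, a paraphrase that replaces most of the vocabulary shares few terms with its source and scores low, which is one reason paraphrastic and cross-lingual cases are hard for a plain VSM.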
Similar resources
English-Persian Plagiarism Detection based on a Semantic Approach
Plagiarism, which is defined as “the wrongful appropriation of other writers’ or authors’ works and ideas without citing or informing them”, poses a major challenge to the spread and publication of knowledge. Plagiarism falls into four categories: direct, paraphrasing (rewriting), translation, and combinatory. This paper addresses translational plagiarism which is sometimes referred to as cross-li...
On Cross-lingual Plagiarism Analysis using a Statistical Model
The automatic detection of plagiarism is a task that has acquired relevance in the Information Retrieval area and it becomes more complex when the plagiarism is made in a multilingual panorama, where the original and suspicious texts are written in different languages. From a cross-lingual perspective, a text fragment in one language is considered a plagiarism of a text in another language if t...
Multilingual Plagiarism Detection
Cross-lingual plagiarism detection has recently attracted attention due to copyright violations occurring in many fields such as education, journalism, scientific research, literature, screenplays, etc., where an author would translate an article in language L1 into language L2 and then either publish/submit it or change some of the sentences to suit his/her motivations. Therefore, the need for a ...
Detecting Cross-Lingual Plagiarism Using Simulated Word Embeddings
Cross-lingual plagiarism (CLP) occurs when texts written in one language are translated into a different language and used without acknowledging the original sources. One of the most common methods for detecting CLP requires online machine translators (such as Google or Microsoft translate) which are not always available, and given that plagiarism detection typically involves large document com...
A Comparison in Reading Ability and Achievement between Mono-Lingual and Bilingual Fifth Graders
Y. Adib, Ph.D., Z. Sharifi, N. Mahmoodi. To compare both the reading ability and academic achievement among Farsi-speaking mono-lingual fifth graders and their bilingual Aazari and Kordi counterparts, three samples of 153, 132, and 145 (total 430) such students from three cities o...