April 3, 2024, 4:46 a.m. | Jiarong Xian, Jibao Yuan, Peiwei Zheng, Dexian Chen

cs.CL updates on arXiv.org arxiv.org

arXiv:2404.01582v1 Announce Type: new
Abstract: Text plagiarism detection task is a common natural language processing task that aims to detect whether a given text contains plagiarism or copying from other texts. In existing research, detection of high level plagiarism is still a challenge due to the lack of high quality datasets. In this paper, we propose a plagiarized text data generation method based on GPT-3.5, which produces 32,927 pairs of text plagiarism detection datasets covering a wide range of plagiarism …

abstract arxiv bert challenge cs.ai cs.cl cs.ir datasets detection homework language language processing natural natural language natural language processing plagiarism processing quality research retrieval text tool type

Software Engineer for AI Training Data (School Specific)

@ G2i Inc | Remote

Software Engineer for AI Training Data (Python)

@ G2i Inc | Remote

Software Engineer for AI Training Data (Tier 2)

@ G2i Inc | Remote

Data Engineer

@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania

Artificial Intelligence – Bioinformatic Expert

@ University of Texas Medical Branch | Galveston, TX

Lead Developer (AI)

@ Cere Network | San Francisco, US