April 10, 2024, 4:47 a.m. | Adaku Uchendu, Thai Le, Dongwon Lee

cs.CL updates on arXiv.org

arXiv:2309.12934v2 Announce Type: replace
Abstract: Recent advances in Large Language Models (LLMs) have enabled the generation of open-ended, high-quality texts that are non-trivial to distinguish from human-written texts. We refer to such LLM-generated texts as deepfake texts. There are currently over 72K text generation models in the huggingface model repo. As such, users with malicious intent can easily use these open-sourced LLMs to generate harmful texts and dis/misinformation at scale. To mitigate this problem, a computational method to determine if …
