Feb. 1, 2024, 12:45 p.m. | Benoit Baudry Khashayar Etemadi Sen Fang Yogya Gamage Yi Liu Yuxin Liu Martin Monperrus Javier

cs.LG updates on arXiv.org arxiv.org

Generating fake data is an essential dimension of modern software testing, as demonstrated by the number and significance of data faking libraries. Yet, developers of faking libraries cannot keep up with the wide range of data to be generated for different natural languages and domains. In this paper, we assess the ability of generative AI for generating test data in different domains. We design three types of prompts for Large Language Models (LLMs), which perform test data generation tasks at …

cs.ai cs.lg cs.se data developers domains fake fake data generate generated generative languages libraries modern natural paper significance software software testing test testing

Software Engineer for AI Training Data (School Specific)

@ G2i Inc | Remote

Software Engineer for AI Training Data (Python)

@ G2i Inc | Remote

Software Engineer for AI Training Data (Tier 2)

@ G2i Inc | Remote

Data Engineer

@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania

Artificial Intelligence – Bioinformatic Expert

@ University of Texas Medical Branch | Galveston, TX

Lead Developer (AI)

@ Cere Network | San Francisco, US