Aug. 8, 2023, 7:49 p.m. | Ervin Szilagyi

DEV Community dev.to

Lately, we can bloc GPT bots from scraping our pages for a site that we control, by setting the following lines in the robots.txt file:



User-agent: GPTBot
Disallow: /


I, myself, found out this from a tweet from Gergely Orosz:



My stance on this is similar to what Gergely is saying. GPT offers no citation to the information it provides. While I did update the robots.txt file on my personal website, I am also cross-posting to DEV. If we …

blog bot bots control discuss found gpt gpt3 gptbot openai robots scraping tweet watercooler

Lead Developer (AI)

@ Cere Network | San Francisco, US

Research Engineer

@ Allora Labs | Remote

Ecosystem Manager

@ Allora Labs | Remote

Founding AI Engineer, Agents

@ Occam AI | New York

AI Engineer Intern, Agents

@ Occam AI | US

AI Research Scientist

@ Vara | Berlin, Germany and Remote