The New York Times blocks OpenAI’s web crawler | allainews.com

Aug. 21, 2023, 10:04 p.m. | Wes Davis

The Verge - All Posts www.theverge.com

Illustration by Alex Castro / The Verge

The New York Times has blocked OpenAI’s web crawler, meaning that OpenAI can’t use content from the publication to train its AI models. If you check the NYT’s robots.txt page, you can see that the NYT disallows GPTBot, the crawler that OpenAI introduced earlier this month. Based on the Internet Archive’s Wayback Machine, it appears NYT blocked the crawler as early as August 17th.

Screenshot by Jay Peters / The Verge …

ai models alex check crawler gptbot illustration meaning openai publication robots the new york times web web crawler

More from www.theverge.com / The Verge - All Posts

Google’s Gemini AI plan for schools promises extra data protection and privacy 4 hours ago | www.theverge.com

access age ai model ai model training +16

Newspaper conglomerate Gannett is adding AI-generated summaries to the top of its articles 5 hours ago | www.theverge.com

articles feature gannett generated +11

Reddit’s deal with OpenAI will plug its posts into ‘ChatGPT and new products’ 5 hours ago | www.theverge.com

access agreement alex api +13

The printer company, that makes a camera, prints one more edition 10 hours ago | www.theverge.com

alex diffusion filter highlight +5

Sony’s new Xperia 1 VI flagship zooms in on photography nerds 11 hours ago | www.theverge.com

app cameras devices experience +10

Hugging Face is sharing $10 million worth of compute to help beat the big AI … 13 hours ago | www.theverge.com

academics ai companies ai technologies big +19

From ChatGPT to Gemini: how AI is rewriting the internet 1 day ago | www.theverge.com

advancement ai-powered ai-powered chatbots chatbots +8

Google I/O 2024: all the news from the developer conference 1 day, 1 hour ago | www.theverge.com

call conference developer developer conference +10

Microsoft’s AI obsession is jeopardizing its climate ambitions 1 day, 6 hours ago | www.theverge.com

ai day ambitions bold build +17

Software Engineer for AI Training Data (School Specific)

@ G2i Inc | Remote

View on ai-jobs.net

Software Engineer for AI Training Data (Python)

@ G2i Inc | Remote

View on ai-jobs.net

Software Engineer for AI Training Data (Tier 2)

@ G2i Inc | Remote

View on ai-jobs.net

Data Engineer

@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania

View on ai-jobs.net

Artificial Intelligence – Bioinformatic Expert

@ University of Texas Medical Branch | Galveston, TX

View on ai-jobs.net

Lead Developer (AI)

@ Cere Network | San Francisco, US

View on ai-jobs.net