how to protect your website against "AI Scrapers"

what is ai scraping, you may ask? well basically it's a thing AI bots do where they steal info from ur website to help them develop, and you should be able to have a say in whether you give your data to these AIs instead of them stealing all your hard work.

protecting your site is very straightforward! however it's not guaranteed that any of this will work 100% since some bots have ways around this/ignore it.

1) robots.txt

go to yourwebsite.com/robots.txt to see if there's already a robots.txt file. if there isn't, just make a new file callled robots.txt and copy-paste this into it:

User-agent: CCBot
Disallow: /

User-agent: ChatGPT-User
Disallow: /

User-agent: GPTBot
Disallow: /

2. spawning

download their file (which is simply  a text file called something like ai.txt) and upload it to your site, done.

3. protect your images

this method doesnt always work either, so you might be better off watermarking or blocking direct access to your images, or using glaze or mist instead.

add this into <head>:

<meta name=”robots” content=”noai, noimageai”>


youre welcome.


2 Kudos

Comments

Displaying 0 of 0 comments ( View all | Add Comment )