simonwillison.net
robots.txt

Robots Exclusion Standard data for simonwillison.net

Resource Scan

Scan Details

Site Domain simonwillison.net
Base Domain simonwillison.net
Scan Status Ok
Last Scan2024-09-30T02:04:31+00:00
Next Scan 2024-10-01T02:04:31+00:00

Last Scan

Scanned2024-09-30T02:04:31+00:00
URL https://simonwillison.net/robots.txt
Domain IPs 104.21.56.206, 172.67.136.172, 2606:4700:3031::6815:38ce, 2606:4700:3032::ac43:88ac
Response IP 104.21.56.206
Found Yes
Hash a8a0f94587ad3d4c347205481ddce94396d86102af84c29f3d2ca52becc5b9b3
SimHash 59155c05e093

Groups

chatgpt-user

Rule Path
Disallow

*

Rule Path
Disallow /admin/
Disallow /search/

Other Records

Field Value
sitemap https://simonwillison.net/sitemap.xml