josephwilk.net
robots.txt

Robots Exclusion Standard data for josephwilk.net

Resource Scan

Scan Details

Site Domain josephwilk.net
Base Domain josephwilk.net
Scan Status Ok
Last Scan 2024-10-29T09:23:17+00:00
Next Scan 2024-11-28T09:23:17+00:00

Last Scan

Scanned 2024-10-29T09:23:17+00:00
URL http://josephwilk.net/robots.txt
Redirect https://art.josephwilk.net/robots.txt
Redirect Domain art.josephwilk.net
Redirect Base josephwilk.net
Domain IPs 3.131.150.69
Redirect IPs 185.199.108.153, 185.199.109.153, 185.199.110.153, 185.199.111.153, 2606:50c0:8000::153, 2606:50c0:8001::153, 2606:50c0:8002::153, 2606:50c0:8003::153
Response IP 185.199.110.153
Found Yes
Hash e18e090645a1fb915c3b9d039e1fa3c6607e793a69e3e35ccfee3e88c195cbb4
SimHash 50165960e113

Groups

applebot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

chatgpt

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

claude-web

Rule Path
Disallow /
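
The groups above can be checked with a short sketch. It assumes the file contains one `User-agent` group per listed agent, each with a single `Disallow: /` rule. The scan reports only the parsed groups, so the reconstructed text and the helper names `build_robots_txt` and `is_allowed` are illustrative, not the site's actual file. Matching uses Python's standard `urllib.robotparser`, which may differ in detail from how a given crawler interprets the rules.

```python
from urllib.robotparser import RobotFileParser

# User agents listed in the Groups section of the scan.
BLOCKED_AGENTS = [
    "applebot", "gptbot", "chatgpt-user", "chatgpt",
    "google-extended", "ccbot", "anthropic-ai", "claude-web",
]


def build_robots_txt(agents):
    """Render one group per agent, each disallowing the whole site.

    Assumption: the real file has this shape; the scan does not show its text.
    """
    groups = [f"User-agent: {agent}\nDisallow: /" for agent in agents]
    return "\n\n".join(groups) + "\n"


def is_allowed(robots_txt, user_agent, url):
    """Return True if user_agent may fetch url under the given rules."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


robots_txt = build_robots_txt(BLOCKED_AGENTS)
site = "https://art.josephwilk.net/"

for agent in ("GPTBot", "CCBot", "Googlebot"):
    verdict = "allowed" if is_allowed(robots_txt, agent, site) else "blocked"
    print(f"{agent}: {verdict}")
```

Because no `*` group is listed, an agent that matches none of the eight groups (here `Googlebot`) is not restricted by these rules.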