Robots Exclusion Standard data for gites.co.uk

Resource Scan

Scan Details

Site Domain gites.co.uk
Base Domain gites.co.uk
Scan Status Ok
Last Scan 2025-04-15T21:20:26+00:00
Next Scan 2025-05-15T21:20:26+00:00

Last Scan

Scanned 2025-04-15T21:20:26+00:00
URL https://gites.co.uk/robots.txt
Redirect https://www.gites.co.uk/robots.txt
Redirect Domain www.gites.co.uk
Redirect Base gites.co.uk
Domain IPs 172.66.40.138, 172.66.43.118, 2606:4700:3108::ac42:288a, 2606:4700:3108::ac42:2b76
Redirect IPs 172.66.40.138, 172.66.43.118, 2606:4700:3108::ac42:288a, 2606:4700:3108::ac42:2b76
Response IP 172.66.43.118
Found Yes
Hash a9ae5957e5208acd7b0a990a9453fa2714196ab855c090125f6adc6c24b51871
SimHash db0c58eaa034

Groups

ccbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

omgili

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

imagesiftbot

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

*

Rule Path
Disallow /logs/
Disallow /cache/
Disallow /utmp/
Disallow /languages/
Disallow /private/
Disallow /public/css/
Disallow /public/js/
Disallow /public/jseditors/
Disallow /public/players/
Disallow /public/errors/
Disallow /public/themes/
Disallow /public/themes_c/
Disallow /public/fonts/
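
The groups above can be checked programmatically. The following is a minimal sketch, not the site's actual file: it rebuilds a robots.txt from the listed groups, assuming one User-agent line per group (the original file may combine agents differently), and queries it with Python's standard urllib.robotparser. The helper build_robots_txt and the agent name "ExampleBrowser" are hypothetical, introduced only for illustration.

```python
from urllib.robotparser import RobotFileParser

# Agents listed in the scan with a blanket "Disallow /" rule.
BLOCKED_AGENTS = [
    "ccbot", "chatgpt-user", "gptbot", "google-extended", "anthropic-ai",
    "claudebot", "omgilibot", "omgili", "facebookbot", "diffbot",
    "bytespider", "imagesiftbot", "cohere-ai",
]

# Paths disallowed for every other agent (the "*" group).
WILDCARD_DISALLOWS = [
    "/logs/", "/cache/", "/utmp/", "/languages/", "/private/",
    "/public/css/", "/public/js/", "/public/jseditors/", "/public/players/",
    "/public/errors/", "/public/themes/", "/public/themes_c/", "/public/fonts/",
]


def build_robots_txt() -> str:
    """Hypothetical helper: reconstruct a robots.txt from the scanned groups.

    Assumes one group per agent; the real file layout is not shown in the scan.
    """
    lines = []
    for agent in BLOCKED_AGENTS:
        lines += [f"User-agent: {agent}", "Disallow: /", ""]
    lines.append("User-agent: *")
    lines += [f"Disallow: {path}" for path in WILDCARD_DISALLOWS]
    return "\n".join(lines)


parser = RobotFileParser()
parser.parse(build_robots_txt().splitlines())

# A listed crawler is blocked from the whole site.
gptbot_home = parser.can_fetch("GPTBot", "https://www.gites.co.uk/")

# An unlisted agent falls through to the "*" group.
other_home = parser.can_fetch("ExampleBrowser", "https://www.gites.co.uk/")
other_private = parser.can_fetch("ExampleBrowser", "https://www.gites.co.uk/private/x")

print(gptbot_home, other_home, other_private)
```

Under these assumptions the listed crawlers are refused every path, while any other agent is refused only the directories in the "*" group. Note that urllib.robotparser matches agent names case-insensitively by substring, which may differ from how an individual crawler interprets the file.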

Comments

  • robotstxt.org/