serverhouse.co.uk
robots.txt

Robots Exclusion Standard data for serverhouse.co.uk

Resource Scan

Scan Details

Site Domain serverhouse.co.uk
Base Domain serverhouse.co.uk
Scan Status Ok
Last Scan2024-05-14T10:51:56+00:00
Next Scan 2024-06-13T10:51:56+00:00

Last Scan

Scanned2024-05-14T10:51:56+00:00
URL https://serverhouse.co.uk/robots.txt
Domain IPs 5.2.19.176
Response IP 5.2.19.176
Found Yes
Hash 0cc8fbf3afcebb9c3ea79c4769955aa47984106c6b763bfc74e67cf86dbf5879
SimHash 621ec944e6b5

Groups

aa-site-audit-crawler

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

omgili

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

mauibot

Rule Path
Disallow /

megaindex.ru

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

ravencrawler

Rule Path
Disallow /

seekportbot

Rule Path
Disallow /

seekport crawler

Rule Path
Disallow /

seznambot

Rule Path
Disallow /

slurp

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /

imagesiftbot

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

*

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

Comments

  • Slow down bots