willthompson.co.uk
robots.txt

Robots Exclusion Standard data for willthompson.co.uk

Resource Scan

Scanned	2025-11-02T06:18:23+00:00
URL	https://willthompson.co.uk/robots.txt
Domain IPs	2a00:1098:0:80:1000:3b:1:1, 2a00:1098:0:82:1000:3b:1:1, 46.235.225.189, 93.93.129.174
Response IP	46.235.225.189
Found	Yes
Hash	4701991e3f62f2136f4a15f0d0885990df17f36e1bac036656524a2d8e6140fc
SimHash	712ec901c1e5

Rule	Path
Disallow	/

Rule	Path
Disallow	/

Rule	Path
Allow	/

Field	Value
sitemap	https://willthompson.co.uk/sitemap.xml

Block Common Crawl (https://commoncrawl.org/ccbot)
Block generative AI/LLM bots (source: https://github.com/ai-robots-txt/ai.robots.txt)
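
As a rough illustration of how rules like these are evaluated, the sketch below feeds a hypothetical robots.txt into Python's standard `urllib.robotparser`. The scan lists two `Disallow: /` groups and one `Allow: /` group but does not say which user agents they apply to, so the `User-agent` lines here are assumptions: `CCBot` is taken from the Common Crawl comment, `GPTBot` stands in as one illustrative AI crawler, and `*` is assumed for the `Allow` group. `ExampleBrowser` is a made-up agent name.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical reconstruction: the User-agent lines are assumptions,
# since the scan reports the rules but not the groups they belong to.
ASSUMED_ROBOTS_TXT = """\
# Block Common Crawl (https://commoncrawl.org/ccbot)
User-agent: CCBot
Disallow: /

# Block generative AI/LLM bots (GPTBot is an illustrative stand-in)
User-agent: GPTBot
Disallow: /

User-agent: *
Allow: /

Sitemap: https://willthompson.co.uk/sitemap.xml
"""


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text without making a network request."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ASSUMED_ROBOTS_TXT)
url = "https://willthompson.co.uk/"

# Agents matched by a "Disallow: /" group are denied every path.
print("CCBot allowed:", parser.can_fetch("CCBot", url))
print("GPTBot allowed:", parser.can_fetch("GPTBot", url))

# Any other agent falls through to the assumed "*" group.
print("ExampleBrowser allowed:", parser.can_fetch("ExampleBrowser", url))

# site_maps() requires Python 3.8 or later.
print("Sitemaps:", parser.site_maps())
```

Under these assumptions the two named crawlers are refused everywhere while all other agents are allowed, which matches the intent stated in the comments above. The real file may name different or additional agents.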