architectsjournal.co.uk
robots.txt

Robots Exclusion Standard data for architectsjournal.co.uk

Resource Scan

Scan Details

Site Domain architectsjournal.co.uk
Base Domain architectsjournal.co.uk
Scan Status Ok
Last Scan2024-11-12T04:41:54+00:00
Next Scan 2024-11-19T04:41:54+00:00

Last Scan

Scanned2024-11-12T04:41:54+00:00
URL https://architectsjournal.co.uk/robots.txt
Redirect https://www.architectsjournal.co.uk/robots.txt
Redirect Domain www.architectsjournal.co.uk
Redirect Base architectsjournal.co.uk
Domain IPs 52.18.101.26, 52.19.234.53
Redirect IPs 52.18.101.26, 52.19.234.53
Response IP 52.18.101.26
Found Yes
Hash 8035fe0f001300290c4a8bf533173056986e38f340e8c4f13c74349aa33ab566
SimHash 5354c170ed91

Groups

mj12bot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

aspiegelbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

mauibot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

seekportbot

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

seekr

Rule Path
Disallow /

neticlebot

Rule Path
Disallow /

crawler4j

Rule Path
Disallow /

magpie-crawler

Rule Path
Disallow /

livelapbot

Rule Path
Disallow /

barkrowler

Rule Path
Disallow /

velenpublicwebcrawler

Rule Path
Disallow /

meltwater

Rule Path
Disallow /

*

Rule Path
Disallow /wp-admin
Disallow /account/
Disallow /?orderby*
Disallow /?s=*
Disallow /news/email-template/*