crowdstorm.co.uk
robots.txt

Robots Exclusion Standard data for crowdstorm.co.uk

Resource Scan

Scan Details

Site Domain crowdstorm.co.uk
Base Domain crowdstorm.co.uk
Scan Status Ok
Last Scan 2024-06-10T23:13:28+00:00
Next Scan 2024-07-10T23:13:28+00:00

Last Scan

Scanned 2024-06-10T23:13:28+00:00
URL https://crowdstorm.co.uk/robots.txt
Redirect https://www.crowdstorm.co.uk/robots.txt
Redirect Domain www.crowdstorm.co.uk
Redirect Base crowdstorm.co.uk
Domain IPs 104.21.26.70, 172.67.135.153, 2606:4700:3034::6815:1a46, 2606:4700:3036::ac43:8799
Redirect IPs 104.21.26.70, 172.67.135.153, 2606:4700:3034::6815:1a46, 2606:4700:3036::ac43:8799
Response IP 172.67.135.153
Found Yes
Hash 1e5904a08a52cc3ed3385feb4e602f22761dacf11b0ecd28daa5f38d946650b1
SimHash 891ed9764a11

Groups

trovitbot

Rule Path
Disallow /

magpie-crawler/1.1

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

ezooms

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

*

Rule Path
Disallow /out
Disallow /out/
Disallow *?*
Allow *?v*

googlebot

Rule Path
Allow *?aj=true*
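
The `*` group above mixes a broad `Disallow *?*` with a narrower `Allow *?v*`, so the outcome for a given URL depends on which rule takes precedence. The following Python sketch illustrates one common interpretation (the longest matching pattern wins, and ties go to Allow, as described in RFC 9309). It is an illustration under stated assumptions, not the scanner's or any crawler's actual implementation; the names `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical, and real crawlers may treat patterns that begin with `*` rather than `/` differently.

```python
import re

# Rules copied from the "*" group in the listing above.
RULES = [
    ("disallow", "/out"),
    ("disallow", "/out/"),
    ("disallow", "*?*"),
    ("allow", "*?v*"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    "*" matches any run of characters; a trailing "$" anchors the end.
    Everything else is matched literally.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` (including any query string) may be fetched.

    Assumption: the longest matching pattern wins, Allow wins ties,
    and a path that matches no rule is allowed.
    """
    best = None
    for kind, pattern in rules:
        # re.match anchors at the start of the path, as robots.txt rules do.
        if pattern_to_regex(pattern).match(path):
            key = (len(pattern), kind == "allow")
            if best is None or key > best:
                best = key
    return True if best is None else best[1]


if __name__ == "__main__":
    for sample in ("/about", "/out/123", "/search?q=x", "/page?v=2"):
        print(sample, is_allowed(sample))
```

Under these assumptions, `/page?v=2` is fetchable because `*?v*` is longer than `*?*`, while `/search?q=x` and anything under `/out` are blocked. Note that `*?v*` matches any query string starting with `v`, not only a parameter named `v`.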