guidelinepublications.co.uk
robots.txt

Robots Exclusion Standard data for guidelinepublications.co.uk

Resource Scan

Scan Details

Site Domain guidelinepublications.co.uk
Base Domain guidelinepublications.co.uk
Scan Status Ok
Last Scan 2025-08-04T04:19:29+00:00
Next Scan 2025-09-03T04:19:29+00:00

Last Scan

Scanned 2025-08-04T04:19:29+00:00
URL https://guidelinepublications.co.uk/robots.txt
Domain IPs 217.160.0.191
Response IP 217.160.0.191
Found Yes
Hash cbfdfec35e5e5c5759ceee40e6e4af6eaae3ce7d48ce0a5981c456c6d8ea92af
SimHash 0d5cdfc68710

Groups

*

Rule Path
Disallow /help/
Disallow /html/
Disallow /admin/

baidumobaider

Rule Path
Disallow /

baiduimagespider

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

yandex

Rule Path
Disallow /

mediapartners-google

Rule Path
Disallow /
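
The groups above can be checked with a short script. What follows is a minimal sketch, assuming the live file is equivalent to the reconstruction in ROBOTS_TXT. This report lists only the parsed groups, so the original ordering, comments, and whitespace are not known. User-agent names are copied as the report shows them. The helper names build_parser and is_allowed are hypothetical. Python's urllib.robotparser uses its own matching rules, which may differ from how a particular crawler reads the file.

```python
from urllib.robotparser import RobotFileParser

# Assumption: a reconstruction of the groups listed in this report,
# not a copy of the live robots.txt.
ROBOTS_TXT = """\
User-agent: *
Disallow: /help/
Disallow: /html/
Disallow: /admin/

User-agent: baidumobaider
Disallow: /

User-agent: baiduimagespider
Disallow: /

User-agent: baiduspider
Disallow: /

User-agent: yandex
Disallow: /

User-agent: mediapartners-google
Disallow: /
"""

SITE = "https://guidelinepublications.co.uk"


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text held in memory (no network access)."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


def is_allowed(parser: RobotFileParser, user_agent: str, path: str) -> bool:
    """Return True if user_agent may fetch path according to the parsed rules."""
    return parser.can_fetch(user_agent, SITE + path)


if __name__ == "__main__":
    parser = build_parser(ROBOTS_TXT)
    for agent, path in [
        ("Googlebot", "/"),
        ("Googlebot", "/admin/login"),
        ("Baiduspider", "/"),
        ("Yandex", "/index.html"),
    ]:
        print(agent, path, is_allowed(parser, agent, path))
```

Under these rules, a crawler with no group of its own falls back to the * group and is kept out of /help/, /html/, and /admin/ only, while each named crawler is blocked from the whole site.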