plexuslaw.co.uk
robots.txt

Robots Exclusion Standard data for plexuslaw.co.uk

Resource Scan

Scan Details

Site Domain plexuslaw.co.uk
Base Domain plexuslaw.co.uk
Scan Status Ok
Last Scan2024-06-25T23:20:07+00:00
Next Scan 2024-07-02T23:20:07+00:00

Last Scan

Scanned2024-06-25T23:20:07+00:00
URL http://plexuslaw.co.uk/robots.txt
Domain IPs 52.16.25.241
Response IP 52.16.25.241
Found Yes
Hash f61db731828384f701709067b8f214ed312f438a135fe082324271d62f5bd275
SimHash 4555c5562353

Groups

baiduspider

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

googlebot-image

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

googlebot

Rule Path
Disallow /*?

googlebot

Rule Path
Disallow /*.html?$

googlebot

Rule Path
Disallow /*.php$

googlebot

Rule Path
Disallow /*.aspx?$

mj12bot

Rule Path
Disallow /

vagabondo

Rule Path
Disallow /

yandex

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /