listcompanies.co.uk
robots.txt

Robots Exclusion Standard data for listcompanies.co.uk

Resource Scan

Scan Details

Site Domain listcompanies.co.uk
Base Domain listcompanies.co.uk
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2024-11-06T07:59:35+00:00
Next Scan 2025-02-04T07:59:35+00:00

Last Successful Scan

Scanned2024-07-10T07:58:00+00:00
URL https://listcompanies.co.uk/robots.txt
Domain IPs 104.21.77.48, 172.67.204.164, 2606:4700:3032::6815:4d30, 2606:4700:3034::ac43:cca4
Response IP 104.21.77.48
Found Yes
Hash 3c4de73e4fabe309ad27ff40135c4a32dfde9fc9ed55e7f0a315bbb8f65e40ec
SimHash 29207a323333

Groups

*

Rule Path
Disallow /new.html
Disallow /search
Disallow /update/
Disallow /directions/
Disallow /add-review/
Disallow /pages/

yandex

Rule Path
Disallow /

Comments

  • www.robotstxt.org/
  • Allow crawling of all content