Robots Exclusion Standard data for john-clark.co.uk
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | john-clark.co.uk |
Base Domain | john-clark.co.uk |
Scan Status | Ok |
Last Scan | 2024-05-22T09:48:23+00:00 |
Next Scan | 2024-06-21T09:48:23+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-05-22T09:48:23+00:00 |
URL | https://www.john-clark.co.uk/robots.txt |
Domain IPs | 18.155.68.105, 18.155.68.78, 18.155.68.8, 18.155.68.93 |
Response IP | 18.155.68.78 |
Found | Yes |
Hash | d2f76f55445d12f6e62c8c38dee60906ff12fbc78c003269de242a059f0da99c |
SimHash | 891ddb03cd42 |
Groups
User-agent: *
Rule | Path |
---|---|
Allow | / |
Disallow | */online-deposits/* |
Disallow | */site/* |
Disallow | */enquiry/* |
Disallow | */new-car-variants/* |
Disallow | */book-a-test-drive/* |
Disallow | */api/* |
Disallow | */ajax/* |
Disallow | */search/* |
Disallow | *order%3D* |
Disallow | */vehicle-search/* |
Disallow | */new-car-configurator/* |
Disallow | */data-preferences/* |
Disallow | */online-payments/* |
Disallow | */sitemap/* |
Disallow | */js/_nd/* |
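
The rules above rely on `*` wildcards, so whether a given path is crawlable is not obvious at a glance. The following is a minimal sketch of how a crawler might evaluate them, assuming the common interpretation in which `*` matches any run of characters, a trailing `$` anchors the end of the path, the longest matching pattern wins, and Allow wins a tie. The names `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical, and the sketch skips percent-encoding normalization.

```python
import re

# Rules copied from the "*" group in the table above.
RULES = [
    ("allow", "/"),
    ("disallow", "*/online-deposits/*"),
    ("disallow", "*/site/*"),
    ("disallow", "*/enquiry/*"),
    ("disallow", "*/new-car-variants/*"),
    ("disallow", "*/book-a-test-drive/*"),
    ("disallow", "*/api/*"),
    ("disallow", "*/ajax/*"),
    ("disallow", "*/search/*"),
    ("disallow", "*order%3D*"),
    ("disallow", "*/vehicle-search/*"),
    ("disallow", "*/new-car-configurator/*"),
    ("disallow", "*/data-preferences/*"),
    ("disallow", "*/online-payments/*"),
    ("disallow", "*/sitemap/*"),
    ("disallow", "*/js/_nd/*"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal text; each '*' becomes "any run of characters".
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under `rules`.

    Assumption: longest matching pattern wins, Allow wins ties,
    and a path that matches no rule is allowed.
    """
    best_len, best_allow = -1, True
    for kind, pattern in rules:
        # re.match anchors at the start of the path, as robots.txt rules do.
        if pattern_to_regex(pattern).match(path):
            allow = kind == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and allow):
                best_len, best_allow = len(pattern), allow
    return best_allow


if __name__ == "__main__":
    for p in ["/", "/used-cars/", "/api/vehicles", "/api",
              "/used-cars/?order%3Dprice"]:
        print(p, is_allowed(p))
```

Under these assumptions `/api/vehicles` is blocked by `*/api/*`, while `/api` (no trailing slash) matches only `Allow: /` and stays crawlable.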
Other Records
Field | Value |
---|---|
sitemap | https://www.john-clark.co.uk/sitemap.xml.gz |
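
The sitemap record points at a gzip-compressed XML file. As a rough sketch of how its URLs could be listed once the file has been downloaded, the hypothetical helper `extract_locs` below decompresses the bytes and collects every `<loc>` element in the standard sitemap namespace. The sample data is invented for illustration and is not taken from the site.

```python
import gzip
import xml.etree.ElementTree as ET

# Namespace defined by the sitemaps.org protocol.
SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"


def extract_locs(gz_bytes):
    """Return all <loc> values from a gzip-compressed sitemap.

    Works for both <urlset> and <sitemapindex> documents, since
    each stores its URLs in namespaced <loc> elements.
    """
    xml_bytes = gzip.decompress(gz_bytes)
    root = ET.fromstring(xml_bytes)
    return [el.text.strip() for el in root.iter(f"{{{SITEMAP_NS}}}loc") if el.text]


if __name__ == "__main__":
    # Invented sample document; example.com URLs are placeholders.
    sample = gzip.compress(
        b'<?xml version="1.0" encoding="UTF-8"?>'
        b'<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">'
        b"<url><loc>https://example.com/a</loc></url>"
        b"<url><loc>https://example.com/b</loc></url>"
        b"</urlset>"
    )
    print(extract_locs(sample))
```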