commercialcleaning-services.co.uk
robots.txt

Robots Exclusion Standard data for commercialcleaning-services.co.uk

Resource Scan

Scan Details

Site Domain commercialcleaning-services.co.uk
Base Domain commercialcleaning-services.co.uk
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2024-05-01T15:57:42+00:00
Next Scan 2024-07-30T15:57:42+00:00

Last Successful Scan

Scanned2023-06-14T14:58:37+00:00
URL https://commercialcleaning-services.co.uk/robots.txt
Redirect https://www.commercialcleaning-services.co.uk/robots.txt
Redirect Domain www.commercialcleaning-services.co.uk
Redirect Base commercialcleaning-services.co.uk
Domain IPs 104.21.63.39
Redirect IPs 172.67.169.116
Response IP 172.67.169.116
Found Yes
Hash c6b93fee36f1db3957661fb737dcd26b08c53edaf90efe0c3bfc8c087c55d576
SimHash 69195a000452

Groups

*

Rule Path
Allow /
Allow /wp-content/uploads/*
Allow /wp-content/*.css
Disallow /cgi-bin
Disallow /wp-content/plugins/
Disallow /wp-content/themes/
Disallow /*/attachment/
Disallow /comments/
Disallow /xmlrpc.php
Disallow /?attachment_id*
Disallow /?s=
Disallow /search
Disallow /trackback
Disallow /*trackback*
Disallow /*/trackback
Disallow /*/feed/$
Disallow /*/feed/rss/$
Disallow /*/*/feed/$
Disallow /*/*/*/feed/rss/$

duckduckbot

Rule Path
Disallow /Sitemap

googlebot

Rule Path
Disallow /Sitemap

bingbot

Rule Path
Disallow /Sitemap

msiecrawler

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

httrack

Rule Path
Disallow /

webstripper

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

fetch

Rule Path
Disallow /

linko

Rule Path
Disallow /

microsoft.url.control

Rule Path
Disallow /

xenu

Rule Path
Disallow /

larbin

Rule Path
Disallow /

download ninja

Rule Path
Disallow /

wget

Rule Path
Disallow /

k2spider

Rule Path
Disallow /

webreaper

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.commercialcleaning-services.co.uk/sitemapindex.xml