sgs.co.uk
robots.txt

Robots Exclusion Standard data for sgs.co.uk

Resource Scan

Scan Details

Site Domain sgs.co.uk
Base Domain sgs.co.uk
Scan Status Ok
Last Scan2024-10-08T15:26:40+00:00
Next Scan 2024-11-07T15:26:40+00:00

Last Scan

Scanned2024-10-08T15:26:40+00:00
URL https://sgs.co.uk/robots.txt
Redirect https://www.sgs.co.uk/robots.txt
Redirect Domain www.sgs.co.uk
Redirect Base sgs.co.uk
Domain IPs 52.232.96.213
Redirect IPs 23.209.46.140, 23.209.46.145, 2600:1413:b000:1b::17d7:709, 2600:1413:b000:1b::17d7:70b
Response IP 184.28.235.74
Found Yes
Hash 3ea5319fc8a5d1357fe50943520c0cfd8e2e9bf6e9c7024c1f6f068c03ebbca2
SimHash 480cd6128b12

Groups

ahrefsbot

Rule Path
Disallow /

keynote

Rule Path
Disallow /

nutch

Rule Path
Disallow /

mozilla/5.0+(compatible;+msie+9.0;+windows+nt+6.1;+wow64;+trident/5.0;+ktxn)

Rule Path
Disallow /

*

Rule Path
Disallow /sitecore/
Disallow /api/
Disallow /api/sitecore/
Disallow *.aspx

mjbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.sgs.co.uk/sitemap.xml.gz

Warnings

  • 2 invalid lines.