nhsggc.org.uk
robots.txt

Robots Exclusion Standard data for nhsggc.org.uk

Resource Scan

Scan Details

Site Domain nhsggc.org.uk
Base Domain nhsggc.org.uk
Scan Status Ok
Last Scan2026-02-26T12:19:58+00:00
Next Scan 2026-03-12T12:19:58+00:00

Last Scan

Scanned2026-02-26T12:19:58+00:00
URL https://nhsggc.org.uk/robots.txt
Redirect https://www.nhsggc.org.uk/robots.txt
Redirect Domain www.nhsggc.org.uk
Redirect Base nhsggc.org.uk
Domain IPs 20.0.114.104
Redirect IPs 20.0.114.104
Response IP 20.0.114.104
Found Yes
Hash ff3c6ae10d931b64b91adfa7330d2fe66b30a384f466589936f3b3a8189b5a19
SimHash 2f283941ef80

Groups

*

Rule Path
Disallow /about-us/
Disallow /aspnet_client/
Disallow /bin/
Disallow /config/
Disallow /content/
Disallow /CONTENT/
Disallow /data/
Disallow /macroScripts/
Disallow /media/
Disallow /mediaassets/
Disallow /umbraco/
Disallow /umbraco_client/
Disallow /usercontrols/
Disallow /xslt/

semrushbot

Rule Path
Disallow /

semrushbot-sa

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

amazonbot

Product Comment
amazonbot Amazon's user agent
Rule Path
Disallow /

adsbot

Rule Path
Disallow /

semanticscholarbot

Rule Path
Disallow /

bingbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.nhsggc.org.uk/seo-sitemap