wgc.ltd.uk
robots.txt

Robots Exclusion Standard data for wgc.ltd.uk

Resource Scan

Scan Details

Site Domain wgc.ltd.uk
Base Domain wgc.ltd.uk
Scan Status Ok
Last Scan5/22/2025, 4:08:27 PM
Next Scan 5/29/2025, 4:08:27 PM

Last Scan

Scanned5/22/2025, 4:08:27 PM
URL https://wgc.ltd.uk/robots.txt
Domain IPs 109.228.9.119
Response IP 109.228.9.119
Found Yes
Hash 2fe61512ca78c4e92f670fcfb48ee5c0e9974b3159878457e44af63e58de12ff
SimHash 8551504577d0

Groups

*

Rule Path
Disallow /cgi-bin/
Disallow /tmp/
Disallow /junk/
Disallow /apps/
Disallow /awards/
Disallow /central/
Disallow /connect/
Disallow /clients/
Disallow /global/
Disallow /it/
Disallow /message/
Disallow /newsletter/
Disallow /office_srv/
Disallow /OneDrive/
Disallow /people/
Disallow /site/
Disallow /yourwgc/