Robots Exclusion Standard data for wgc.ltd.uk
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | wgc.ltd.uk |
Base Domain | wgc.ltd.uk |
Scan Status | Ok |
Last Scan | 5/22/2025, 4:08:27 PM |
Next Scan | 5/29/2025, 4:08:27 PM |
Last Scan
Field | Value |
---|---|
Scanned | 5/22/2025, 4:08:27 PM |
URL | https://wgc.ltd.uk/robots.txt |
Domain IPs | 109.228.9.119 |
Response IP | 109.228.9.119 |
Found | Yes |
Hash | 2fe61512ca78c4e92f670fcfb48ee5c0e9974b3159878457e44af63e58de12ff |
SimHash | 8551504577d0 |
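
The Hash value is 64 hexadecimal digits long, which matches the length of a SHA-256 digest. The report does not name the algorithm, so treating it as SHA-256 of the raw response body is an assumption. Under that assumption, a minimal sketch of detecting a changed robots.txt between scans might look like this (the helper names `sha256_hex` and `has_changed` are hypothetical):

```python
import hashlib


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of the raw robots.txt bytes as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


def has_changed(body: bytes, previous_hash: str) -> bool:
    """Compare a freshly fetched body against a hash stored from an earlier scan.

    Assumes the stored hash is SHA-256 over the exact bytes served;
    the scan report itself does not confirm this.
    """
    return sha256_hex(body) != previous_hash.strip().lower()
```

An exact hash only answers whether the file is byte-for-byte identical. The separate SimHash field suggests the scanner also keeps a similarity fingerprint, which this sketch does not attempt to reproduce.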
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | /cgi-bin/ |
Disallow | /tmp/ |
Disallow | /junk/ |
Disallow | /apps/ |
Disallow | /awards/ |
Disallow | /central/ |
Disallow | /connect/ |
Disallow | /clients/ |
Disallow | /global/ |
Disallow | /it/ |
Disallow | /message/ |
Disallow | /newsletter/ |
Disallow | /office_srv/ |
Disallow | /OneDrive/ |
Disallow | /people/ |
Disallow | /site/ |
Disallow | /yourwgc/ |
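
To illustrate how a crawler would apply this group, the sketch below rebuilds the rules from the table as robots.txt text and checks a few URLs with Python's standard `urllib.robotparser`. The reconstructed file is an approximation (the original's exact bytes, comments, and ordering are not shown in the report), and the agent name `ExampleBot` and the sample URLs are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Paths copied from the Disallow rows of the "*" group in the table above.
DISALLOWED = [
    "/cgi-bin/", "/tmp/", "/junk/", "/apps/", "/awards/", "/central/",
    "/connect/", "/clients/", "/global/", "/it/", "/message/",
    "/newsletter/", "/office_srv/", "/OneDrive/", "/people/", "/site/",
    "/yourwgc/",
]


def build_robots_txt(paths):
    """Rebuild an approximate robots.txt body for a single wildcard group."""
    lines = ["User-agent: *"]
    lines += [f"Disallow: {path}" for path in paths]
    return "\n".join(lines) + "\n"


def make_parser(paths=DISALLOWED):
    """Return a RobotFileParser loaded with the reconstructed rules."""
    parser = RobotFileParser()
    parser.parse(build_robots_txt(paths).splitlines())
    return parser


parser = make_parser()
# "ExampleBot" is a made-up agent; it falls under the "*" group.
print(parser.can_fetch("ExampleBot", "https://wgc.ltd.uk/people/index.html"))
print(parser.can_fetch("ExampleBot", "https://wgc.ltd.uk/"))
```

The first check reports that the URL is blocked, because its path begins with `/people/`; the second is permitted, because no rule matches the site root. Rule paths are compared as case-sensitive prefixes, so `/OneDrive/` does not cover a request for `/onedrive/`.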