gre.ac.uk
robots.txt
Robots Exclusion Standard data for gre.ac.uk
Resource Scan
Scan Details
Site Domain | gre.ac.uk |
Base Domain | gre.ac.uk |
Scan Status | Ok |
Last Scan | 2024-10-18T16:01:07+00:00 |
Next Scan | 2024-11-17T16:01:07+00:00 |
Last Scan
Scanned | 2024-10-18T16:01:07+00:00 |
URL | https://gre.ac.uk/robots.txt |
Redirect | https://www.gre.ac.uk/robots.txt |
Redirect Domain | www.gre.ac.uk |
Redirect Base | gre.ac.uk |
Domain IPs | 193.37.244.43, 64:ff9b::c125:f42b |
Redirect IPs | 193.37.244.43 |
Response IP | 193.37.244.43 |
Found | Yes |
Hash | 9016dcba3052a4f779b3853db0e21b88d4f3db5c777e0748e98e80ce27879850 |
SimHash | 3c11482ae392 |
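The Hash value above is 64 hexadecimal digits, which is consistent with a SHA-256 digest, although the report does not name the algorithm or say exactly which bytes were hashed. A minimal sketch of change detection, assuming SHA-256 over the raw response body (the function names `sha256_hex` and `body_changed` are hypothetical):

```python
import hashlib


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of the raw bytes as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


def body_changed(body: bytes, recorded_hash: str) -> bool:
    """True if the body's digest differs from a previously recorded one.

    Assumption: the recorded hash is a SHA-256 hex digest of the same bytes.
    """
    return sha256_hex(body) != recorded_hash.strip().lower()
```

If that assumption holds, passing the fetched robots.txt body and the recorded Hash to `body_changed` would show whether the file has changed since the last scan.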
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | /*?sq_ |
Disallow | /*%26sq_ |
Disallow | /nugget/* |
Disallow | /ajax/ |
Disallow | /signup/interview-booking$ |
Disallow | /clearing/apply-direct |
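The rules above use two pattern features: `*` matches any run of characters and a trailing `$` anchors the pattern to the end of the path. Rules without `$` match as prefixes. The following is a minimal sketch of how a crawler might evaluate them. The helper names (`pattern_to_regex`, `is_disallowed`) and the sample paths are hypothetical, and the sketch does not normalize percent-encoding, so `%26` is compared literally. Python's standard `urllib.robotparser` is not used here because it does not interpret `*` or `$`.

```python
import re

# Disallow patterns for the "*" group, copied from the table above.
DISALLOW = [
    "/*?sq_",
    "/*%26sq_",
    "/nugget/*",
    "/ajax/",
    "/signup/interview-booking$",
    "/clearing/apply-direct",
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate one robots.txt path pattern into a compiled regex."""
    anchored_end = pattern.endswith("$")
    if anchored_end:
        pattern = pattern[:-1]
    # Escape every literal piece, then join the pieces with '.*' for each '*'.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    # r"\Z" pins the match to the very end of the path when '$' was given.
    return re.compile(body + (r"\Z" if anchored_end else ""))


RULES = [pattern_to_regex(p) for p in DISALLOW]


def is_disallowed(path: str) -> bool:
    """True if any Disallow rule matches from the start of the path.

    The group has no Allow rules, so a single match is enough.
    """
    return any(rule.match(path) for rule in RULES)


if __name__ == "__main__":
    for sample in (
        "/ajax/search",
        "/courses?sq_content=1",
        "/signup/interview-booking",
        "/signup/interview-booking/confirm",
        "/study",
    ):
        print(sample, is_disallowed(sample))
```

Under these assumptions, `/signup/interview-booking` is blocked but `/signup/interview-booking/confirm` is not, because the `$` restricts that rule to the exact path, while `/clearing/apply-direct` blocks every path that begins with it.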