qegs.lincs.sch.uk
robots.txt

Robots Exclusion Standard data for qegs.lincs.sch.uk

Resource Scan

Scan Details

Site Domain qegs.lincs.sch.uk
Base Domain qegs.lincs.sch.uk
Scan Status Ok
Last Scan 2025-11-21T00:34:03+00:00
Next Scan 2025-12-05T00:34:03+00:00

Last Scan

Scanned 2025-11-21T00:34:03+00:00
URL https://qegs.lincs.sch.uk/robots.txt
Redirect https://www.qegs.lincs.sch.uk/robots.txt
Redirect Domain www.qegs.lincs.sch.uk
Redirect Base qegs.lincs.sch.uk
Domain IPs 104.21.18.125, 172.67.181.213, 2606:4700:3033::ac43:b5d5, 2606:4700:3035::6815:127d
Redirect IPs 104.21.18.125, 172.67.181.213, 2606:4700:3033::ac43:b5d5, 2606:4700:3035::6815:127d
Response IP 104.21.18.125
Found Yes
Hash 5f42d2d6c06e3b10bdf3d6d05cf10d8d109f01fe97f19592e5f9682bc25d0708
SimHash 79155c13cf91

Groups

*

Rule Path
Disallow /admin/
Disallow /bin/
Disallow /Connections/
Allow /i/
Disallow /inc/
Disallow /docs/
Disallow /*.pdf$
Disallow /*.doc$
Disallow /*.xls$
Disallow /*.docx$
Allow /inc/gallery/
Allow /i/photos/Gallery/
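
The group above mixes overlapping Allow and Disallow paths (for example /inc/ and /inc/gallery/) with wildcard patterns such as /*.pdf$, so the outcome for a given URL depends on how a crawler resolves conflicts. The following is a minimal sketch, assuming the longest-match precedence described in RFC 9309, with pattern length used as the measure of specificity and Allow winning ties. The helper names pattern_to_regex and is_allowed are hypothetical, and real crawlers may differ in detail.

```python
import re

# Rules copied from the "*" group above, in the order listed.
RULES = [
    ("disallow", "/admin/"),
    ("disallow", "/bin/"),
    ("disallow", "/Connections/"),
    ("allow", "/i/"),
    ("disallow", "/inc/"),
    ("disallow", "/docs/"),
    ("disallow", "/*.pdf$"),
    ("disallow", "/*.doc$"),
    ("disallow", "/*.xls$"),
    ("disallow", "/*.docx$"),
    ("allow", "/inc/gallery/"),
    ("allow", "/i/photos/Gallery/"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    "*" matches any run of characters; a trailing "$" anchors the end
    of the path. Everything else is matched literally, case-sensitively.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile("^" + body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under the assumed precedence.

    The longest matching pattern wins; on equal length, Allow wins.
    A path that matches no rule is allowed by default.
    """
    best_len = -1
    best_allow = True
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            allow = kind == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and allow):
                best_len, best_allow = len(pattern), allow
    return best_allow


if __name__ == "__main__":
    for sample in ("/inc/header.php", "/inc/gallery/photo.jpg",
                   "/news/letter.pdf", "/i/photos/Gallery/a.pdf"):
        print(sample, is_allowed(sample))
```

Under these assumptions, /inc/gallery/photo.jpg is allowed because /inc/gallery/ is longer than /inc/, while /news/letter.pdf is blocked by /*.pdf$. A PDF under /i/photos/Gallery/ is still allowed, since that Allow path is longer than the /*.pdf$ pattern.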

Other Records

Field Value
sitemap http://qegs.greenhousecms.co.uk/sitemap.xml
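
The sitemap record above can be read back out of a robots.txt body with Python's standard urllib.robotparser module. The sketch below is illustrative only: the ROBOTS_TXT string is a shortened reconstruction from this report (one rule plus the sitemap line), not the actual file, whose ordering, comments and whitespace are not shown here.

```python
from urllib.parse import urlsplit
from urllib.robotparser import RobotFileParser

# Shortened reconstruction based on the records in this report.
# Assumption: the real file expresses them with these standard field names.
ROBOTS_TXT = """\
User-agent: *
Disallow: /admin/
Sitemap: http://qegs.greenhousecms.co.uk/sitemap.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# site_maps() (Python 3.8+) returns None when no Sitemap lines exist.
sitemaps = parser.site_maps() or []

# Host that served robots.txt after the redirect, per the scan details.
served_host = "www.qegs.lincs.sch.uk"

for url in sitemaps:
    parts = urlsplit(url)
    same_host = parts.hostname == served_host
    print(url, "scheme:", parts.scheme, "same host as robots.txt:", same_host)
```

In this report the sitemap URL uses plain http and a different host (qegs.greenhousecms.co.uk) from the one that served robots.txt, which is the kind of mismatch the loop above surfaces.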