educatelondon.com
robots.txt

Robots Exclusion Standard data for educatelondon.com

Resource Scan

Scan Details

Site Domain educatelondon.com
Base Domain educatelondon.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2026-03-10T00:51:37+00:00
Next Scan 2026-04-09T00:51:37+00:00

Last Successful Scan

Scanned2026-02-08T17:42:10+00:00
URL http://educatelondon.com/robots.txt
Redirect https://www.standard.co.uk/robots.txt
Redirect Domain www.standard.co.uk
Redirect Base standard.co.uk
Domain IPs 52.48.9.82
Redirect IPs 104.18.43.17, 172.64.144.239, 2606:4700:4409::6812:2b11, 2a06:98c1:3108::ac40:90ef
Response IP 172.64.144.239
Found Yes
Hash 38b86a71c87f07c9ed09eaec9652b8014d768b41da97dee04b82821700ea879b
SimHash 21141f40e403

Groups

*

Rule Path
Disallow /api/
Disallow /internal-api/
Disallow /*ILC-refresh
Disallow /cdn-cgi/

nutch

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.standard.co.uk/sitemaps/googlenews
sitemap https://www.standard.co.uk/sitemap.xml

Comments

  • Robots.txt for https://www.standard.co.uk
  • Disallow list
  • Sitemaps