protocol-education.com
robots.txt

Robots Exclusion Standard data for protocol-education.com

Resource Scan

Scan Details

Site Domain protocol-education.com
Base Domain protocol-education.com
Scan Status Ok
Last Scan 2025-07-14T10:18:39+00:00
Next Scan 2025-08-13T10:18:39+00:00

Last Scan

Scanned 2025-07-14T10:18:39+00:00
URL https://www.protocol-education.com/robots.txt
Domain IPs 13.35.37.127, 13.35.37.13, 13.35.37.52, 13.35.37.83, 2600:9000:213e:0:e:47e7:6d80:93a1, 2600:9000:213e:5000:e:47e7:6d80:93a1, 2600:9000:213e:5200:e:47e7:6d80:93a1, 2600:9000:213e:6c00:e:47e7:6d80:93a1, 2600:9000:213e:ce00:e:47e7:6d80:93a1, 2600:9000:213e:dc00:e:47e7:6d80:93a1, 2600:9000:213e:ec00:e:47e7:6d80:93a1, 2600:9000:213e:f600:e:47e7:6d80:93a1
Response IP 13.35.37.127
Found Yes
Hash ae65cc9b0c89c9eadc09d60ffd22df63a0e766139f667be2730e8a6364040b41
SimHash 630141444f95

Groups

*

Rule Path
Disallow /admin$
Disallow /admin/*
Disallow /sa$
Disallow /sa/*
Disallow /api/*
Disallow /users/auth/*
Disallow /sso/*
Disallow /*?*
Disallow /templates/*
Allow /db_assets/production*?t=*
Disallow /job/*/apply
Disallow /job/*/save_job
Disallow /job/*/unsave_job
Disallow /jobs/*/*/*
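
The rules above rely on two special characters: `*` (any run of characters) and a trailing `$` (end of URL). The following is a minimal sketch of how a crawler might evaluate them, assuming the longest-match precedence described in RFC 9309, with ties going to Allow. The names `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical and chosen for illustration; this is not the scanner's or any particular crawler's implementation.

```python
import re

# Rules copied from the "*" group above, in their listed order.
RULES = [
    ("disallow", "/admin$"),
    ("disallow", "/admin/*"),
    ("disallow", "/sa$"),
    ("disallow", "/sa/*"),
    ("disallow", "/api/*"),
    ("disallow", "/users/auth/*"),
    ("disallow", "/sso/*"),
    ("disallow", "/*?*"),
    ("disallow", "/templates/*"),
    ("allow", "/db_assets/production*?t=*"),
    ("disallow", "/job/*/apply"),
    ("disallow", "/job/*/save_job"),
    ("disallow", "/job/*/unsave_job"),
    ("disallow", "/jobs/*/*/*"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' becomes '.*'; a trailing '$' anchors the end of the URL.
    Everything else is matched literally.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + (r"\Z" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` (path plus query string) may be crawled.

    Assumption: the longest matching pattern wins, and an Allow rule
    beats a Disallow rule of equal length. No match means allowed.
    """
    best = None  # (pattern length, is_allow)
    for kind, pattern in rules:
        # re.match anchors at the start, as robots.txt matching does.
        if pattern_to_regex(pattern).match(path):
            candidate = (len(pattern), kind == "allow")
            if best is None or candidate > best:
                best = candidate
    return True if best is None else best[1]


if __name__ == "__main__":
    for p in ["/admin", "/administrator", "/search?q=teacher",
              "/db_assets/production/logo.png?t=123", "/jobs/a/b/c"]:
        print(p, "->", "allowed" if is_allowed(p) else "blocked")
```

Under these assumptions, `/admin` is blocked while `/administrator` is not, because `/admin$` matches only the exact path. A URL such as `/db_assets/production/logo.png?t=123` matches both `/*?*` and the longer Allow pattern, so the Allow rule takes precedence.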

Other Records

Field Value
sitemap https://www.protocol-education.com/sitemap.xml