www.jobs.gla.ac.uk
robots.txt

Robots Exclusion Standard data for www.jobs.gla.ac.uk

Resource Scan

Scan Details

Site Domain www.jobs.gla.ac.uk
Base Domain gla.ac.uk
Scan Status Ok
Last Scan2025-04-19T01:13:47+00:00
Next Scan 2025-05-19T01:13:47+00:00

Last Scan

Scanned2025-04-19T01:13:47+00:00
URL https://www.jobs.gla.ac.uk/robots.txt
Domain IPs 108.156.144.37, 108.156.144.46, 108.156.144.72, 108.156.144.91, 2600:9000:2079:2200:1c:398:5800:93a1, 2600:9000:2079:3600:1c:398:5800:93a1, 2600:9000:2079:8800:1c:398:5800:93a1, 2600:9000:2079:8a00:1c:398:5800:93a1, 2600:9000:2079:8c00:1c:398:5800:93a1, 2600:9000:2079:9400:1c:398:5800:93a1, 2600:9000:2079:9a00:1c:398:5800:93a1, 2600:9000:2079:d200:1c:398:5800:93a1
Response IP 108.156.144.91
Found Yes
Hash 227b472991cbffc0cc4a36a4a4caeb40b391e2778f9b1a1b83bf2a84e3b0ef3d
SimHash 630341444f95

Groups

*

Rule Path
Disallow /admin$
Disallow /admin/*
Disallow /sa$
Disallow /sa/*
Disallow /api/*
Disallow /users/auth/*
Disallow /sso/*
Disallow /*?*
Disallow /templates/*
Allow /db_assets/production*?t=*
Disallow /job/*/apply
Disallow /job/*/save_job
Disallow /job/*/unsave_job
Disallow /jobs/*/*/*

Other Records

Field Value
sitemap https://www.jobs.gla.ac.uk/sitemap.xml