apply.greateranglia.co.uk
robots.txt

Robots Exclusion Standard data for apply.greateranglia.co.uk

Resource Scan

Scan Details

Site Domain apply.greateranglia.co.uk
Base Domain greateranglia.co.uk
Scan Status Ok
Last Scan2025-10-15T00:15:47+00:00
Next Scan 2025-11-14T00:15:47+00:00

Last Scan

Scanned2025-10-15T00:15:47+00:00
URL https://apply.greateranglia.co.uk/robots.txt
Domain IPs 2600:9000:24f3:2600:9:ed9a:6040:93a1, 2600:9000:24f3:6c00:9:ed9a:6040:93a1, 2600:9000:24f3:7000:9:ed9a:6040:93a1, 2600:9000:24f3:800:9:ed9a:6040:93a1, 2600:9000:24f3:9600:9:ed9a:6040:93a1, 2600:9000:24f3:9800:9:ed9a:6040:93a1, 2600:9000:24f3:ba00:9:ed9a:6040:93a1, 2600:9000:24f3:c400:9:ed9a:6040:93a1, 3.165.75.101, 3.165.75.6, 3.165.75.72, 3.165.75.97
Response IP 3.165.75.101
Found Yes
Hash 2654f250cd033dd05e4de400e476c4b189cdc241c5d49e4715975bb868015981
SimHash 09085d204dc7

Groups

*

Rule Path
Disallow /mydata
Disallow /admin
Disallow /companies
Disallow /postings/b5fce899-d950-4ac5-86eb-08fb70ab64d4
Disallow /postings/f97808b2-cec1-4dc7-b22e-e2c43df3d304
Disallow /postings/d1e4b963-94f9-4d3c-9196-941328b37a5c
Disallow /postings/e889f74f-5d01-4af9-9b1b-24c27ebc5fc6
Disallow /postings/569945d8-29e2-48a1-9dba-18ca000ab308
Disallow /postings/f5f2ed25-1041-4caa-a792-97528f715a64
Disallow /postings/ef9ab949-6242-45ea-8ebe-feef30b95366

Other Records

Field Value
sitemap https://apply.greateranglia.co.uk/sitemap