cain.ulster.ac.uk
robots.txt

Robots Exclusion Standard data for cain.ulster.ac.uk

Resource Scan

Scan Details

Site Domain cain.ulster.ac.uk
Base Domain ulster.ac.uk
Scan Status Ok
Last Scan2025-06-01T14:28:11+00:00
Next Scan 2025-07-01T14:28:11+00:00

Last Scan

Scanned2025-06-01T14:28:11+00:00
URL https://cain.ulster.ac.uk/robots.txt
Domain IPs 3.169.71.126, 3.169.71.51, 3.169.71.74, 3.169.71.76
Response IP 52.85.5.92
Found Yes
Hash f644b241a26d10ba2c00946852c0fe2f1f0a4e22ba56b560567646b35a908293
SimHash 4d4d49c55bd5

Groups

*

Rule Path
Disallow /1-Read-Me/
Disallow /aajs/
Disallow /access/
Disallow /ajmenu/
Disallow /attitudeartwork/
Disallow /bgnd/
Disallow /cainapp/
Disallow /cainbgn/
Disallow /cgi-bin/
Disallow /commendation/
Disallow /database/
Disallow /dirres/
Disallow /email/
Disallow /exam-papers/
Disallow /Excite/
Disallow /JWscroller/
Disallow /logs/
Disallow /linklint/
Disallow /martin_melaugh/
Disallow /old-stats/
Disallow /oldccru/
Disallow /oldconfs/
Disallow /oldstuff/
Disallow /p4/
Disallow /periods/
Disallow /search/
Disallow /SFgate/
Disallow /st1100/
Disallow /stats/
Disallow /stats2/
Disallow /temp/
Disallow /textcounter/
Disallow /search/
Disallow /unesco/
Disallow /updatingcain/
Disallow /work/
Disallow /xhtdocs/
Disallow /xmartin/
Disallow /xmckeown/

Comments

  • robots.txt for http://cain.ulst.ac.uk/