ppll.ent.sirsi.net
robots.txt

Robots Exclusion Standard data for ppll.ent.sirsi.net

Resource Scan

Scan Details

Site Domain ppll.ent.sirsi.net
Base Domain sirsi.net
Scan Status Ok
Last Scan 2025-07-25T21:34:58+00:00
Next Scan 2025-08-24T21:34:58+00:00

Last Scan

Scanned 2025-07-25T21:34:58+00:00
URL https://ppll.ent.sirsi.net/robots.txt
Domain IPs 104.18.6.204, 104.18.7.204
Response IP 104.18.7.204
Found Yes
Hash 1ce30e826c40d64361cc35a17f3d43983792df85b11c92f2dc723b1c0409a967
SimHash fd41c8055bb1
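
The Hash value above is 64 hexadecimal digits, which is consistent with a SHA-256 digest, although the scan does not name the algorithm or say exactly which bytes were hashed. A minimal sketch for comparing a locally saved copy of the file against the recorded value, assuming SHA-256 over the raw response body (the helper names `sha256_hex` and `matches_scan` are hypothetical):

```python
import hashlib

# Hash value recorded in the scan above. Assumption: it is a SHA-256 digest
# of the raw robots.txt response body; the scan itself does not say so.
EXPECTED_HASH = "1ce30e826c40d64361cc35a17f3d43983792df85b11c92f2dc723b1c0409a967"


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of body as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


def matches_scan(body: bytes, expected: str = EXPECTED_HASH) -> bool:
    """Check whether body hashes to the value recorded by the scan."""
    return sha256_hex(body) == expected.lower()
```

A mismatch does not necessarily mean the file changed: the assumption about the algorithm or the hashed bytes may simply be wrong.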

Groups

*

Rule Path
Disallow /client/default/search/
Disallow /client/en_US/default/search/
Disallow /client/kids/search/
Disallow /client/en_US/kids/search/
Disallow /client/teens/search/
Disallow /client/en_US/teens/search/
Disallow /client/test/search/
Disallow /client/en_US/test/search/
Disallow /custom/backup/
Disallow /custom/demo/
Disallow /custom/dixml/
Disallow /custom/hypersonic/
Disallow /custom/import/
Disallow /custom/pub/
Disallow /custom/resource_storage/
Disallow /custom/spellchecker/
Disallow /custom/tessdata/
Disallow /custom/tx-object-store/
Disallow /custom/upgrade/
Disallow /custom/wsdl/
Disallow /custom/xmbean-attrs/
Disallow /ACADEMIC-extraction.tar
Disallow /PUBLIC-extraction.tar
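
The rules listed for the `*` group can be checked against a path with Python's standard `urllib.robotparser`. The sketch below rebuilds a small subset of the rules as robots.txt text instead of fetching the live file; the user agent name `ExampleBot` and the helper names are hypothetical, and the sample paths are made up for illustration.

```python
from urllib.robotparser import RobotFileParser

# A subset of the "*" group listed above, rewritten in robots.txt syntax.
ROBOTS_TXT = """\
User-agent: *
Disallow: /client/default/search/
Disallow: /client/en_US/default/search/
Disallow: /custom/backup/
Disallow: /PUBLIC-extraction.tar
"""

BASE_URL = "https://ppll.ent.sirsi.net"


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text without making a network request."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


def is_allowed(parser: RobotFileParser, path: str, agent: str = "ExampleBot") -> bool:
    """Return True if agent may fetch path under the parsed rules."""
    return parser.can_fetch(agent, BASE_URL + path)


if __name__ == "__main__":
    parser = build_parser(ROBOTS_TXT)
    # Illustrative paths only; they are not taken from the scan.
    for path in ("/client/default/search/results", "/client/default/", "/PUBLIC-extraction.tar"):
        print(path, is_allowed(parser, path))
```

Each Disallow entry is matched as a path prefix, so anything under `/client/default/search/` is blocked while `/client/default/` itself stays fetchable.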