
Robots Exclusion Standard data for mirrors.cat.pdx.edu

Resource Scan

Scan Details

Site Domain mirrors.cat.pdx.edu
Base Domain pdx.edu
Scan Status Ok
Last Scan 2025-06-18T07:21:45+00:00
Next Scan 2025-07-18T07:21:45+00:00

Last Scan

Scanned 2025-06-18T07:21:45+00:00
URL https://mirrors.cat.pdx.edu/robots.txt
Domain IPs 131.252.208.20
Response IP 131.252.208.20
Found Yes
Hash c8bf144d7cf6d4b3a21eb1868c796ada2b73f72c2947626afe59cdb07d84504b
SimHash d80b058a5b95
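
The Hash value above is 64 hexadecimal characters, which is consistent with a SHA-256 digest of the fetched file. The report does not name the algorithm, so treat that as an assumption. Under that assumption, a minimal sketch for checking whether a locally fetched copy of robots.txt still matches the scanned one could look like this (the function name `robots_digest` and the constant `REPORTED_HASH` are illustrative, not part of the report):

```python
import hashlib

# Hash value copied from the scan report above.
REPORTED_HASH = "c8bf144d7cf6d4b3a21eb1868c796ada2b73f72c2947626afe59cdb07d84504b"


def robots_digest(body: bytes) -> str:
    """Return the hex SHA-256 digest of the raw robots.txt bytes.

    Assumption: the report's Hash field is SHA-256 over the response body.
    """
    return hashlib.sha256(body).hexdigest()


def matches_report(body: bytes) -> bool:
    """Compare a fetched body against the hash listed in the report."""
    return robots_digest(body) == REPORTED_HASH
```

If the digest differs, the file may have changed since the scan, or the report may hash the content differently (for example after normalizing line endings).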

Groups

*

Rule Path
Disallow /archlinux
Disallow /awstats-icon
Disallow /cat
Disallow /centos
Disallow /deepin
Disallow /debian
Disallow /dev
Disallow /epel
Disallow /fedora
Disallow /nginx-default
Disallow /opencsw
Disallow /pakcs
Disallow /penguin-transp.png
Disallow /penguin.png
Disallow /pgxn
Disallow /raspbian
Disallow /rocky
Disallow /stats
Disallow /ubuntu
Disallow /ubuntu-releases
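
To see how a crawler would interpret the rules listed above, the following sketch rebuilds a robots.txt body from the table and evaluates it with Python's standard `urllib.robotparser`. The reconstructed text is an assumption: the report shows only the parsed rules for the `*` group, not the original file, so comments, ordering, or extra directives in the real file may differ. The helper name `is_allowed` is illustrative.

```python
from urllib.robotparser import RobotFileParser

# Disallow paths as listed in the report for the "*" group.
DISALLOWED = [
    "/archlinux", "/awstats-icon", "/cat", "/centos", "/deepin",
    "/debian", "/dev", "/epel", "/fedora", "/nginx-default",
    "/opencsw", "/pakcs", "/penguin-transp.png", "/penguin.png",
    "/pgxn", "/raspbian", "/rocky", "/stats", "/ubuntu",
    "/ubuntu-releases",
]

# Reconstructed robots.txt lines (an approximation of the real file).
ROBOTS_LINES = ["User-agent: *"] + [f"Disallow: {p}" for p in DISALLOWED]

parser = RobotFileParser()
parser.parse(ROBOTS_LINES)  # parse from memory; no network request is made


def is_allowed(path: str, agent: str = "*") -> bool:
    """Return True if `agent` may fetch `path` on mirrors.cat.pdx.edu."""
    return parser.can_fetch(agent, "https://mirrors.cat.pdx.edu" + path)


print(is_allowed("/debian/dists/"))  # False: matches the /debian prefix
print(is_allowed("/"))               # True: no rule covers the site root
```

Rules are matched as path prefixes, so `Disallow /ubuntu` already covers `/ubuntu-releases`, and `Disallow /cat` would also block a path such as `/catalog`.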