getty.edu
robots.txt

Robots Exclusion Standard data for getty.edu

Resource Scan

Scan Details

Site Domain getty.edu
Base Domain getty.edu
Scan Status Ok
Last Scan2024-11-10T03:39:38+00:00
Next Scan 2024-12-10T03:39:38+00:00

Last Scan

Scanned2024-11-10T03:39:38+00:00
URL https://getty.edu/robots.txt
Redirect https://www.getty.edu/robots.txt
Redirect Domain www.getty.edu
Redirect Base getty.edu
Domain IPs 153.10.241.9
Redirect IPs 3.165.82.45, 3.165.82.53, 3.165.82.67, 3.165.82.83
Response IP 3.165.82.45
Found Yes
Hash 81e0f746bedb35facca14ee60db728f4d001a79e4da9429630bfe13e08433a3b
SimHash c0008d20e333

Groups

sortsite

Rule Path
Allow /

*

Rule Path
Disallow /cgi-bin

ahrefsbot

Rule Path
Disallow /

*

Rule Path
Disallow /slampoets

*

Rule Path
Disallow /research/collections/mirador/

yeti

Rule Path
Disallow /art/collection/

Comments

  • /robots.txt