getty.edu
robots.txt

Robots Exclusion Standard data for getty.edu

Resource Scan

Scan Details

Site Domain getty.edu
Base Domain getty.edu
Scan Status Ok
Last Scan2024-05-13T21:27:24+00:00
Next Scan 2024-06-12T21:27:24+00:00

Last Scan

Scanned2024-05-13T21:27:24+00:00
URL https://getty.edu/robots.txt
Redirect https://www.getty.edu/robots.txt
Redirect Domain www.getty.edu
Redirect Base getty.edu
Domain IPs 153.10.241.9
Redirect IPs 13.33.30.104, 13.33.30.18, 13.33.30.42, 13.33.30.44
Response IP 13.33.30.18
Found Yes
Hash 81e0f746bedb35facca14ee60db728f4d001a79e4da9429630bfe13e08433a3b
SimHash c0008d20e333

Groups

sortsite

Rule Path
Allow /

*

Rule Path
Disallow /cgi-bin

ahrefsbot

Rule Path
Disallow /

*

Rule Path
Disallow /slampoets

*

Rule Path
Disallow /research/collections/mirador/

yeti

Rule Path
Disallow /art/collection/

Comments

  • /robots.txt