artic.edu
robots.txt

Robots Exclusion Standard data for artic.edu

Resource Scan

Scan Details

Site Domain artic.edu
Base Domain artic.edu
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2024-09-08T04:14:59+00:00
Next Scan 2024-12-07T04:14:59+00:00

Last Successful Scan

Scanned2024-04-19T03:31:50+00:00
URL https://artic.edu/robots.txt
Redirect https://www.artic.edu/robots.txt
Redirect Domain www.artic.edu
Redirect Base artic.edu
Domain IPs 198.40.30.252
Redirect IPs 13.224.163.11, 13.224.163.15, 13.224.163.34, 13.224.163.42, 2600:9000:2668:1800:1e:c9e4:2300:93a1, 2600:9000:2668:1e00:1e:c9e4:2300:93a1, 2600:9000:2668:2600:1e:c9e4:2300:93a1, 2600:9000:2668:6a00:1e:c9e4:2300:93a1, 2600:9000:2668:a400:1e:c9e4:2300:93a1, 2600:9000:2668:c800:1e:c9e4:2300:93a1, 2600:9000:2668:e400:1e:c9e4:2300:93a1, 2600:9000:2668:ee00:1e:c9e4:2300:93a1
Response IP 18.155.68.56
Found Yes
Hash 7a13145040e283dcdb70530700a308bb8aef897fb8a04991d331699e88d48c1c
SimHash a4115004ebf1

Groups

*

Rule Path
Disallow /press/exhibition-press-room
Disallow /press/art-institute-images
Disallow /authors/57/james-rondeau