sacredheart.edu
robots.txt

Robots Exclusion Standard data for sacredheart.edu

Resource Scan

Scan Details

Site Domain sacredheart.edu
Base Domain sacredheart.edu
Scan Status Failed
Failure StageFetching resource.
Failure ReasonCouldn't establish SSL connection.
Last Scan2024-09-10T17:16:57+00:00
Next Scan 2024-12-09T17:16:57+00:00

Last Successful Scan

Scanned2024-04-21T05:37:12+00:00
URL https://sacredheart.edu/robots.txt
Redirect https://www.sacredheart.edu/robots.txt
Redirect Domain www.sacredheart.edu
Redirect Base sacredheart.edu
Domain IPs 15.197.230.140, 3.33.207.254
Redirect IPs 107.22.232.134, 52.200.164.208
Response IP 107.22.232.134
Found Yes
Hash 02f75497055f8df12fca3e620d1713b93e112a57dd2090ab3c7def20532d1d33
SimHash 6010d9104775

Groups

googlebot

Rule Path
Allow /

Other Records

Field Value
crawl-delay 2

googlebot-image

Rule Path
Allow /

Other Records

Field Value
crawl-delay 2

duckduckbot

Rule Path
Allow /

Other Records

Field Value
crawl-delay 2

bingbot

Rule Path
Allow /

Other Records

Field Value
crawl-delay 2

msnbot

Rule Path
Allow /

Other Records

Field Value
crawl-delay 2

slurp

Rule Path
Allow /

Other Records

Field Value
crawl-delay 2

rogerbot

Rule Path
Allow /

Other Records

Field Value
crawl-delay 2

dubbotbot/0.2

Rule Path
Allow /

Other Records

Field Value
crawl-delay 2

*

Rule Path
Disallow /