connect.colgate.edu
robots.txt

Robots Exclusion Standard data for connect.colgate.edu

Resource Scan

Scan Details

Site Domain connect.colgate.edu
Base Domain colgate.edu
Scan Status Ok
Last Scan 2024-11-08T11:49:59+00:00
Next Scan 2024-11-22T11:49:59+00:00

Last Scan

Scanned 2024-11-08T11:49:59+00:00
URL https://connect.colgate.edu/robots.txt
Domain IPs 34.198.122.35
Response IP 34.198.122.35
Found Yes
Hash 1d609720c091f83824bfdac3416b8265fa2bf0e06d2d3baed86b3184e2eec85c
SimHash e955d860c113

Groups

gsa-crawler

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

*

Rule Path
Disallow (empty path; nothing is disallowed)
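
To show how the three groups above behave in practice, here is a minimal Python sketch using the standard-library urllib.robotparser. The robots.txt text is an assumed reconstruction from the listed groups, since the scan reports only the file's hash and not its contents. The agent name ExampleBot and the helper build_parser are hypothetical and used only for illustration.

```python
from urllib.robotparser import RobotFileParser

# Assumed reconstruction of the file from the three groups listed above;
# the scan does not reproduce the actual robots.txt text.
ROBOTS_TXT = """\
User-agent: gsa-crawler
Disallow: /

User-agent: gptbot
Disallow: /

User-agent: *
Disallow:
"""


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text held in memory (no network request)."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)
URL = "https://connect.colgate.edu/"

# ExampleBot is a hypothetical crawler that matches only the "*" group.
for agent in ("gsa-crawler", "GPTBot", "ExampleBot"):
    print(agent, parser.can_fetch(agent, URL))
```

Under these assumptions, gsa-crawler and GPTBot are refused every path, while any other crawler falls through to the * group, whose empty Disallow value permits everything.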