gcisd.net
robots.txt
Robots Exclusion Standard data for gcisd.net
Resource Scan
Scan Details
Site Domain | gcisd.net |
Base Domain | gcisd.net |
Scan Status | Failed |
Failure Stage | Fetching resource. |
Failure Reason | Couldn't connect to server. |
Last Scan | 2024-02-23T00:30:26+00:00 |
Next Scan | 2024-05-23T00:30:26+00:00 |
Last Successful Scan
Scanned | 2023-01-02T06:45:44+00:00 |
URL | https://gcisd.net/robots.txt |
Redirect | https://www.gcisd.net/robots.txt/Default.aspx |
Redirect Domain | www.gcisd.net |
Redirect Base | gcisd.net |
Domain IPs | 162.159.136.49 |
Redirect IPs | 104.18.12.114, 104.18.13.114, 2606:4700::6812:c72, 2606:4700::6812:d72 |
Response IP | 104.18.13.114 |
Found | Yes |
Hash | ca9b915c2c20107b22e53580465fdeb957aa13524446bd4d9f24bdbe4ac560d0 |
SimHash | 8001575009d1 |
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | /Search/ |
Disallow | /WebApi/ |
Disallow | /WebServices/ |
Disallow | /portal/svc/ |
Disallow | /Common/controls/StaffDirectory/ws/StaffDirectoryWS.asmx/ |
Disallow | /Common/controls/WorkspaceCalendar/ws/WorkspaceCalendarWS.asmx/ |
Disallow | /common/controls/General/CalendarPicker/CalendarPickerWS.asmx/ |
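As a rough illustration of how a crawler might apply the rules listed above, the following sketch rebuilds them as robots.txt text and checks a few URLs with Python's standard `urllib.robotparser`. The reconstructed file text, the `ExampleBot` user agent, and the sample URLs are assumptions made for the example; the report shows only the parsed rules, not the original file body.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the rule table in this report. The exact layout of the
# original robots.txt is not shown here, so this text is an assumption.
ROBOTS_TXT = """\
User-agent: *
Disallow: /Search/
Disallow: /WebApi/
Disallow: /WebServices/
Disallow: /portal/svc/
Disallow: /Common/controls/StaffDirectory/ws/StaffDirectoryWS.asmx/
Disallow: /Common/controls/WorkspaceCalendar/ws/WorkspaceCalendarWS.asmx/
Disallow: /common/controls/General/CalendarPicker/CalendarPickerWS.asmx/
"""


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text that is already in memory (no network access)."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)

# "ExampleBot" and the URLs below are hypothetical; they only exercise the rules.
for url in (
    "https://www.gcisd.net/Search/results",
    "https://www.gcisd.net/WebApi/items",
    "https://www.gcisd.net/about",
):
    verdict = "allowed" if parser.can_fetch("ExampleBot", url) else "disallowed"
    print(f"{url}: {verdict}")
```

Rule paths are matched as case-sensitive prefixes by this parser, which may be why the list contains both `/Common/...` and `/common/...` entries; that reading is a guess, not something the report states.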