gcisd.net
robots.txt

Robots Exclusion Standard data for gcisd.net

Resource Scan

Scan Details

Site Domain gcisd.net
Base Domain gcisd.net
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Couldn't connect to server.
Last Scan 2024-02-23T00:30:26+00:00
Next Scan 2024-05-23T00:30:26+00:00

Last Successful Scan

Scanned 2023-01-02T06:45:44+00:00
URL https://gcisd.net/robots.txt
Redirect https://www.gcisd.net/robots.txt/Default.aspx
Redirect Domain www.gcisd.net
Redirect Base gcisd.net
Domain IPs 162.159.136.49
Redirect IPs 104.18.12.114, 104.18.13.114, 2606:4700::6812:c72, 2606:4700::6812:d72
Response IP 104.18.13.114
Found Yes
Hash ca9b915c2c20107b22e53580465fdeb957aa13524446bd4d9f24bdbe4ac560d0
SimHash 8001575009d1

Groups

*

Rule Path
Disallow /Search/
Disallow /WebApi/
Disallow /WebServices/
Disallow /portal/svc/
Disallow /Common/controls/StaffDirectory/ws/StaffDirectoryWS.asmx/
Disallow /Common/controls/WorkspaceCalendar/ws/WorkspaceCalendarWS.asmx/
Disallow /common/controls/General/CalendarPicker/CalendarPickerWS.asmx/
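
To illustrate how a crawler might apply the rules listed above, here is a minimal sketch using Python's standard urllib.robotparser. The robots.txt text is rebuilt from the group and rule list in this report and is an assumption: the real file may differ in ordering, comments, or whitespace. The helper names (build_parser, is_allowed), the user agent "ExampleBot", and the example URLs are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Assumption: robots.txt text rebuilt from the "*" group and the rules listed
# in this report; the file actually served by the site may differ.
ROBOTS_TXT = """\
User-agent: *
Disallow: /Search/
Disallow: /WebApi/
Disallow: /WebServices/
Disallow: /portal/svc/
Disallow: /Common/controls/StaffDirectory/ws/StaffDirectoryWS.asmx/
Disallow: /Common/controls/WorkspaceCalendar/ws/WorkspaceCalendarWS.asmx/
Disallow: /common/controls/General/CalendarPicker/CalendarPickerWS.asmx/
"""


def build_parser(text: str = ROBOTS_TXT) -> RobotFileParser:
    """Parse robots.txt text without making any network request."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


def is_allowed(url: str, user_agent: str = "ExampleBot") -> bool:
    """Return True if the rules above permit user_agent to fetch url."""
    return build_parser().can_fetch(user_agent, url)


if __name__ == "__main__":
    # Hypothetical URLs, chosen only to exercise the rules.
    for url in (
        "https://www.gcisd.net/Search/results",
        "https://www.gcisd.net/search/results",
        "https://www.gcisd.net/about",
    ):
        print(url, "allowed" if is_allowed(url) else "disallowed")
```

The standard-library parser matches rule paths as case-sensitive prefixes, so in this sketch /Search/results is disallowed while /search/results is not. The same applies to the mixed /Common/ and /common/ prefixes in the rule list.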

Comments

  • Global Level