niagaracollegetoronto.ca
robots.txt

Robots Exclusion Standard data for niagaracollegetoronto.ca

Resource Scan

Scan Details

Site Domain niagaracollegetoronto.ca
Base Domain niagaracollegetoronto.ca
Scan Status Ok
Last Scan 2024-11-04T06:54:42+00:00
Next Scan 2024-12-04T06:54:42+00:00

Last Scan

Scanned 2024-11-04T06:54:42+00:00
URL https://www.niagaracollegetoronto.ca/robots.txt
Domain IPs 13.107.246.59, 2620:1ec:bdf::59
Response IP 13.107.246.59
Found Yes
Hash 8723f492729f8db9a93754aa64c62516603c83807b24a986934d486626d25d81
SimHash f9081c40ce84

Groups

*

Rule Path
Disallow /Assets/
Disallow /App_Browsers/
Disallow /App_Code/
Disallow /App_Data/
Disallow /aspnet_client/
Disallow /bin/
Disallow /config/
Disallow /data/
Disallow /install/
Disallow /macroScripts/
Disallow /masterpages/
Disallow /umbraco/
Disallow /umbraco_client/
Disallow /usercontrols/
Disallow /Views/
Disallow /xslt/
Disallow /*.axd

Other Records

Field Value
sitemap https://www.niagaracollegetoronto.ca/sitemap.xml
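
The rules recorded above can be checked against candidate paths with Python's standard-library `urllib.robotparser`. The following is a minimal sketch, not the scanner's own method: the names `DISALLOWED`, `BASE` and `is_allowed` are hypothetical, the robots.txt text is rebuilt from the table rather than fetched, and the `/*.axd` rule is left out on the assumption that this parser matches plain path prefixes and does not expand wildcards.

```python
from urllib.robotparser import RobotFileParser

# Disallow paths copied from the "*" group in the scan above.
# The wildcard rule /*.axd is omitted: urllib.robotparser is assumed
# to do simple prefix matching, so it would not be interpreted as a pattern.
DISALLOWED = [
    "/Assets/", "/App_Browsers/", "/App_Code/", "/App_Data/",
    "/aspnet_client/", "/bin/", "/config/", "/data/", "/install/",
    "/macroScripts/", "/masterpages/", "/umbraco/", "/umbraco_client/",
    "/usercontrols/", "/Views/", "/xslt/",
]
SITEMAP = "https://www.niagaracollegetoronto.ca/sitemap.xml"
BASE = "https://www.niagaracollegetoronto.ca"  # host taken from the scanned URL

# Rebuild the file line by line instead of fetching it over the network.
lines = ["User-agent: *"]
lines += [f"Disallow: {path}" for path in DISALLOWED]
lines.append(f"Sitemap: {SITEMAP}")

parser = RobotFileParser()
parser.parse(lines)


def is_allowed(path, agent="*"):
    """Return True if `agent` may fetch `path` under the rules above."""
    return parser.can_fetch(agent, BASE + path)


if __name__ == "__main__":
    # /programs/ is a made-up path used only to show an unrestricted URL.
    for path in ("/umbraco/login", "/bin/", "/programs/"):
        print(path, is_allowed(path))
    print(parser.site_maps())  # requires Python 3.8 or later
```

Prefix matching here is case-sensitive, so `/Assets/` and `/assets/` are treated as different paths.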