calgary.ca
robots.txt

Robots Exclusion Standard data for calgary.ca

Resource Scan

Scan Details

Site Domain calgary.ca
Base Domain calgary.ca
Scan Status Ok
Last Scan2024-09-27T15:58:48+00:00
Next Scan 2024-10-27T15:58:48+00:00

Last Scan

Scanned2024-09-27T15:58:48+00:00
URL https://calgary.ca/robots.txt
Redirect https://www.calgary.ca/robots.txt
Redirect Domain www.calgary.ca
Redirect Base calgary.ca
Domain IPs 198.160.191.75
Redirect IPs 13.107.246.59, 2620:1ec:bdf::59
Response IP 13.107.246.59
Found Yes
Hash 2ab331116a89a242ecd11f12eb5465f9ba526b73fb9d717f2677d6fa373df2dd
SimHash 432bc5b58b96

Groups

*

Rule Path
Disallow /portal
Disallow /cca
Disallow /docgallery
Disallow /imageserver
Disallow /cweb
Disallow /proxy
Disallow /lists/
Disallow */scripts/
Disallow /*.WebFldr.aspx
Disallow /*.Upload.aspx
Disallow /*.EditForm.aspx
Disallow /*.DispForm.aspx
Disallow /*.WorkFlowTasks
Disallow /*.txt$
Disallow /ApplicationSettings_Demo
Disallow /ApplicationSettings
Disallow /~
Disallow /5286973
Disallow /pda/dba
Disallow /pda/lupp
Disallow /general/about-us
Disallow /general/patternlibrary
Disallow /vote2018
Disallow /311c3
Disallow /content/www/en/home/*
Disallow /search.html

Other Records

Field Value
sitemap https://www.calgary.ca/sitemap.xml