calgaryherald.com
robots.txt
Robots Exclusion Standard data for calgaryherald.com
Resource Scan
Scan Details
Site Domain | calgaryherald.com |
Base Domain | calgaryherald.com |
Scan Status | Ok |
Last Scan | 2024-11-13T12:49:46+00:00 |
Next Scan | 2024-11-20T12:49:46+00:00 |
Last Scan
Scanned | 2024-11-13T12:49:46+00:00 |
URL | https://calgaryherald.com/robots.txt |
Domain IPs | 34.117.147.204 |
Response IP | 34.117.147.204 |
Found | Yes |
Hash | b84182958a376366fca05f0ae047cd31f5e297ff1e5d0b8e13ebc08c76d0a1f9 |
SimHash | d8095a960306 |
Groups
*
Rule | Path |
---|---|
Disallow | /wp-admin/ |
Disallow | /*? |
Allow | /wp-admin/admin-ajax.php |
Allow | /sitemap-story.xml? |
Allow | /sitemap-wired-video.xml? |
Allow | /sitemap-exclusive-video.xml? |
Allow | /*?r |
Allow | /*?amu |
Other Records
Field | Value |
---|---|
sitemap | https://calgaryherald.com/sitemap.xml |
sitemap | https://calgaryherald.com/sitemap-news.xml |
sitemap | https://calgaryherald.com/sitemap-category.xml |
sitemap | https://calgaryherald.com/sitemap-video.xml |
sitemap | https://calgaryherald.com/sitemap-old.xml |
sitemap | https://calgaryherald.com/sitemap-static.xml |