theglobeandmail.com
robots.txt
Robots Exclusion Standard data for theglobeandmail.com
Resource Scan
Scan Details
Site Domain | theglobeandmail.com |
Base Domain | theglobeandmail.com |
Scan Status | Ok |
Last Scan | 2024-06-25T08:57:38+00:00 |
Next Scan | 2024-07-02T08:57:38+00:00 |
Last Scan
Scanned | 2024-06-25T08:57:38+00:00 |
URL | https://theglobeandmail.com/robots.txt |
Redirect | https://www.theglobeandmail.com/robots.txt |
Redirect Domain | www.theglobeandmail.com |
Redirect Base | theglobeandmail.com |
Domain IPs | 199.198.138.250 |
Redirect IPs | 23.47.190.74, 23.47.190.9, 2600:1413:a000::17d2:fa91, 2600:1413:a000::17d2:fab8 |
Response IP | 42.99.140.146 |
Found | Yes |
Hash | ba72f5ef7d92a7063e647da0be7462b9c32f885f95d94f72443cdf9ae77d02ce |
SimHash | 743c595ae481 |
Groups
googlebot-news
Rule | Path |
---|---|
Disallow | /feeds/ |
Disallow | /incoming/ |
Disallow | /test/ |
Disallow | /partners/ |
Disallow | /search/ |
Disallow | /business/adv/appointmentnotices/search/ |
adsbot-google
Rule | Path |
---|---|
Disallow | /feeds/ |
Disallow | /incoming/ |
Disallow | /test/ |
Disallow | /search/ |
Disallow | /business/adv/appointmentnotices/search/ |
*
Rule | Path |
---|---|
Disallow | /feeds/ |
Disallow | /incoming/ |
Disallow | /test/ |
Disallow | /search/ |
Disallow | /business/adv/appointmentnotices/search/ |
Disallow | /marketing-containers/ |
Disallow | /coupons/ |
Other Records
Field | Value |
---|---|
sitemap | https://www.theglobeandmail.com/sitemap.xml?outputType=xml |