globeandmail.com
robots.txt
Robots Exclusion Standard data for globeandmail.com
Resource Scan
Scan Details
Site Domain | globeandmail.com |
Base Domain | globeandmail.com |
Scan Status | Ok |
Last Scan | 2024-11-16T00:10:15+00:00 |
Next Scan | 2024-11-23T00:10:15+00:00 |
Last Scan
Scanned | 2024-11-16T00:10:15+00:00 |
URL | https://globeandmail.com/robots.txt |
Redirect | https://www.theglobeandmail.com/robots.txt |
Redirect Domain | www.theglobeandmail.com |
Redirect Base | theglobeandmail.com |
Domain IPs | 3.164.85.117, 3.164.85.37, 3.164.85.9, 3.164.85.98 |
Redirect IPs | 23.209.46.87, 23.209.46.97, 2600:1413:b000:13::b857:c186, 2600:1413:b000:13::b857:c18e |
Response IP | 23.52.171.145 |
Found | Yes |
Hash | e22bc7b0251ffacac1d35a022809d6028cdd28e5ad2274b32df056eae227bb68 |
SimHash | 743e5956e081 |
Groups
googlebot-news
Rule | Path |
---|---|
Disallow | /feeds/ |
Disallow | /incoming/ |
Disallow | /test/ |
Disallow | /partners/ |
Disallow | /search/ |
Disallow | /business/adv/appointmentnotices/search/ |
adsbot-google
Rule | Path |
---|---|
Disallow | /feeds/ |
Disallow | /incoming/ |
Disallow | /test/ |
Disallow | /search/ |
Disallow | /business/adv/appointmentnotices/search/ |
*
Rule | Path |
---|---|
Disallow | /feeds/ |
Disallow | /incoming/ |
Disallow | /test/ |
Disallow | /search/ |
Disallow | /business/adv/appointmentnotices/search/ |
Disallow | /marketing-containers/ |
Disallow | /coupons/ |
Disallow | /files/advertising/ |
Other Records
Field | Value |
---|---|
sitemap | https://www.theglobeandmail.com/sitemap.xml?outputType=xml |