Robots Exclusion Standard data for cyberpresse.ca
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | cyberpresse.ca |
Base Domain | cyberpresse.ca |
Scan Status | Failed |
Failure Stage | Fetching resource |
Failure Reason | Server returned a client error (HTTP 4xx) |
Last Scan | 2024-11-02T09:24:35+00:00 |
Next Scan | 2024-11-09T09:24:35+00:00 |
Last Successful Scan
Field | Value |
---|---|
Scanned | 2023-09-12T15:40:01+00:00 |
URL | https://www.cyberpresse.ca/robots.txt |
Redirect | https://www.lapresse.ca:443/robots.txt |
Redirect Domain | www.lapresse.ca |
Redirect Base | lapresse.ca |
Domain IPs | 107.21.131.88, 18.215.27.81, 54.234.227.182 |
Redirect IPs | 13.32.121.102, 13.32.121.109, 13.32.121.26, 13.32.121.92 |
Response IP | 52.84.45.108 |
Found | Yes |
Hash | 0890afaa2b9db04c107571abb51f5fef15b01205ca9fe572a6a28e6367b878b5 |
SimHash | 082940488192 |
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | /_previsualisation/ |
Disallow | /includes/boite/1/ |
Disallow | /includes/boite/2/ |
Disallow | /includes/boite/3/ |
Disallow | /multimedias/top-10-des-noms-de-familles-quebecois/ |
Disallow | /multimedias/carte-des-milieux-humides/ |
Disallow | /xtra/sante-et-mieux-etre/ |
Disallow | /mon-compte |
Disallow | /webparts |
Disallow | /meteo |
Disallow | /recherche |
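The rules above can be checked against sample paths with Python's standard `urllib.robotparser`. This is a minimal sketch: the `ROBOTS_TXT` text is rebuilt from the table rather than copied from the fetched file, so its exact formatting is an assumption, and the `ExampleBot` agent name, the `is_allowed` helper, and the sample paths are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Rebuilt from the Groups table; the original file's exact layout
# (comments, blank lines, rule order) is an assumption.
ROBOTS_TXT = """\
User-agent: *
Disallow: /_previsualisation/
Disallow: /includes/boite/1/
Disallow: /includes/boite/2/
Disallow: /includes/boite/3/
Disallow: /multimedias/top-10-des-noms-de-familles-quebecois/
Disallow: /multimedias/carte-des-milieux-humides/
Disallow: /xtra/sante-et-mieux-etre/
Disallow: /mon-compte
Disallow: /webparts
Disallow: /meteo
Disallow: /recherche
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def is_allowed(path, agent="ExampleBot"):
    """Return True if the hypothetical agent may fetch the given path."""
    # The rules were served from the redirect target, www.lapresse.ca.
    return parser.can_fetch(agent, "https://www.lapresse.ca" + path)


# Hypothetical sample paths, chosen to show prefix matching.
for path in ["/meteo", "/meteo/montreal", "/includes/boite/4/", "/actualites/"]:
    print(path, is_allowed(path))
```

Disallow rules match by path prefix, so `/meteo` also blocks `/meteo/montreal`, while `/includes/boite/4/` stays allowed because only boxes 1 to 3 are listed.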
Other Records
Field | Value |
---|---|
sitemap | https://www.lapresse.ca/sitemapindex.xml |
sitemap | https://www.lapresse.ca/newsSitemap.xml |
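
The sitemap records can be read back with the same standard-library parser. A minimal sketch, assuming Python 3.8 or later (where `RobotFileParser.site_maps()` is available) and assuming the records appeared in the file as ordinary `Sitemap:` lines; the `SITEMAP_LINES` name is hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Assumed form of the two sitemap records as they would appear in robots.txt.
SITEMAP_LINES = [
    "Sitemap: https://www.lapresse.ca/sitemapindex.xml",
    "Sitemap: https://www.lapresse.ca/newsSitemap.xml",
]

parser = RobotFileParser()
parser.parse(SITEMAP_LINES)

# site_maps() returns None when no Sitemap lines were found.
sitemaps = parser.site_maps() or []
for url in sitemaps:
    print(url)
```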