cyberpresse.ca
robots.txt

Robots Exclusion Standard data for cyberpresse.ca

Resource Scan

Scan Details

Site Domain cyberpresse.ca
Base Domain cyberpresse.ca
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2024-11-02T09:24:35+00:00
Next Scan 2024-11-09T09:24:35+00:00

Last Successful Scan

Scanned2023-09-12T15:40:01+00:00
URL https://www.cyberpresse.ca/robots.txt
Redirect https://www.lapresse.ca:443/robots.txt
Redirect Domain www.lapresse.ca
Redirect Base lapresse.ca
Domain IPs 107.21.131.88, 18.215.27.81, 54.234.227.182
Redirect IPs 13.32.121.102, 13.32.121.109, 13.32.121.26, 13.32.121.92
Response IP 52.84.45.108
Found Yes
Hash 0890afaa2b9db04c107571abb51f5fef15b01205ca9fe572a6a28e6367b878b5
SimHash 082940488192

Groups

*

Rule Path
Disallow /_previsualisation/
Disallow /includes/boite/1/
Disallow /includes/boite/2/
Disallow /includes/boite/3/
Disallow /multimedias/top-10-des-noms-de-familles-quebecois/
Disallow /multimedias/carte-des-milieux-humides/
Disallow /xtra/sante-et-mieux-etre/
Disallow /mon-compte
Disallow /webparts
Disallow /meteo
Disallow /recherche

Other Records

Field Value
sitemap https://www.lapresse.ca/sitemapindex.xml
sitemap https://www.lapresse.ca/newsSitemap.xml