lapresse.ca
robots.txt

Robots Exclusion Standard data for lapresse.ca

Resource Scan

Scan Details

Site Domain lapresse.ca
Base Domain lapresse.ca
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a client error.
Last Scan 2025-10-10T19:30:57+00:00
Next Scan 2025-12-09T19:30:57+00:00

Last Successful Scan

Scanned 2024-01-19T14:30:01+00:00
URL https://lapresse.ca/robots.txt
Redirect https://www.lapresse.ca:443/robots.txt
Redirect Domain www.lapresse.ca
Redirect Base lapresse.ca
Domain IPs 3.220.9.20, 3.230.137.180, 52.3.49.176
Redirect IPs 52.84.45.108, 52.84.45.32, 52.84.45.47, 52.84.45.86
Response IP 108.156.2.51
Found Yes
Hash 825e2a8d64212d2f52e26ad271ea017d157bedca93b95e71e49315ed71d27e50
SimHash 08214050e193

Groups

*

Rule Path
Disallow /_previsualisation/
Disallow /includes/boite/1/
Disallow /includes/boite/2/
Disallow /includes/boite/3/
Disallow /multimedias/top-10-des-noms-de-familles-quebecois/
Disallow /multimedias/carte-des-milieux-humides/
Disallow /xtra/sante-et-mieux-etre/
Disallow /mon-compte
Disallow /webparts
Disallow /meteo
Disallow /recherche
Disallow /*/embed$
Disallow /*/embed/$
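
The last two rules use the `*` wildcard and the `$` end-of-path anchor, so they behave differently from the plain prefix rules above them. The following is a minimal sketch of how a crawler might evaluate these paths, assuming the common wildcard semantics (`*` matches any run of characters, a trailing `$` anchors the end of the path, everything else is a prefix match). The names `DISALLOW_RULES`, `rule_to_regex`, and `is_disallowed` are illustrative and not part of the scan data. Because the group lists no Allow rules, any single match is treated as disallowed.

```python
import re

# Disallow paths copied from the "*" group above.
DISALLOW_RULES = [
    "/_previsualisation/",
    "/includes/boite/1/",
    "/includes/boite/2/",
    "/includes/boite/3/",
    "/multimedias/top-10-des-noms-de-familles-quebecois/",
    "/multimedias/carte-des-milieux-humides/",
    "/xtra/sante-et-mieux-etre/",
    "/mon-compte",
    "/webparts",
    "/meteo",
    "/recherche",
    "/*/embed$",
    "/*/embed/$",
]


def rule_to_regex(rule: str) -> re.Pattern:
    """Translate one rule path into a regex (assumed wildcard semantics)."""
    anchored = rule.endswith("$")
    body = rule[:-1] if anchored else rule
    # Escape literal segments and join them with ".*" for each "*".
    pattern = ".*".join(re.escape(part) for part in body.split("*"))
    # r"\Z" anchors at the very end of the path; without it the rule is a prefix.
    return re.compile(pattern + (r"\Z" if anchored else ""))


def is_disallowed(path: str) -> bool:
    """Return True if any Disallow rule matches the start of the path."""
    return any(rule_to_regex(rule).match(path) for rule in DISALLOW_RULES)


if __name__ == "__main__":
    for sample in ["/meteo/montreal", "/actualites/video/embed", "/actualites/"]:
        print(sample, is_disallowed(sample))
```

Under these assumptions, `/*/embed$` blocks only paths that end exactly in `/embed` below at least one other path segment, while a plain rule such as `/meteo` blocks every path that begins with that string.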

Other Records

Field Value
sitemap https://www.lapresse.ca/sitemapindex.xml
sitemap https://www.lapresse.ca/newsSitemap.xml
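
For reference, the parsed records above can be rendered back into robots.txt syntax. This is only a sketch: the helper name `render_robots_txt` is hypothetical, and the original file's exact bytes, comments, blank lines, and directive order are not recoverable from this listing, so the output would not be expected to reproduce the hash shown above.

```python
def render_robots_txt(user_agent, disallow_paths, sitemaps):
    """Render one group plus sitemap records as robots.txt text (a sketch)."""
    lines = [f"User-agent: {user_agent}"]
    lines += [f"Disallow: {path}" for path in disallow_paths]
    lines.append("")  # blank line between the group and the sitemap records
    lines += [f"Sitemap: {url}" for url in sitemaps]
    return "\n".join(lines) + "\n"


# Values copied from the Groups and Other Records sections of this report.
ROBOTS_TXT = render_robots_txt(
    "*",
    [
        "/_previsualisation/",
        "/includes/boite/1/",
        "/includes/boite/2/",
        "/includes/boite/3/",
        "/multimedias/top-10-des-noms-de-familles-quebecois/",
        "/multimedias/carte-des-milieux-humides/",
        "/xtra/sante-et-mieux-etre/",
        "/mon-compte",
        "/webparts",
        "/meteo",
        "/recherche",
        "/*/embed$",
        "/*/embed/$",
    ],
    [
        "https://www.lapresse.ca/sitemapindex.xml",
        "https://www.lapresse.ca/newsSitemap.xml",
    ],
)

if __name__ == "__main__":
    print(ROBOTS_TXT, end="")
```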