weathercrave.ca
robots.txt

Robots Exclusion Standard data for weathercrave.ca

Resource Scan

Scan Details

Site Domain weathercrave.ca
Base Domain weathercrave.ca
Scan Status Ok
Last Scan 2024-11-15T03:45:54+00:00
Next Scan 2024-11-22T03:45:54+00:00

Last Scan

Scanned 2024-11-15T03:45:54+00:00
URL https://weathercrave.ca/robots.txt
Redirect https://www.weathercrave.ca/robots.txt
Redirect Domain www.weathercrave.ca
Redirect Base weathercrave.ca
Domain IPs 81.92.80.55, 81.92.80.56
Redirect IPs 23.50.90.81, 2600:1413:b000:791::31da, 2600:1413:b000:79f::31da
Response IP 23.50.90.81
Found Yes
Hash 321be9be2b8e8ed3958a4d4a0835d6f943867058871d976c95d87631900e83ba
SimHash 7d15c9b687be

Groups

*

Rule Path
Disallow /weather-forecast-search
Disallow /ajax/*
Disallow /get-shapes/*
Disallow /getvsc_*
Disallow /getviamichelin_*
Disallow /searchAjax
Disallow /ajaxInputValue
Disallow /getGeoloc
Disallow /get/hours-and-compare
Disallow /get/liveobs
Disallow /nearbyForecast
Disallow /get-media_diapo
Disallow /reporter/*
Disallow /js/redesign/carto/*
Disallow /common/recherche/getgeoipentite
Disallow /login
Disallow /test/*
Disallow /monitoring/*
Disallow /webhook/*
Disallow /preview/*
Disallow /launch-mobile-application
Disallow /forecast/*
Disallow *-1-janvier$
Disallow *-1-january$
Disallow *-1-enero$
Disallow *-1-gennaio$
Disallow /mc-srto.html
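
The rules above mix plain prefixes (/login), wildcards (/ajax/*) and end anchors (*-1-january$). As a rough illustration of how a crawler might evaluate them, the sketch below translates each pattern into a regular expression using the common conventions: "*" matches any run of characters, a trailing "$" anchors the end of the path, and everything else is a prefix match. The names DISALLOW, pattern_to_regex and is_disallowed are hypothetical, introduced only for this example. Treating a pattern that starts with "*" rather than "/" as a leading wildcard is an assumption; individual crawlers may handle such patterns differently.

```python
import re

# Disallow patterns copied from the "*" group listed above.
DISALLOW = [
    "/weather-forecast-search", "/ajax/*", "/get-shapes/*", "/getvsc_*",
    "/getviamichelin_*", "/searchAjax", "/ajaxInputValue", "/getGeoloc",
    "/get/hours-and-compare", "/get/liveobs", "/nearbyForecast",
    "/get-media_diapo", "/reporter/*", "/js/redesign/carto/*",
    "/common/recherche/getgeoipentite", "/login", "/test/*",
    "/monitoring/*", "/webhook/*", "/preview/*",
    "/launch-mobile-application", "/forecast/*",
    "*-1-janvier$", "*-1-january$", "*-1-enero$", "*-1-gennaio$",
    "/mc-srto.html",
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate one robots.txt path pattern into a compiled regex.

    Assumed semantics: '*' matches any sequence of characters and a
    trailing '$' anchors the end of the path; otherwise the pattern is
    a prefix match.
    """
    anchored = pattern.endswith("$")
    body = pattern[:-1] if anchored else pattern
    # Escape the literal pieces and rejoin them with '.*' for each '*'.
    regex = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile(regex + (r"\Z" if anchored else ""))


def is_disallowed(path: str) -> bool:
    """Return True if any Disallow pattern matches the start of `path`.

    The group has no Allow rules, so no longest-match tie-breaking
    between Allow and Disallow is needed here.
    """
    return any(pattern_to_regex(p).match(path) for p in DISALLOW)
```

Under these assumptions, is_disallowed("/ajax/search") and is_disallowed("/city-1-january") are true, while is_disallowed("/") and is_disallowed("/city-1-january/details") are false, because the "$" anchor stops the last pattern from matching longer paths.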

Other Records

Field Value
sitemap https://www.weathercrave.ca/sitemaps/www-en-ca/sitemap-index.xml