weathercrave.com
robots.txt

Robots Exclusion Standard data for weathercrave.com

Resource Scan

Scan Details

Site Domain weathercrave.com
Base Domain weathercrave.com
Scan Status Ok
Last Scan2024-11-08T17:56:05+00:00
Next Scan 2024-11-15T17:56:05+00:00

Last Scan

Scanned2024-11-08T17:56:05+00:00
URL https://weathercrave.com/robots.txt
Redirect https://www.weathercrave.com/robots.txt
Redirect Domain www.weathercrave.com
Redirect Base weathercrave.com
Domain IPs 81.92.80.55, 81.92.80.56
Redirect IPs 23.50.90.81, 2600:1413:b000:681::31da, 2600:1413:b000:68f::31da
Response IP 23.50.90.81
Found Yes
Hash 7dd36d8d5965633a00adbb43e410d0cbbbbd4655854bc865508602623d204607
SimHash 7c15c9b687be

Groups

*

Rule Path
Disallow /weather-forecast-search
Disallow /ajax/*
Disallow /get-shapes/*
Disallow /getvsc_*
Disallow /getviamichelin_*
Disallow /searchAjax
Disallow /ajaxInputValue
Disallow /getGeoloc
Disallow /get/hours-and-compare
Disallow /get/liveobs
Disallow /nearbyForecast
Disallow /get-media_diapo
Disallow /reporter/*
Disallow /js/redesign/carto/*
Disallow /common/recherche/getgeoipentite
Disallow /login
Disallow /test/*
Disallow /monitoring/*
Disallow /webhook/*
Disallow /preview/*
Disallow /launch-mobile-application
Disallow /forecast/*
Disallow *-1-janvier$
Disallow *-1-january$
Disallow *-1-enero$
Disallow *-1-gennaio$
Disallow /mc-srto.html

Other Records

Field Value
sitemap https://www.weathercrave.com/sitemaps/www-en-us/sitemap-index.xml