weathercrave.co.uk
robots.txt

Robots Exclusion Standard data for weathercrave.co.uk

Resource Scan

Scan Details

Site Domain weathercrave.co.uk
Base Domain weathercrave.co.uk
Scan Status Ok
Last Scan2024-06-22T03:36:15+00:00
Next Scan 2024-06-29T03:36:15+00:00

Last Scan

Scanned2024-06-22T03:36:15+00:00
URL https://weathercrave.co.uk/robots.txt
Redirect https://www.weathercrave.co.uk/robots.txt
Redirect Domain www.weathercrave.co.uk
Redirect Base weathercrave.co.uk
Domain IPs 81.92.80.55, 81.92.80.56
Redirect IPs 104.69.165.115, 2600:1417:3f:b86::31da, 2600:1417:3f:ba8::31da
Response IP 104.76.128.20
Found Yes
Hash be39572e2e7e86e76026872fa1966eb7dbdeb0bfc76fa130f5d67a91e085ee37
SimHash 5c1448b687be

Groups

*

Rule Path
Disallow /weather-forecast-search
Disallow /ajax/*
Disallow /get-shapes/*
Disallow /getvsc_*
Disallow /getviamichelin_*
Disallow /searchAjax
Disallow /ajaxInputValue
Disallow /getGeoloc
Disallow /get/hours-and-compare
Disallow /get/liveobs
Disallow /nearbyForecast
Disallow /get-media_diapo
Disallow /reporter/*
Disallow /js/redesign/carto/*
Disallow /common/recherche/getgeoipentite
Disallow /login
Disallow /test/*
Disallow /monitoring/*
Disallow /webhook/*
Disallow /preview/*
Disallow /launch-mobile-application
Disallow /forecast/*
Disallow *-1-janvier$
Disallow *-1-january$
Disallow *-1-enero$
Disallow *-1-gennaio$
Disallow /mc-srto.html

Other Records

Field Value
sitemap https://www.weathercrave.co.uk/sitemaps/www-en-gb/sitemap-index.xml