wur.nl
robots.txt

Robots Exclusion Standard data for wur.nl

Resource Scan

Scan Details

Site Domain wur.nl
Base Domain wur.nl
Scan Status Ok
Last Scan2024-11-08T15:52:25+00:00
Next Scan 2024-11-22T15:52:25+00:00

Last Scan

Scanned2024-11-08T15:52:25+00:00
URL https://wur.nl/robots.txt
Redirect https://www.wur.nl/robots.txt
Redirect Domain www.wur.nl
Redirect Base wur.nl
Domain IPs 137.224.9.30
Redirect IPs 2600:1413:b000:6::17d5:2bc9, 2600:1413:b000:6::17d5:2bd9, 96.17.96.17, 96.17.96.8
Response IP 23.44.5.40
Found Yes
Hash 2415e3153c0ce7812c303552a437d61d7816ee4898a7cf4d29f46b4e4bf24db5
SimHash ac09a832eb91

Groups

wbsearchbot

Product Comment
wbsearchbot zie http://www.warebay.com/bot.html en SR112667
Rule Path
Disallow /

*

Rule Path
Disallow *presentationid%3D*
Disallow *interestarea%3D*
Disallow *category%3D*
Disallow *courseformat%3D*
Disallow *credits%3D*
Disallow *coursestartsimmediately%3D*
Disallow *courselanguage%3D*
Disallow *coursegroupsize%3D*
Disallow *coursepricerange%3D*
Disallow *scholarship%3D*
Disallow */zoekresultaten.htm*
Disallow */search-results.htm*
Disallow */suchergebnisse.htm*

Other Records

Field Value
crawl-delay 5

Other Records

Field Value
sitemap https://www.wur.nl/sitemap.xml.gz

Comments

  • Disallow URL parameters
  • Disallow internal search URLs