wur.nl
robots.txt
Robots Exclusion Standard data for wur.nl
Resource Scan
Scan Details
Site Domain | wur.nl |
Base Domain | wur.nl |
Scan Status | Ok |
Last Scan | 2024-11-08T15:52:25+00:00 |
Next Scan | 2024-11-22T15:52:25+00:00 |
Last Scan
Scanned | 2024-11-08T15:52:25+00:00 |
URL | https://wur.nl/robots.txt |
Redirect | https://www.wur.nl/robots.txt |
Redirect Domain | www.wur.nl |
Redirect Base | wur.nl |
Domain IPs | 137.224.9.30 |
Redirect IPs | 2600:1413:b000:6::17d5:2bc9, 2600:1413:b000:6::17d5:2bd9, 96.17.96.17, 96.17.96.8 |
Response IP | 23.44.5.40 |
Found | Yes |
Hash | 2415e3153c0ce7812c303552a437d61d7816ee4898a7cf4d29f46b4e4bf24db5 |
SimHash | ac09a832eb91 |
Groups
wbsearchbot
Product | Comment |
---|---|
wbsearchbot | zie http://www.warebay.com/bot.html en SR112667 |
Rule | Path |
---|---|
Disallow | / |
*
Rule | Path |
---|---|
Disallow | *presentationid%3D* |
Disallow | *interestarea%3D* |
Disallow | *category%3D* |
Disallow | *courseformat%3D* |
Disallow | *credits%3D* |
Disallow | *coursestartsimmediately%3D* |
Disallow | *courselanguage%3D* |
Disallow | *coursegroupsize%3D* |
Disallow | *coursepricerange%3D* |
Disallow | *scholarship%3D* |
Disallow | */zoekresultaten.htm* |
Disallow | */search-results.htm* |
Disallow | */suchergebnisse.htm* |
Other Records
Field | Value |
---|---|
crawl-delay | 5 |
Other Records
Field | Value |
---|---|
sitemap | https://www.wur.nl/sitemap.xml.gz |
Comments