wegreizen.nl
robots.txt

Robots Exclusion Standard data for wegreizen.nl

Resource Scan

Scan Details

Site Domain wegreizen.nl
Base Domain wegreizen.nl
Scan Status Ok
Last Scan2025-12-06T20:40:40+00:00
Next Scan 2025-12-13T20:40:40+00:00

Last Scan

Scanned2025-12-06T20:40:40+00:00
URL https://wegreizen.nl/robots.txt
Domain IPs 104.21.5.10, 172.67.132.182, 2606:4700:3034::ac43:84b6, 2606:4700:3036::6815:50a
Response IP 104.21.5.10
Found Yes
Hash c6bd6a1d70d694c1405d3ed95827cc553a524d4044ee90d652e1be28d2b387a7
SimHash 532d4672ba82

Groups

googlebot

Rule Path
Disallow /*?country=*
Disallow /*?country=*&city=*
Disallow /*?newtopic=*
Disallow /*?edit=*&textid=*
Disallow /*?edit=*&categid=*
Disallow /images/protoGallery.swf
Disallow /gallery.php
Disallow /swfscripts/infocity.php
Disallow /swfscripts/infocountry.php
Disallow /swfscripts/infofile.php
Disallow /world/
Disallow /popup/
Disallow /orderforms/
Disallow /orderform/
Disallow /orderbanner/
Disallow /orderbanner3/
Disallow /ordergallery/
Disallow /friend/
Disallow /addphoto/
Disallow /review/
Disallow /adv/

*

Rule Path
Disallow /world/
Disallow /popup/
Disallow /orderforms/
Disallow /orderform/
Disallow /orderbanner/
Disallow /orderbanner3/
Disallow /ordergallery/
Disallow /friend/
Disallow /addphoto/
Disallow /review/
Disallow /adv/

ahrefsbot

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

urlmetrics

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

yandex

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

Warnings

  • 2 invalid lines.