youbahn.nl
robots.txt

Robots Exclusion Standard data for youbahn.nl

Resource Scan

Scan Details

Site Domain youbahn.nl
Base Domain youbahn.nl
Scan Status Ok
Last Scan2024-09-19T14:07:13+00:00
Next Scan 2024-09-26T14:07:13+00:00

Last Scan

Scanned2024-09-19T14:07:13+00:00
URL https://youbahn.nl/robots.txt
Redirect https://www.youbahn.nl/robots.txt
Redirect Domain www.youbahn.nl
Redirect Base youbahn.nl
Domain IPs 104.110.191.40, 104.110.191.67
Redirect IPs 2600:1413:b000:6::17d5:2bd4, 2600:1413:b000:6::17d5:2bda, 96.17.96.18, 96.17.96.30
Response IP 23.44.5.49
Found Yes
Hash eb2fc9d3a5e95d9f8960a24e0ad8ed859f8d1561fc67ed53b5a1def2fdd19c42
SimHash 05082c747b32

Groups

*

Rule Path
Disallow /cpresources/
Disallow /vendor/
Disallow /.env
Disallow /cache/
Disallow /registratie-afronden
Disallow /werkgever/
Disallow /werkgever/
Disallow /wachtwoord-vergeten/
Disallow /404-opdracht
Disallow /zzp/inloggen
Disallow *?postalcode
Disallow *?functions%5B1%5D
Disallow *?functions%5B2%5D
Disallow *?functions%5B3%5D
Disallow *?functions%5B4%5D
Disallow *?functions%5B5%5D
Disallow *?max_commuting_distance
Disallow *?start_date
Disallow *?end_date

Other Records

Field Value
sitemap https://www.youbahn.nl/sitemaps-2-sitemap.xml

Comments

  • robots.txt for /
  • live - don't allow web crawlers to index cpresources/ or vendor/