cps.nl
robots.txt

Robots Exclusion Standard data for cps.nl

Resource Scan

Scan Details

Site Domain cps.nl
Base Domain cps.nl
Scan Status Ok
Last Scan 2024-11-04T22:58:37+00:00
Next Scan 2024-11-18T22:58:37+00:00

Last Scan

Scanned 2024-11-04T22:58:37+00:00
URL https://cps.nl/robots.txt
Redirect https://www.cps.nl/robots.txt
Redirect Domain www.cps.nl
Redirect Base cps.nl
Domain IPs 213.193.247.135
Redirect IPs 213.193.247.135
Response IP 213.193.247.135
Found Yes
Hash 169b03c9fab4a6d7f6229fb33c4ef79b188bd85453e4e86969b2ee1eeabb0726
SimHash a81ed802bac3
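
The Hash value above is 64 hexadecimal characters long, which is consistent with a SHA-256 digest of the fetched file, although the report does not name the algorithm. The following is a minimal sketch under that assumption; the function name sha256_hex is hypothetical, and whether the scanner hashes the raw response bytes or a normalized form is unknown.

```python
import hashlib


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of a robots.txt body as lowercase hex.

    Assumption: the report's Hash field is a SHA-256 digest of the raw
    response body. The report itself does not confirm this.
    """
    return hashlib.sha256(body).hexdigest()


# A SHA-256 hex digest always has 64 characters, the same length as
# the Hash value shown in the report.
digest = sha256_hex(b"User-agent: *\nDisallow: /\n")
print(len(digest))
```

Comparing the digest of a freshly fetched copy against the stored value would be one way to detect that the file changed between scans, provided the assumption about the algorithm and input holds.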

Groups

*

Rule Path
Disallow /cookie_control/
Disallow /page/
Disallow /l/
Disallow /academie/1
Disallow /blogs?
Disallow /attendees
Disallow /location
Disallow /materials
Disallow /schedule
Disallow /speakers
Disallow /register

jobdiggerspider

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

olbicobot

Rule Path
Disallow /

*

No rules defined. All paths allowed.
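
The groups above can be checked against sample URLs with Python's standard urllib.robotparser module. This is a sketch, not the scanner's own logic: the robots.txt text is reconstructed from the rules listed in this report (the original layout, comments, and the 2 invalid lines are not known), the empty second * group is left out, and the agent name ExampleBot and the sample paths are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the groups listed in this report. The real file may
# differ in ordering, comments, and the lines the scanner flagged invalid.
ROBOTS_TXT = """\
User-agent: *
Disallow: /cookie_control/
Disallow: /page/
Disallow: /l/
Disallow: /academie/1
Disallow: /blogs?
Disallow: /attendees
Disallow: /location
Disallow: /materials
Disallow: /schedule
Disallow: /speakers
Disallow: /register

User-agent: jobdiggerspider
Disallow: /

User-agent: mj12bot
Disallow: /

User-agent: olbicobot
Disallow: /

Sitemap: https://www.cps.nl/l/sitemaps/index
"""


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text without making a network request."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)
BASE = "https://www.cps.nl"

# ExampleBot is a hypothetical crawler that falls under the * group.
for agent, path in [
    ("ExampleBot", "/"),
    ("ExampleBot", "/page/1"),
    ("ExampleBot", "/academie/2"),
    ("mj12bot", "/"),
]:
    print(agent, path, parser.can_fetch(agent, BASE + path))
```

The standard-library parser matches rules by simple path prefix, so its verdicts for edge cases such as the /blogs? rule may differ from those of other crawlers or of the scanner that produced this report.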

Other Records

Field Value
sitemap https://www.cps.nl/l/sitemaps/index

Warnings

  • 2 invalid lines.