cps.nl
robots.txt

Robots Exclusion Standard data for cps.nl

Resource Scan

Scan Details

Site Domain cps.nl
Base Domain cps.nl
Scan Status Ok
Last Scan 2024-09-23T11:13:20+00:00
Next Scan 2024-10-07T11:13:20+00:00

Last Scan

Scanned 2024-09-23T11:13:20+00:00
URL https://cps.nl/robots.txt
Redirect https://www.cps.nl/robots.txt
Redirect Domain www.cps.nl
Redirect Base cps.nl
Domain IPs 213.193.247.135
Redirect IPs 213.193.247.135
Response IP 213.193.247.135
Found Yes
Hash d18f9807eb186fc57738676acf36ec40a9afbf970a1e64ef44569462a5c77d86
SimHash 8c5ed8007a43
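
The Hash value above is 64 hexadecimal characters long, which is consistent with a SHA-256 digest of the fetched file. The report does not name the algorithm, so the sketch below only assumes SHA-256. The function name `robots_fingerprint` and the sample body are hypothetical, and the sample is not the actual content served by cps.nl.

```python
import hashlib


def robots_fingerprint(body: bytes) -> str:
    """Return the SHA-256 hex digest of a robots.txt body.

    Assumption: the scanner hashes the raw response bytes with SHA-256.
    The report itself does not state which algorithm it uses.
    """
    return hashlib.sha256(body).hexdigest()


# Hypothetical sample body, not the real file from cps.nl.
sample = b"User-agent: *\nDisallow: /page/\n"
digest = robots_fingerprint(sample)
print(len(digest))  # a SHA-256 hex digest is always 64 characters
```

Comparing such a digest between two scans is one simple way to detect that the file changed. A similarity hash like the SimHash field is better suited to judging how much it changed.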

Groups

*

Rule Path
Disallow /cookie_control/
Disallow /page/
Disallow /l/
Disallow /*?
Disallow www.cps.nl/academie-detail
Disallow /location
Disallow /schedule
Disallow /speakers
Disallow /register
Disallow /attendees
Disallow /materials
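
As a rough illustration of how a crawler might apply the rules in this group, the sketch below treats each path as a prefix pattern in which `*` matches any run of characters and a trailing `$` anchors the end of the URL path, following the usual Robots Exclusion Protocol conventions. The helper names `rule_to_regex` and `is_disallowed` are hypothetical, and real crawlers may differ in details such as percent-encoding.

```python
import re

# Disallow paths copied from the "*" group above.
DISALLOW = [
    "/cookie_control/", "/page/", "/l/", "/*?",
    "www.cps.nl/academie-detail",
    "/location", "/schedule", "/speakers",
    "/register", "/attendees", "/materials",
]


def rule_to_regex(rule: str) -> "re.Pattern[str]":
    """Translate one robots.txt path pattern into an anchored regex."""
    anchored = rule.endswith("$")
    core = rule[:-1] if anchored else rule
    # Escape literal pieces; each "*" becomes "match anything".
    pattern = ".*".join(re.escape(part) for part in core.split("*"))
    return re.compile("^" + pattern + ("$" if anchored else ""))


def is_disallowed(path: str, rules=DISALLOW) -> bool:
    """True if any Disallow pattern matches the start of the path."""
    return any(rule_to_regex(rule).match(path) for rule in rules)


print(is_disallowed("/page/2"))           # matches the /page/ prefix
print(is_disallowed("/nieuws?x=1"))       # matches /*? (any query string)
print(is_disallowed("/academie-detail"))  # see note below
```

Because the group contains only Disallow rules, no Allow-versus-Disallow precedence is needed here. Under this reading, the rule `www.cps.nl/academie-detail` never matches a URL path, since paths begin with `/` and the rule does not. The sitemap URL listed under Other Records, `/l/sitemaps/index`, falls under the `/l/` prefix.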

jobdiggerspider

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

olbicobot

Rule Path
Disallow /

*

No rules defined. All paths allowed.
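
The groups above can be read as a lookup from user-agent token to rule list. The sketch below shows one way a crawler might pick its group: a case-insensitive match on its own product token, falling back to `*`. The function name `select_group` is hypothetical. The second, empty `*` group is merged into the first here, which adds no rules. That merge is an assumption about how duplicate groups are combined, not something the report states.

```python
# Groups as listed in the report; values are Disallow paths.
GROUPS = {
    "*": [
        "/cookie_control/", "/page/", "/l/", "/*?",
        "www.cps.nl/academie-detail",
        "/location", "/schedule", "/speakers",
        "/register", "/attendees", "/materials",
    ],
    "jobdiggerspider": ["/"],
    "mj12bot": ["/"],
    "olbicobot": ["/"],
}


def select_group(product_token, groups=GROUPS):
    """Return the Disallow list for a crawler's product token.

    Matching is case-insensitive; crawlers without a dedicated
    group fall back to the "*" group.
    """
    token = product_token.lower()
    if token in groups:
        return groups[token]
    return groups.get("*", [])


print(select_group("MJ12bot"))          # dedicated group: everything blocked
print(len(select_group("Googlebot")))   # falls back to the "*" group
```

In this file the three named crawlers are blocked from the whole site, while every other crawler is subject only to the path rules in the `*` group.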

Other Records

Field Value
sitemap https://www.cps.nl/l/sitemaps/index

Warnings

  • 2 invalid lines.