space.cweb.nl
robots.txt

Robots Exclusion Standard data for space.cweb.nl

Resource Scan

Scan Details

Site Domain space.cweb.nl
Base Domain cweb.nl
Scan Status Ok
Last Scan2026-02-08T08:08:55+00:00
Next Scan 2026-03-10T08:08:55+00:00

Last Scan

Scanned2026-02-08T08:08:55+00:00
URL https://space.cweb.nl/robots.txt
Domain IPs 104.21.20.122, 172.67.192.217, 2606:4700:3034::6815:147a, 2606:4700:3037::ac43:c0d9
Response IP 172.67.192.217
Found Yes
Hash 1d01cf108ea25118fb7d7992c6efddf339602be9fe837fcf8407ba9384b567eb
SimHash 8841d9d4a311

Groups

turnitinbot

Rule Path
Disallow /

googlebot

Rule Path
Disallow /content/
Disallow /include/
Allow /

*

Rule Path
Disallow /content/
Disallow /include/
Disallow /js/
Disallow /style/