gcc.nl
robots.txt

Robots Exclusion Standard data for gcc.nl

Resource Scan

Scan Details

Site Domain gcc.nl
Base Domain gcc.nl
Scan Status Ok
Last Scan 2025-11-03T06:40:38+00:00
Next Scan 2025-11-17T06:40:38+00:00

Last Scan

Scanned 2025-11-03T06:40:38+00:00
URL https://gcc.nl/robots.txt
Redirect https://www.gcc.nl/robots.txt
Redirect Domain www.gcc.nl
Redirect Base gcc.nl
Domain IPs 2a00:d10:201a:0:31:200:209:177, 31.200.209.177
Redirect IPs 2a00:d10:201a:0:31:200:209:177, 31.200.209.177
Response IP 31.200.209.177
Found Yes
Hash 5cacd18972dfd356a9ad731af2d07796e902d9a4e7459008430cc07c5a60a426
SimHash 7e0814cae194

Groups

*

Rule Path
Disallow /application/attributes
Disallow /application/authentication
Disallow /application/bootstrap
Disallow /application/config
Disallow /application/controllers
Disallow /application/elements
Disallow /application/helpers
Disallow /application/jobs
Disallow /application/languages
Disallow /application/mail
Disallow /application/models
Disallow /application/page_types
Disallow /application/single_pages
Disallow /application/views
Disallow /ccm/system/captcha/picture
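
As a rough illustration of how a crawler might apply the "*" group above, the following sketch rebuilds the rules as robots.txt lines and checks two URLs with Python's standard urllib.robotparser. The user agent name "ExampleBot" and the tested file path "site.php" are hypothetical, chosen only for the example; they do not come from the scan.

```python
from urllib.robotparser import RobotFileParser

# Disallow paths copied from the "*" group listed in the scan.
DISALLOWED = [
    "/application/attributes",
    "/application/authentication",
    "/application/bootstrap",
    "/application/config",
    "/application/controllers",
    "/application/elements",
    "/application/helpers",
    "/application/jobs",
    "/application/languages",
    "/application/mail",
    "/application/models",
    "/application/page_types",
    "/application/single_pages",
    "/application/views",
    "/ccm/system/captcha/picture",
]


def build_parser(paths):
    """Build a parser from a single 'User-agent: *' group of Disallow rules."""
    lines = ["User-agent: *"] + [f"Disallow: {p}" for p in paths]
    parser = RobotFileParser()
    parser.parse(lines)  # parse() accepts an iterable of robots.txt lines
    return parser


parser = build_parser(DISALLOWED)

# "ExampleBot" and "site.php" are hypothetical names used for illustration.
blocked = parser.can_fetch("ExampleBot", "https://www.gcc.nl/application/config/site.php")
allowed = parser.can_fetch("ExampleBot", "https://www.gcc.nl/")
print(blocked, allowed)
```

The standard-library parser matches rules by simple path prefix, so any URL whose path begins with one of the listed entries is treated as disallowed for every user agent.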