capilanou.ca
robots.txt

Robots Exclusion Standard data for capilanou.ca

Resource Scan

Scan Details

Site Domain capilanou.ca
Base Domain capilanou.ca
Scan Status Ok
Last Scan 2024-09-19T07:37:48+00:00
Next Scan 2024-10-19T07:37:48+00:00

Last Scan

Scanned 2024-09-19T07:37:48+00:00
URL https://capilanou.ca/robots.txt
Redirect https://www.capilanou.ca/robots.txt
Redirect Domain www.capilanou.ca
Redirect Base capilanou.ca
Domain IPs 20.220.131.99
Redirect IPs 20.220.131.99
Response IP 20.220.131.99
Found Yes
Hash 6ceb84d9884251e3e4b4496f96908a455aae98fb296b1ea490f7283eb26768f4
SimHash a01104638117

Groups

terminalfour-nutch-spider

Rule Path
Allow /site-search/

*

Rule Path
Allow /about-capu/get-to-know-us/events/items/*
Allow /student-services/community/blueshore-financial-centre-for-the-performing-arts/our-events/all-events/events/*
Allow /about-capu/get-to-know-us/news/*/
Disallow /site-search/
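
The groups above can be checked with a short script. The following is a minimal sketch in Python using the standard library's urllib.robotparser. The robots.txt text is an assumption: it is reconstructed from the rules listed here, and the live file's exact layout and ordering may differ. The user agent name example-bot is hypothetical. Note that urllib.robotparser does plain prefix matching and does not expand the * wildcard inside paths, so only the /site-search/ rules are exercised.

```python
from urllib.robotparser import RobotFileParser

# Assumption: reconstructed from the groups listed in this report;
# the live robots.txt may be laid out or ordered differently.
ROBOTS_TXT = """\
User-agent: terminalfour-nutch-spider
Allow: /site-search/

User-agent: *
Allow: /about-capu/get-to-know-us/events/items/*
Allow: /student-services/community/blueshore-financial-centre-for-the-performing-arts/our-events/all-events/events/*
Allow: /about-capu/get-to-know-us/news/*/
Disallow: /site-search/

Sitemap: https://www.capilanou.ca/sitemap-en.xml
"""


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text without making a network request."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)
SEARCH_URL = "https://www.capilanou.ca/site-search/"

# The named group applies to the TerminalFour spider.
spider_allowed = parser.can_fetch("terminalfour-nutch-spider", SEARCH_URL)

# "example-bot" is a hypothetical agent that falls through to the * group.
other_allowed = parser.can_fetch("example-bot", SEARCH_URL)

# site_maps() requires Python 3.8 or later.
sitemaps = parser.site_maps()

print("terminalfour-nutch-spider:", spider_allowed)
print("example-bot:", other_allowed)
print("sitemaps:", sitemaps)
```

Under these assumptions, the spider is allowed to fetch /site-search/ while any other agent is not, which matches the intent the two groups suggest: the site's own search crawler may index the search pages, and general crawlers may not.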

Other Records

Field Value
sitemap https://www.capilanou.ca/sitemap-en.xml