britishcolumbia.ca
robots.txt

Robots Exclusion Standard data for britishcolumbia.ca

Resource Scan

Scan Details

Site Domain britishcolumbia.ca
Base Domain britishcolumbia.ca
Scan Status Ok
Last Scan2025-12-03T17:17:19+00:00
Next Scan 2025-12-17T17:17:19+00:00

Last Scan

Scanned2025-12-03T17:17:19+00:00
URL https://britishcolumbia.ca/robots.txt
Redirect https://www.britishcolumbia.ca/robots.txt
Redirect Domain www.britishcolumbia.ca
Redirect Base britishcolumbia.ca
Domain IPs 142.44.217.176
Redirect IPs 104.18.34.241, 172.64.153.15
Response IP 172.64.153.15
Found Yes
Hash 2d74619b47fa4aa1d1414db78c57442c0c1edb9143e9f9bf96cc4d116f76d727
SimHash 1c305cc0e6a2

Groups

*

Rule Path
Disallow /wp-admin/
Disallow /refer/
Disallow /persona/*
Disallow /test/*
Disallow /taxonomy/*
Disallow /resource/*
Disallow /tir/*
Disallow /office/*
Disallow /feed/*
Disallow /content/*
Disallow /?term=*
Disallow /search/
Disallow /?taxonomy=
Disallow /?first_nations_category=*

Other Records

Field Value
crawl-delay 2

*

Rule Path
Disallow /wp-content/cache/
Disallow /wp-content/uploads/wpcf7_captcha/

Other Records

Field Value
sitemap https://www.britishcolumbia.ca/sitemap_index.xml

Comments

  • This robots.txt file controls crawling of URLs under https://www.britishcolumbia.ca
  • BEGIN W3TC ROBOTS
  • END W3TC ROBOTS