wedu.ca
robots.txt

Robots Exclusion Standard data for wedu.ca

Resource Scan

Scan Details

Site Domain wedu.ca
Base Domain wedu.ca
Scan Status Ok
Last Scan2026-01-03T01:18:18+00:00
Next Scan 2026-02-02T01:18:18+00:00

Last Scan

Scanned2026-01-03T01:18:18+00:00
URL https://wedu.ca/robots.txt
Redirect https://www.wedu.ca/robots.txt
Redirect Domain www.wedu.ca
Redirect Base wedu.ca
Domain IPs 104.26.10.68, 104.26.11.68, 172.67.73.116, 2606:4700:20::681a:a44, 2606:4700:20::681a:b44, 2606:4700:20::ac43:4974
Redirect IPs 104.26.10.68, 104.26.11.68, 172.67.73.116, 2606:4700:20::681a:a44, 2606:4700:20::681a:b44, 2606:4700:20::ac43:4974
Response IP 104.26.11.68
Found Yes
Hash dbf95cdbac3fd9538b61b6961bc10677662ce8978ea1c8ac261cf31c5a8bffa5
SimHash 5411d1602b13

Groups

*

Rule Path
Disallow /api
Disallow /*callbackUrl%3D*
Disallow */auth/new-password
Disallow */auth/new-verification
Disallow */auth/reset
Disallow */auth/error
Disallow */profile
Disallow */auth/reset-password
Disallow */auth/verify-phone
Disallow /monitoring
Disallow /cdn-cgi/zaraz

Other Records

Field Value
crawl-delay 1

gptbot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.wedu.ca/sitemap.xml
sitemap https://www.wedu.ca/sitemaps/listings/index.xml
sitemap https://blog.wedu.ca/sitemap_index.xml