mcmastercce.ca
robots.txt

Robots Exclusion Standard data for mcmastercce.ca

Resource Scan

Scan Details

Site Domain mcmastercce.ca
Base Domain mcmastercce.ca
Scan Status Ok
Last Scan2025-03-29T20:05:34+00:00
Next Scan 2025-04-28T20:05:34+00:00

Last Scan

Scanned2025-03-29T20:05:34+00:00
URL https://mcmastercce.ca/robots.txt
Redirect https://continuing.mcmaster.ca/robots.txt
Redirect Domain continuing.mcmaster.ca
Redirect Base mcmaster.ca
Domain IPs 66.209.177.41
Redirect IPs 130.113.213.56
Response IP 130.113.213.56
Found Yes
Hash 9b64b19303cab3ae6ba51f7eee7ba95d4be61adebd2f7d7628e8b0267235f606
SimHash 5010d160e313

Groups

*

Rule Path
Disallow /wp/wp-admin/
Allow /wp/wp-admin/admin-ajax.php

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

applebot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

imagesiftbot

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

omgili

Rule Path
Disallow /

youbot

Rule Path
Disallow /