scmajobs.ca
robots.txt

Robots Exclusion Standard data for scmajobs.ca

Resource Scan

Scan Details

Site Domain scmajobs.ca
Base Domain scmajobs.ca
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Couldn't connect to server.
Last Scan 2024-07-27T15:30:36+00:00
Next Scan 2024-10-25T15:30:36+00:00

Last Successful Scan

Scanned 2023-03-13T04:27:58+00:00
URL https://scmajobs.ca/robots.txt
Redirect https://www.scmajobs.ca/robots.txt
Redirect Domain www.scmajobs.ca
Redirect Base scmajobs.ca
Domain IPs 65.9.112.105, 65.9.112.60, 65.9.112.66, 65.9.112.97
Redirect IPs 18.161.111.103, 18.161.111.124, 18.161.111.85, 18.161.111.93
Response IP 18.65.25.53
Found Yes
Hash 40c32e49eefff580701c9c2f80d2df4ec704df6f9f301d6f4893e3bd4e7b6413
SimHash 7d5f2f8041d0

Groups

*

Rule Path
Disallow /job-alerts/
Disallow /quick-add-alert/
Disallow /add-job-alert/
Disallow /add-job-alert-logged-out/
Disallow /job-preview/
Disallow /session-img/
Disallow /ads/
Disallow /metric-scripts/
Disallow /advert-scripts/
Disallow /health/
Disallow /supporting-documents/
Disallow /job-view/

Other Records

Field Value
sitemap https://www.scmajobs.ca/sitemapindex.xml
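
The group and sitemap record above can be checked against sample paths with Python's standard urllib.robotparser module. This is a minimal sketch, not the site's actual file: the robots.txt text is reconstructed from the rules listed in this report, so line order and spacing are assumptions, and the agent name "ExampleBot", the helper is_allowed, and the sample paths are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the rules in this report; the original file's exact
# layout is unknown, so treat this text as an assumption.
ROBOTS_LINES = [
    "# Robot exclusion file",
    "User-agent: *",
    "Disallow: /job-alerts/",
    "Disallow: /quick-add-alert/",
    "Disallow: /add-job-alert/",
    "Disallow: /add-job-alert-logged-out/",
    "Disallow: /job-preview/",
    "Disallow: /session-img/",
    "Disallow: /ads/",
    "Disallow: /metric-scripts/",
    "Disallow: /advert-scripts/",
    "Disallow: /health/",
    "Disallow: /supporting-documents/",
    "Disallow: /job-view/",
    "Sitemap: https://www.scmajobs.ca/sitemapindex.xml",
]

parser = RobotFileParser()
parser.parse(ROBOTS_LINES)  # parse in memory; no network request is made


def is_allowed(path, agent="ExampleBot"):
    """Return True if the hypothetical agent may fetch the given path."""
    return parser.can_fetch(agent, "https://www.scmajobs.ca" + path)


if __name__ == "__main__":
    # Sample paths are illustrative only.
    for path in ("/job-view/123", "/ads/banner.js", "/jobs/"):
        print(path, "allowed" if is_allowed(path) else "disallowed")
    print("sitemaps:", parser.site_maps())
```

The standard-library parser matches rules by plain path prefix, so a path such as /job-view without the trailing slash is not covered by the Disallow /job-view/ rule. Other crawlers may interpret the same rules slightly differently.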

Comments

  • Robot exclusion file