guardtraining.ca
robots.txt

Robots Exclusion Standard data for guardtraining.ca

Resource Scan

Scan Details

Site Domain guardtraining.ca
Base Domain guardtraining.ca
Scan Status Ok
Last Scan 2024-09-24T23:14:46+00:00
Next Scan 2024-10-08T23:14:46+00:00

Last Scan

Scanned 2024-09-24T23:14:46+00:00
URL https://guardtraining.ca/robots.txt
Domain IPs 3.98.65.114, 52.60.184.165
Response IP 3.98.65.114
Found Yes
Hash 5306216ee2249d18564c8cb289344b9693485ad3e67a50fc10eec1ecbb53831a
SimHash f2842d8de651
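
The Hash value above is 64 hexadecimal characters long, which is consistent with a SHA-256 digest, although the report does not name the algorithm or say exactly which bytes were hashed. As a minimal sketch under that assumption, a fetched robots.txt body could be compared against the recorded value as follows. The helper names body_digest and has_changed are hypothetical and are not part of the scanner.

```python
import hashlib

# Value copied from the "Hash" field of the scan report.
RECORDED_HASH = "5306216ee2249d18564c8cb289344b9693485ad3e67a50fc10eec1ecbb53831a"


def body_digest(body: bytes) -> str:
    """Return the SHA-256 hex digest of a response body.

    Assumption: the scanner hashes the raw response bytes with SHA-256.
    The report itself does not confirm this.
    """
    return hashlib.sha256(body).hexdigest()


def has_changed(body: bytes, recorded: str = RECORDED_HASH) -> bool:
    """Report whether a freshly fetched body differs from the recorded hash."""
    return body_digest(body) != recorded.lower()
```

If the assumption holds, has_changed would return False for a byte-identical copy of the file fetched at scan time, and True after any edit to the file, including whitespace changes.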

Groups

*

Rule Path
Disallow /app/
Disallow /admin/
Disallow /scorm/
Disallow /orders/
Disallow /checkout/
Disallow /search/
Disallow /users/confirmation/

Other Records

Field Value
sitemap https://guardtraining.ca/sitemap.xml
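
To illustrate how a crawler might interpret the group and sitemap record listed above, the following sketch rebuilds an equivalent robots.txt from the report and checks a few URLs with Python's standard urllib.robotparser module. The reconstructed text is an assumption (the live file may differ in ordering, whitespace, and comments), and the user agent "ExampleBot" and the sample paths are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the scan report; not a verbatim copy of the live file.
ROBOTS_TXT = """\
User-agent: *
Disallow: /app/
Disallow: /admin/
Disallow: /scorm/
Disallow: /orders/
Disallow: /checkout/
Disallow: /search/
Disallow: /users/confirmation/

Sitemap: https://guardtraining.ca/sitemap.xml
"""


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text without making a network request."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)

# Hypothetical paths chosen only to exercise the rules.
for path in ("/app/dashboard", "/checkout/", "/courses/", "/users/profile"):
    url = "https://guardtraining.ca" + path
    verdict = "allowed" if parser.can_fetch("ExampleBot", url) else "disallowed"
    print(f"{path}: {verdict}")

# site_maps() requires Python 3.8 or later.
print(parser.site_maps())
```

Because every rule ends in a trailing slash, the standard-library parser matches by path prefix: /users/confirmation/ is disallowed while other paths under /users/ remain allowed. Other crawlers may apply different matching rules.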

Comments

  • See http://www.robotstxt.org/robotstxt.html for documentation on how to use the robots.txt file
  • To ban all spiders from the entire site uncomment the next two lines:
  • User-agent: *
  • Disallow: /
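
The comment above describes two lines that are inactive while commented out. As a rough sketch of the difference, the snippet below parses both forms with Python's standard urllib.robotparser module. The "#" prefix on the commented lines is an assumption about how they appear in the file, and "ExampleBot" is a hypothetical user agent.

```python
from urllib.robotparser import RobotFileParser


def allows_homepage(lines: list[str]) -> bool:
    """Return True if the given robots.txt lines let a crawler fetch '/'."""
    parser = RobotFileParser()
    parser.parse(lines)
    return parser.can_fetch("ExampleBot", "https://guardtraining.ca/")


# Assumed form in the file: the two lines are commented out, so they are ignored.
commented_out = ["# User-agent: *", "# Disallow: /"]

# Uncommented form: every path on the site is disallowed for every crawler.
uncommented = ["User-agent: *", "Disallow: /"]

print(allows_homepage(commented_out))
print(allows_homepage(uncommented))
```

With only the commented lines present, the parser finds no rules and grants access. Once they are uncommented, the single "Disallow: /" rule matches every path.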