tpsgc-pwgsc.gc.ca
robots.txt

Robots Exclusion Standard data for tpsgc-pwgsc.gc.ca

Resource Scan

Scan Details

Site Domain tpsgc-pwgsc.gc.ca
Base Domain tpsgc-pwgsc.gc.ca
Scan Status Ok
Last Scan2024-09-18T17:43:13+00:00
Next Scan 2024-10-18T17:43:13+00:00

Last Scan

Scanned2024-09-18T17:43:13+00:00
URL https://tpsgc-pwgsc.gc.ca/robots.txt
Redirect https://www.tpsgc-pwgsc.gc.ca/robots.txt
Redirect Domain www.tpsgc-pwgsc.gc.ca
Redirect Base tpsgc-pwgsc.gc.ca
Domain IPs 205.193.233.215
Redirect IPs 205.193.233.215
Response IP 205.193.233.215
Found Yes
Hash 069c0d36901409c8d6cafa41767911312131c57325b75bc3bf9a2361a1c0d506
SimHash 41548a524510

Groups

*

Rule Path
Disallow /cgi-bin/language.pl
Disallow /pffsslefifddxxzz/

gsa-crawler

Rule Path
Disallow /cgi-bin/proactive/cl.pl
Disallow /pffsslefifddxxzz/

siteimprovebot-crawler

Rule Path
Allow /