gov.nl.ca
robots.txt

Robots Exclusion Standard data for gov.nl.ca

Resource Scan

Scan Details

Site Domain gov.nl.ca
Base Domain gov.nl.ca
Scan Status Ok
Last Scan2024-09-23T01:25:50+00:00
Next Scan 2024-10-23T01:25:50+00:00

Last Scan

Scanned2024-09-23T01:25:50+00:00
URL https://gov.nl.ca/robots.txt
Redirect https://www.gov.nl.ca/robots.txt
Redirect Domain www.gov.nl.ca
Redirect Base gov.nl.ca
Domain IPs 98.143.128.70
Redirect IPs 98.143.128.70
Response IP 98.143.128.70
Found Yes
Hash 172cdf724f6c077361dc72fbd422b3157b13d8c650164e898f7ef4f7b3402004
SimHash 5040c9c369d2

Groups

*

Rule Path
Disallow /lrb/searchsys.html
Disallow lrb/searchsys.html
Disallow /nlwin
Disallow */misc/*
Disallow */private/*
Disallow *.tmp
Disallow */protectedsite/*

Comments

  • robots.txt specific for sub-website http://www.gov.nl.ca/lrb/