www.earthquakescanada.nrcan.gc.ca
robots.txt

Robots Exclusion Standard data for www.earthquakescanada.nrcan.gc.ca

Resource Scan

Scan Details

Site Domain www.earthquakescanada.nrcan.gc.ca
Base Domain nrcan.gc.ca
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer redirected incorrectly.
Last Scan2024-10-31T10:59:46+00:00
Next Scan 2024-11-01T10:59:46+00:00

Last Successful Scan

Scanned2024-10-17T10:59:36+00:00
URL https://www.earthquakescanada.nrcan.gc.ca/robots.txt
Domain IPs 2600:1413:b000:6::17d5:2bc6, 2600:1413:b000:6::17d5:2bda, 96.17.96.16, 96.17.96.25
Response IP 23.209.46.162
Found Yes
Hash bc5bf5be22e8b497eded793690c3e95ef59727ab27b92aad1c2fd58dabff9872
SimHash 21626261a919

Groups

*

Rule Path
Disallow /cgi-bin/
Disallow /stnsdata/bettydata/
Disallow /stndon/NWFA-ANFO/eve/
Disallow /stndon/GINA-AMI/eve_mo-en.php
Disallow /stndon/GINA-AMI/eve_mo-fr.php
Disallow /stndon/GINA-AMI/read_data-en.php
Disallow /stndon/GINA-AMI/read_data-fr.php
Disallow /stnsdata/nwfa/is/
Disallow /includes/
Disallow /wet-boew-php/
Disallow /dist-4.0/
Disallow /inc-php/

Comments

  • CGI directory
  • Waveform viewer images
  • National Waveform Archive data (not splash page)
  • Global infrasound Archive (GINA) but include splash page
  • WET4.0 content pages