wfca.wa.gov
robots.txt

Robots Exclusion Standard data for wfca.wa.gov

Resource Scan

Scan Details

Site Domain wfca.wa.gov
Base Domain wa.gov
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a client error.
Last Scan 2026-01-30T00:57:10+00:00
Next Scan 2026-03-01T00:57:10+00:00
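
The two timestamps above imply a rescan interval, which can be checked with a short sketch. The variable names are illustrative only, and the 30-day figure is derived from these two values rather than from any documented scanner setting.

```python
from datetime import datetime

# Timestamps copied from the Scan Details listing (ISO 8601, UTC offset).
last_scan = datetime.fromisoformat("2026-01-30T00:57:10+00:00")
next_scan = datetime.fromisoformat("2026-03-01T00:57:10+00:00")

# Gap between the failed scan and the next scheduled attempt.
interval = next_scan - last_scan
print(interval.days)  # 30
```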

Last Successful Scan

Scanned 2025-12-09T00:55:48+00:00
URL https://wfca.wa.gov/robots.txt
Domain IPs 35.169.50.49, 35.173.82.140, 35.174.132.21
Response IP 35.173.82.140
Found Yes
Hash a1ad83e60be6c0238833016dda3deb4c46415190a7f63f45152914c992e57d5c
SimHash ec941d42c3d8

Groups

*

Rule Path
Disallow /global_inc/
Allow /global_inc/*.css
Allow /global_inc/*.js

*

Rule Path
Disallow /global_engine/ajax/
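
How the rules listed above combine for a given path can be illustrated with a minimal sketch. It assumes the longest-match semantics described in RFC 9309 (the most specific matching rule wins, `*` matches any run of characters, and Allow wins a tie). The helper names `pattern_to_regex` and `is_allowed` are hypothetical, and the sketch covers only the rules shown in this report, not any lines the scanner flagged as invalid.

```python
import re

# Rules copied from the two "*" groups listed above.
RULES = [
    ("disallow", "/global_inc/"),
    ("allow", "/global_inc/*.css"),
    ("allow", "/global_inc/*.js"),
    ("disallow", "/global_engine/ajax/"),
]

def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex."""
    # A trailing '$' anchors the end of the path; '*' matches any characters.
    anchored = pattern.endswith("$")
    body = pattern[:-1] if anchored else pattern
    regex = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile(regex + ("$" if anchored else ""))

def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under the given rules."""
    best_len, allowed = -1, True  # no matching rule means allowed
    for kind, pattern in rules:
        # re.match anchors at the start, so patterns act as path prefixes.
        if pattern_to_regex(pattern).match(path):
            length = len(pattern)
            # The longest pattern wins; on equal length, Allow wins.
            if length > best_len or (length == best_len and kind == "allow"):
                best_len, allowed = length, kind == "allow"
    return allowed

for p in ["/global_inc/site.css", "/global_inc/config.php",
          "/global_engine/ajax/search", "/about/"]:
    print(p, is_allowed(p))
```

Under these assumptions, stylesheets and scripts inside /global_inc/ remain crawlable because the Allow patterns are longer than the Disallow prefix, while other files in that directory and everything under /global_engine/ajax/ are blocked.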

Other Records

Field Value
sitemap https://wfca.wa.gov/autositemapindex.xml

Comments

  • When crawlers hit the engine directory, they sometimes publish confusing links to site content in their search results, so we exclude these specific engines from crawling it.
  • Note: Certain crawlers do need access to this directory, so we do not want a blanket exclude statement here.

Warnings

  • 18 invalid lines.