/.well-known/

Log In Sign Up

doc.ks.gov
robots.txt

Robots Exclusion Standard data for doc.ks.gov

Archived Snapshots

Resource Scan

Scan Details

Site Domain	doc.ks.gov
Base Domain	ks.gov
Scan Status	Failed
Failure Stage	Fetching resource.
Failure Reason	Request timed out.
Last Scan	2026-01-27T06:04:28+00:00
Next Scan	2026-02-26T06:04:28+00:00

Last Successful Scan

Scanned	2025-12-06T02:48:10+00:00
URL	https://doc.ks.gov/robots.txt
Domain IPs	165.201.34.4
Response IP	165.201.34.4
Found	Yes
Hash	9f939caf5a06482c2ed47b3c60886b4e9a37635ffe1c6616c41d4deed8d63606
SimHash	ac718b554d65

Groups

*

Rule

Path

Disallow

googlebot

Rule

Path

Disallow

/*sendto_form$

Disallow

/*folder_factories$

Back to top

Comments

Define access-restrictions for robots/spiders
http://www.robotstxt.org/wc/norobots.html
By default we allow robots to access all areas of our site
already accessible to anonymous users
Add Googlebot-specific syntax extension to exclude forms
that are repeated for each piece of content in the site
the wildcard is only supported by Googlebot
http://www.google.com/support/webmasters/bin/answer.py?answer=40367&ctx=sibling

Back to top