clarksvillegw.com
robots.txt

Robots Exclusion Standard data for clarksvillegw.com

Resource Scan

Scan Details

Site Domain clarksvillegw.com
Base Domain clarksvillegw.com
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Couldn't connect to server.
Last Scan 2024-10-13T06:21:58+00:00
Next Scan 2024-10-14T06:21:58+00:00

Last Successful Scan

Scanned 2024-09-13T06:21:47+00:00
URL http://clarksvillegw.com/robots.txt
Redirect https://www.clarksvilletn.gov/robots.txt
Redirect Domain www.clarksvilletn.gov
Redirect Base clarksvilletn.gov
Domain IPs 207.38.74.161
Redirect IPs 207.38.74.161
Response IP 207.38.74.161
Found Yes
Hash 6771bf6465ddd6e600a7866b5e41825007c4a20433b378e56746547c40d205c5
SimHash 01199cd020e1

Groups

baiduspider

Rule Path
Disallow /

yandex

Rule Path
Disallow /

*

Rule Path
Disallow /activedit
Disallow /admin
Disallow /common/admin/
Disallow /OJA
Disallow /support
Disallow /currenteventsview.asp
Disallow /search.asp
Disallow /currenteventsview.aspx
Disallow /search.aspx
Disallow /currentevents.aspx
Disallow /Support
Disallow /CurrentEventsView.asp
Disallow /Search.asp
Disallow /CurrentEventsView.aspx
Disallow /Search.aspx
Disallow /Search
Disallow /CurrentEvents.aspx
Disallow /Currentevents.aspx
Disallow /map.aspx
Disallow /map.asp
Disallow /Map.aspx
Disallow /Map.asp
Disallow /RSS.aspx
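
The wildcard group lists many paths in several capitalizations, which suggests that matching is case-sensitive prefix matching. The sketch below checks that with Python's standard urllib.robotparser. Assumptions: ROBOTS_TXT is a shortened reconstruction from the rules listed above, not the file as served; ExampleBot and the allowed() helper are hypothetical names; the base URL is the redirect target recorded in the scan.

```python
from urllib.robotparser import RobotFileParser

# Shortened reconstruction of the groups listed above (assumption: this is
# not the original file, only a subset of its rules re-typed by hand).
ROBOTS_TXT = """\
User-agent: baiduspider
Disallow: /

User-agent: *
Disallow: /admin
Disallow: /search.aspx
Disallow: /Search.aspx
Disallow: /map.aspx
Disallow: /Map.aspx
"""

BASE = "https://www.clarksvilletn.gov"  # redirect target from the scan

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def allowed(agent: str, path: str) -> bool:
    """Return True if `agent` may fetch `path` under the rules above."""
    return parser.can_fetch(agent, BASE + path)


if __name__ == "__main__":
    for agent, path in [
        ("ExampleBot", "/search.aspx"),   # listed spelling
        ("ExampleBot", "/Search.aspx"),   # listed spelling
        ("ExampleBot", "/SEARCH.ASPX"),   # spelling that is not listed
        ("ExampleBot", "/admin/login"),   # falls under the /admin prefix
        ("baiduspider", "/about"),        # group with Disallow: /
    ]:
        print(agent, path, allowed(agent, path))
```

Under this parser, a spelling that is not listed (such as /SEARCH.ASPX) is not blocked, which would explain why the file repeats each path in the capitalizations the site actually serves. Other crawlers may implement matching differently.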

siteimprove

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 20

siteimprovebot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 20

siteimprovebot-crawler

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 20
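
The three siteimprove groups carry only a crawl-delay record, so they allow every path while asking for a pause between requests. A minimal sketch of reading that value with urllib.robotparser follows. Assumptions: ROBOTS_TXT is a hand-typed reconstruction of those groups plus one wildcard rule for contrast; delay_for() and ExampleBot are hypothetical names; crawl-delay is a non-standard field, and treating the number as seconds is a common convention rather than something the scan states.

```python
from urllib.robotparser import RobotFileParser

# Reconstruction of the crawl-delay groups (assumption: not the served file).
ROBOTS_TXT = """\
User-agent: siteimprove
Crawl-delay: 20

User-agent: siteimprovebot
Crawl-delay: 20

User-agent: siteimprovebot-crawler
Crawl-delay: 20

User-agent: *
Disallow: /admin
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def delay_for(agent: str, default: float = 0.0) -> float:
    """Return the crawl delay requested for `agent`, or `default` if none."""
    delay = parser.crawl_delay(agent)  # None when no delay applies
    return float(delay) if delay is not None else default


if __name__ == "__main__":
    for agent in ("siteimprove", "siteimprovebot-crawler", "ExampleBot"):
        print(agent, delay_for(agent))
```

Because a named group takes precedence over the wildcard group in this parser, the siteimprove agents are not bound by the wildcard Disallow rules; they only inherit the delay from their own group.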

Other Records

Field Value
sitemap /sitemap.xml
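
The sitemap record is a relative path, while the sitemaps protocol documents this field with a full URL, so a consumer has to choose a base before fetching it. The sketch below resolves it against the redirect URL recorded in the last successful scan. Assumptions: ROBOTS_URL as the base is a choice made for illustration; RobotFileParser.site_maps() requires Python 3.8 or later.

```python
from urllib.parse import urljoin
from urllib.robotparser import RobotFileParser

# Final URL of robots.txt after the redirect recorded in the scan.
ROBOTS_URL = "https://www.clarksvilletn.gov/robots.txt"

parser = RobotFileParser()
parser.parse(["Sitemap: /sitemap.xml"])  # the single record listed above

# site_maps() returns None when no sitemap record is present.
raw_sitemaps = parser.site_maps() or []

# Resolve each entry against the robots.txt URL (assumed base).
sitemaps = [urljoin(ROBOTS_URL, entry) for entry in raw_sitemaps]

if __name__ == "__main__":
    print(sitemaps)
```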