theconnecticutscoop.com
robots.txt

Robots Exclusion Standard data for theconnecticutscoop.com

Resource Scan

Scan Details

Site Domain theconnecticutscoop.com
Base Domain theconnecticutscoop.com
Scan Status Ok
Last Scan2025-12-09T19:53:52+00:00
Next Scan 2025-12-16T19:53:52+00:00

Last Scan

Scanned2025-12-09T19:53:52+00:00
URL https://theconnecticutscoop.com/robots.txt
Redirect https://www.theconnecticutscoop.com/robots.txt
Redirect Domain www.theconnecticutscoop.com
Redirect Base theconnecticutscoop.com
Domain IPs 199.34.228.68
Redirect IPs 199.34.228.68
Response IP 199.34.228.68
Found Yes
Hash 3910571e9e2bb3b2b7ad51c873f7de4350a27f15fd16b8d48ffc641f01e7c45b
SimHash e814d4046613

Groups

nerdybot

Rule Path
Disallow /

dotbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

*

Rule Path
Disallow /ajax/
Disallow /apps/
Disallow /index.html
Disallow /the-team.html
Disallow /coffee-trail.html
Disallow /ct-tourism.html
Disallow /towns--series.html
Disallow /homepage.html

Other Records

Field Value
sitemap https://www.theconnecticutscoop.com/sitemap.xml