agctx.org
robots.txt

Robots Exclusion Standard data for agctx.org

Resource Scan

Scan Details

Site Domain agctx.org
Base Domain agctx.org
Scan Status Ok
Last Scan2026-01-22T15:48:20+00:00
Next Scan 2026-02-21T15:48:20+00:00

Last Scan

Scanned2026-01-22T15:48:20+00:00
URL https://agctx.org/robots.txt
Redirect https://www.agctx.org/robots.txt
Redirect Domain www.agctx.org
Redirect Base agctx.org
Domain IPs 199.34.229.100
Redirect IPs 199.34.229.100
Response IP 199.34.229.100
Found Yes
Hash 6abe6a21c64d5abb6c210713348f1c462c4f290e8c55741fe78d49604a7f25db
SimHash 0845d4146712

Groups

nerdybot

Rule Path
Disallow /

dotbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

*

Rule Path
Disallow /ajax/
Disallow /apps/
Disallow /https%3A//web.agctx.org/events
Disallow /https%3A//web.agctx.org/atlas/forms/general/6
Disallow /https%3A//web.agctx.org/portal
Disallow /https%3A//web.agctx.org/template-update?cleartemplatecache=true
Disallow /https%3A//secure.anedot.com/agc-of-texas-pac/donate
Disallow /https%3A//web.agctx.org/atlas/directory/search
Disallow /about.html
Disallow /services.html
Disallow /news--events.html
Disallow /membership.html
Disallow /404.html
Disallow /site-map.html
Disallow /mc-template.html
Disallow /how-to-guides.html

Other Records

Field Value
sitemap https://www.agctx.org/sitemap.xml