kta-hike.org
robots.txt

Robots Exclusion Standard data for kta-hike.org

Resource Scan

Scan Details

Site Domain kta-hike.org
Base Domain kta-hike.org
Scan Status Ok
Last Scan2025-11-18T19:36:15+00:00
Next Scan 2025-12-18T19:36:15+00:00

Last Scan

Scanned2025-11-18T19:36:15+00:00
URL https://kta-hike.org/robots.txt
Redirect https://www.kta-hike.org/robots.txt
Redirect Domain www.kta-hike.org
Redirect Base kta-hike.org
Domain IPs 199.34.228.46
Redirect IPs 199.34.228.46
Response IP 199.34.228.46
Found Yes
Hash 5f2cdefff4dc7a40d2b0ab8cd9693d01d35f81dc7d81c0c1c72a572a552f4c58
SimHash 4810c80cee93

Groups

nerdybot

Rule Path
Disallow /

dotbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

*

Rule Path
Disallow /ajax/
Disallow /apps/
Disallow /2026-st-john-lottery-interest-form.html
Disallow /webinar-trailsideinvaders.html
Disallow /vinpinterest.html
Disallow /spring-registration.html
Disallow /get-involved.html
Disallow /webinar-penndot.html
Disallow /get-outdoors.html
Disallow /major-pa-hiking-trails.html
Disallow /clubs.html
Disallow /spring-hiking-weekend-test-copy.html
Disallow /store-old-archive.html
Disallow /link-page.html
Disallow /trailacademy.html

Other Records

Field Value
sitemap https://www.kta-hike.org/sitemap.xml