aasa.org
robots.txt

Robots Exclusion Standard data for aasa.org

Resource Scan

Scan Details

Site Domain aasa.org
Base Domain aasa.org
Scan Status Ok
Last Scan 2025-10-10T05:38:12+00:00
Next Scan 2025-11-09T05:38:12+00:00

Last Scan

Scanned 2025-10-10T05:38:12+00:00
URL https://aasa.org/robots.txt
Redirect https://www.aasa.org:443/robots.txt
Redirect Domain www.aasa.org
Redirect Base aasa.org
Domain IPs 52.54.112.189
Redirect IPs 3.223.46.202, 3.90.211.27
Response IP 3.90.211.27
Found Yes
Hash 0fce694154a321d62bbbba7e09b7fa5c1bee08f571a72ea91430e7e3ddfd84b0
SimHash 681299754ff0

Groups

*

Rule Path
Disallow /Sitefinity
Disallow /sandbox
Disallow /search-results
Disallow /advocacy/the-leading-edge-policy-advocacy-blog

Other Records

Field Value
crawl-delay 120
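
As a rough illustration of how a crawler might interpret the `*` group and the crawl-delay record above, here is a minimal Python sketch using the standard library's `urllib.robotparser`. The `ROBOTS_TXT` text is a reconstruction from the fields in this report, not the verbatim file: the line order and the placement of `Crawl-delay` inside the `*` group are assumptions, and the sample paths being checked are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the fields in this report. The layout is an assumption,
# not the verbatim file served at https://www.aasa.org/robots.txt.
ROBOTS_TXT = """\
User-agent: *
Disallow: /Sitefinity
Disallow: /sandbox
Disallow: /search-results
Disallow: /advocacy/the-leading-edge-policy-advocacy-blog
Crawl-delay: 120
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# Hypothetical sample paths, chosen only to exercise the rules.
sample_paths = [
    "/Sitefinity/login",
    "/sandbox/page",
    "/search-results",
    "/about",
]

# Map each path to whether a generic crawler ("*") may fetch it.
allowed = {path: parser.can_fetch("*", path) for path in sample_paths}
for path, ok in allowed.items():
    print(f"{path}: {'allowed' if ok else 'disallowed'}")

# Crawl delay (in seconds) that applies to the "*" group.
delay = parser.crawl_delay("*")
print(f"crawl-delay: {delay}")
```

In this parser, rules are matched as case-sensitive path prefixes, so a hypothetical `/sitefinity` (lowercase) would not be covered by `Disallow: /Sitefinity`. Other crawlers may interpret the file differently, and not every crawler honors `crawl-delay`.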

Comments

  • Do not delete /Sitefinity. Never any reason to allow indexing here
  • The same goes for sandbox
  • Also disallow search. We already have it set to "noindex", but keep getting googlebot hits