awa-la.org
robots.txt

Robots Exclusion Standard data for awa-la.org

Resource Scan

Scan Details

Site Domain awa-la.org
Base Domain awa-la.org
Scan Status Ok
Last Scan 2026-02-18T00:09:18+00:00
Next Scan 2026-03-20T00:09:18+00:00

Last Scan

Scanned 2026-02-18T00:09:18+00:00
URL https://awa-la.org/robots.txt
Domain IPs 104.21.78.180, 172.67.136.40, 2606:4700:3034::ac43:8828, 2606:4700:3036::6815:4eb4
Response IP 104.21.78.180
Found Yes
Hash 341a291796cf07f49972595f6525947acf641d4a6d455251cf21d1afcdb28a33
SimHash 4e245970c775

Groups

*

Rule Path
Disallow /search
Disallow /admin
Disallow /search?*
Disallow /search?search=
Disallow /*.pdf$
Disallow /?
Disallow /*?page=
Disallow /cgi-bin*
Allow /

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

ccbot

Rule Path
Disallow /
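The groups above can be read as a small decision procedure. The following is a minimal sketch of how a crawler might evaluate them, assuming the longest-match precedence described in RFC 9309 (the most specific matching pattern wins, and Allow wins a tie), with `*` as a wildcard and a trailing `$` as an end anchor. The names `GROUPS`, `pattern_to_regex`, and `is_allowed` are hypothetical and not part of the scan data. The sketch also assumes the caller passes the crawler's bare product token (for example `GPTBot`) rather than a full User-Agent header.

```python
import re

# Groups transcribed from the scan above; keys are lowercase user-agent tokens.
GROUPS = {
    "*": [
        ("disallow", "/search"),
        ("disallow", "/admin"),
        ("disallow", "/search?*"),
        ("disallow", "/search?search="),
        ("disallow", "/*.pdf$"),
        ("disallow", "/?"),
        ("disallow", "/*?page="),
        ("disallow", "/cgi-bin*"),
        ("allow", "/"),
    ],
    "gptbot": [("disallow", "/")],
    "chatgpt-user": [("disallow", "/")],
    "ccbot": [("disallow", "/")],
}


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end of the path.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(user_agent, path):
    """Return True if `path` may be fetched by `user_agent` (assumed to be a product token)."""
    # Fall back to the '*' group when no group names this agent.
    rules = GROUPS.get(user_agent.lower(), GROUPS["*"])
    best = None  # (pattern length, is_allow); longer wins, Allow wins a tie
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            candidate = (len(pattern), kind == "allow")
            if best is None or candidate > best:
                best = candidate
    # A path matched by no rule is allowed by default.
    return True if best is None else best[1]
```

Under these assumptions, `is_allowed("Googlebot", "/about")` is true because only `Allow /` matches, `is_allowed("Googlebot", "/search?search=x")` is false because `/search?search=` is the longest matching pattern, and `is_allowed("GPTBot", "/about")` is false because the `gptbot` group disallows everything. Real crawlers may differ in how they select a group and resolve ties, so this is illustrative rather than a statement of how any particular bot behaves.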

Other Records

Field Value
sitemap https://awa-la.org/sitemap.xml