missia.org
robots.txt

Robots Exclusion Standard data for missia.org

Resource Scan

Scan Details

Site Domain missia.org
Base Domain missia.org
Scan Status Ok
Last Scan2026-01-16T00:27:42+00:00
Next Scan 2026-02-15T00:27:42+00:00

Last Scan

Scanned2026-01-16T00:27:42+00:00
URL https://missia.org/robots.txt
Domain IPs 5.252.32.34
Response IP 5.252.32.34
Found Yes
Hash 6756374dc3143fdd832efd1282a2f06f2b29a7ba1c586f16ba14acf4bd55e9d8
SimHash 4c145970c771

Groups

*

Rule Path
Disallow /search
Disallow /admin
Disallow /search?*
Disallow /search?search=
Disallow /*.pdf$
Disallow /?
Disallow /*?page=
Disallow /cgi-bin*
Allow /

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://missia.org/sitemap.xml