mswa.org.au
robots.txt

Robots Exclusion Standard data for mswa.org.au

Resource Scan

Scan Details

Site Domain mswa.org.au
Base Domain mswa.org.au
Scan Status Ok
Last Scan2025-12-14T23:37:53+00:00
Next Scan 2025-12-21T23:37:53+00:00

Last Scan

Scanned2025-12-14T23:37:53+00:00
URL https://mswa.org.au/robots.txt
Domain IPs 104.26.10.25, 104.26.11.25, 172.67.73.228, 2606:4700:20::681a:a19, 2606:4700:20::681a:b19, 2606:4700:20::ac43:49e4
Response IP 104.26.10.25
Found Yes
Hash 57f8df9ca211664e0f258583d67ef5d80758d2def67b90c8ff426a93a519c7c3
SimHash 615c995a4736

Groups

*

Rule Path
Disallow /cpresources/
Disallow /vendor/
Disallow /.env
Disallow /cache/
Disallow /components/
Disallow /neuro-collective

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://mswa.org.au/sitemaps-1-sitemap.xml
sitemap https://donations.mswa.org.au/sitemaps-1-sitemap.xml
sitemap https://mswa.org.au/sitemaps-1-sitemap.xml

Comments

  • robots.txt for https://mswa.org.au/
  • live - don't allow web crawlers to index cpresources/ or vendor/
  • Disallow ChatGPT bot, as there's no benefit to allowing it to index your site
  • Disallow Google Bard and Vertex AI bots, as there's no benefit to allowing it to index your site
  • Disallow Perplexity bot, as there's no benefit to allowing it to index your site