catholiccaretas.org.au
robots.txt

Robots Exclusion Standard data for catholiccaretas.org.au

Resource Scan

Scan Details

Site Domain catholiccaretas.org.au
Base Domain catholiccaretas.org.au
Scan Status Ok
Last Scan2025-05-10T10:26:11+00:00
Next Scan 2025-05-17T10:26:11+00:00

Last Scan

Scanned2025-05-10T10:26:11+00:00
URL https://catholiccaretas.org.au/robots.txt
Domain IPs 52.64.188.147
Response IP 52.64.188.147
Found Yes
Hash f8ff458047466cf1d76877a6fc10ebbdf417408b6bc37f0d5b81cfb87df9d4b2
SimHash e36899722db2

Groups

*

Rule Path
Disallow /cpresources/
Disallow /vendor/
Disallow /.env
Disallow /cache/

Other Records

Field Value
sitemap https://catholiccaretas.org.au/sitemaps-1-sitemap.xml
sitemap https://catholiccaretas.org.au/ar/sitemaps-1-sitemap.xml
sitemap https://catholiccaretas.org.au/fa/sitemaps-1-sitemap.xml
sitemap $NE_SITE_URL/sitemaps-1-sitemap.xml
sitemap $TI_SITE_URL/sitemaps-1-sitemap.xml
sitemap $OM_SITE_URL/sitemaps-1-sitemap.xml

Comments

  • robots.txt for https://catholiccaretas.org.au/
  • live - don't allow web crawlers to index cpresources/ or vendor/