thecaq.org
robots.txt

Robots Exclusion Standard data for thecaq.org

Resource Scan

Scan Details

Site Domain thecaq.org
Base Domain thecaq.org
Scan Status Ok
Last Scan 2025-10-17T04:48:58+00:00
Next Scan 2025-11-16T04:48:58+00:00

Last Scan

Scanned 2025-10-17T04:48:58+00:00
URL https://thecaq.org/robots.txt
Redirect https://www.thecaq.org/robots.txt
Redirect Domain www.thecaq.org
Redirect Base thecaq.org
Domain IPs 13.35.37.100, 13.35.37.16, 13.35.37.53, 13.35.37.58
Redirect IPs 13.35.37.100, 13.35.37.16, 13.35.37.53, 13.35.37.58, 2600:9000:213e:1600:6:6dd3:7b40:93a1, 2600:9000:213e:1800:6:6dd3:7b40:93a1, 2600:9000:213e:200:6:6dd3:7b40:93a1, 2600:9000:213e:b200:6:6dd3:7b40:93a1, 2600:9000:213e:da00:6:6dd3:7b40:93a1, 2600:9000:213e:e000:6:6dd3:7b40:93a1, 2600:9000:213e:ee00:6:6dd3:7b40:93a1, 2600:9000:213e:fc00:6:6dd3:7b40:93a1
Response IP 13.35.37.100
Found Yes
Hash 05065538ea14cc32c334678e40009baad718288acfada2673a8a009790dc1c2f
SimHash 4a54d4308b12

Groups

*

Rule Path
Allow /
Disallow /api/
Disallow /wp-admin/
Disallow /wp-json/
Disallow /resource-hub
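
The rules in this group can be checked against a path with a short sketch. It assumes the longest-match precedence described in RFC 9309 (the most specific matching rule wins, and Allow wins a tie) and does not handle the `*` or `$` wildcards, which none of the rules above use. The names `RULES` and `is_allowed` are illustrative and are not part of the scan.

```python
# Rules copied from the "*" group above, as (directive, path prefix) pairs.
RULES = [
    ("allow", "/"),
    ("disallow", "/api/"),
    ("disallow", "/wp-admin/"),
    ("disallow", "/wp-json/"),
    ("disallow", "/resource-hub"),
]


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under longest-match precedence."""
    best_len = -1
    allowed = True  # a path matched by no rule is allowed by default
    for directive, prefix in rules:
        if not path.startswith(prefix):
            continue
        is_allow = directive == "allow"
        # The longer (more specific) prefix wins; on equal length, Allow wins.
        if len(prefix) > best_len or (len(prefix) == best_len and is_allow):
            best_len = len(prefix)
            allowed = is_allow
    return allowed


if __name__ == "__main__":
    for p in ["/", "/about", "/api/v1", "/wp-admin/", "/resource-hub/reports"]:
        print(p, "allowed" if is_allowed(p) else "disallowed")
```

Because `Disallow /resource-hub` has no trailing slash, it is a plain prefix and also matches paths such as `/resource-hubs`, whereas `Disallow /api/` leaves the bare path `/api` allowed.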

Other Records

Field Value
sitemap https://www.thecaq.org/sitemap.xml
sitemap https://www.thecaq.org/sitemaps/builder/sitemap.xml
sitemap https://www.thecaq.org/sitemaps/wp/sitemap.xml

Comments

  • *
  • Host
  • Sitemaps

Warnings

  • `host` is not a known field.
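
A warning like this one could be produced by a line-based pass over the file. The sketch below is only an assumption about how such a check might work, not the scanner's actual implementation: `KNOWN_FIELDS`, `scan`, and `SAMPLE` are hypothetical, and `SAMPLE` is an illustrative stand-in rather than the real contents of the file (in particular, the `Host` value is invented for the example).

```python
# Assumed set of recognised fields; the scanner's real list is not shown.
KNOWN_FIELDS = {"user-agent", "allow", "disallow", "sitemap"}

# Illustrative stand-in text, NOT the actual robots.txt of the site.
SAMPLE = """\
# *
User-agent: *
Allow: /
Disallow: /api/

# Host
Host: https://www.thecaq.org

# Sitemaps
Sitemap: https://www.thecaq.org/sitemap.xml
"""


def scan(text):
    """Return (sitemap URLs, warnings) found in robots.txt-style text."""
    sitemaps, warnings = [], []
    for raw in text.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop comments and whitespace
        if not line or ":" not in line:
            continue
        # Split on the first colon only, so URLs in the value stay intact.
        field, value = line.split(":", 1)
        field = field.strip().lower()  # field names are case-insensitive
        value = value.strip()
        if field == "sitemap":
            sitemaps.append(value)
        elif field not in KNOWN_FIELDS:
            message = f"`{field}` is not a known field."
            if message not in warnings:
                warnings.append(message)
    return sitemaps, warnings


if __name__ == "__main__":
    found_sitemaps, found_warnings = scan(SAMPLE)
    print(found_sitemaps)
    print(found_warnings)
```

Sitemap records are collected separately from group rules because they apply to the whole file and not to any one user-agent group.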