thecaq.org
robots.txt
Robots Exclusion Standard data for thecaq.org
Resource Scan
Scan Details
| Site Domain | thecaq.org |
| Base Domain | thecaq.org |
| Scan Status | Ok |
| Last Scan | 2025-10-17T04:48:58+00:00 |
| Next Scan | 2025-11-16T04:48:58+00:00 |
Last Scan
| Scanned | 2025-10-17T04:48:58+00:00 |
| URL | https://thecaq.org/robots.txt |
| Redirect | https://www.thecaq.org/robots.txt |
| Redirect Domain | www.thecaq.org |
| Redirect Base | thecaq.org |
| Domain IPs | 13.35.37.100, 13.35.37.16, 13.35.37.53, 13.35.37.58 |
| Redirect IPs | 13.35.37.100, 13.35.37.16, 13.35.37.53, 13.35.37.58, 2600:9000:213e:1600:6:6dd3:7b40:93a1, 2600:9000:213e:1800:6:6dd3:7b40:93a1, 2600:9000:213e:200:6:6dd3:7b40:93a1, 2600:9000:213e:b200:6:6dd3:7b40:93a1, 2600:9000:213e:da00:6:6dd3:7b40:93a1, 2600:9000:213e:e000:6:6dd3:7b40:93a1, 2600:9000:213e:ee00:6:6dd3:7b40:93a1, 2600:9000:213e:fc00:6:6dd3:7b40:93a1 |
| Response IP | 13.35.37.100 |
| Found | Yes |
| Hash | 05065538ea14cc32c334678e40009baad718288acfada2673a8a009790dc1c2f |
| SimHash | 4a54d4308b12 |
Groups
*
| Rule | Path |
|---|---|
| Allow | / |
| Disallow | /api/ |
| Disallow | /wp-admin/ |
| Disallow | /wp-json/ |
| Disallow | /resource-hub |
Other Records
| Field | Value |
|---|---|
| sitemap | https://www.thecaq.org/sitemap.xml |
| sitemap | https://www.thecaq.org/sitemaps/builder/sitemap.xml |
| sitemap | https://www.thecaq.org/sitemaps/wp/sitemap.xml |
Warnings
- `host` is not a known field.
Comments