thocauca.com
robots.txt
Robots Exclusion Standard data for thocauca.com
Resource Scan
Scan Details
| Site Domain | thocauca.com |
| Base Domain | thocauca.com |
| Scan Status | Ok |
| Last Scan | 2026-01-16T09:09:59+00:00 |
| Next Scan | 2026-01-23T09:09:59+00:00 |
Last Scan
| Scanned | 2026-01-16T09:09:59+00:00 |
| URL | https://thocauca.com/robots.txt |
| Domain IPs | 104.21.21.240, 172.67.201.109, 2606:4700:3031::6815:15f0, 2606:4700:3033::ac43:c96d |
| Response IP | 104.21.21.240 |
| Found | Yes |
| Hash | a2c166a3d1f214092aa32bb3bfc25d7bc12a52cd785caec493120d76f511b464 |
| SimHash | 46350b53cd94 |
Groups
*
| Rule | Path |
|---|---|
| Allow | / |
*
| Rule | Path |
|---|---|
| Disallow | /wp-admin/ |
| Allow | /wp-admin/admin-ajax.php |
Other Records
| Field | Value |
|---|---|
| sitemap | https://thocauca.com/wp-sitemap.xml |
Warnings
- `content-signal` is not a known field.
Comments