theguidex.com
robots.txt

Robots Exclusion Standard data for theguidex.com

Resource Scan

Scan Details

Site Domain theguidex.com
Base Domain theguidex.com
Scan Status Ok
Last Scan2026-01-04T07:43:06+00:00
Next Scan 2026-02-03T07:43:06+00:00

Last Scan

Scanned2026-01-04T07:43:06+00:00
URL https://theguidex.com/robots.txt
Domain IPs 104.21.46.5, 172.67.221.244, 2606:4700:3031::6815:2e05, 2606:4700:3037::ac43:ddf4
Response IP 104.21.46.5
Found Yes
Hash 427f6512116d2b4e3429308cb84ae71378ecae76d6a9ba90da8309d91a372f90
SimHash 69005800ef92

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
Disallow /go/
Disallow /tag/
Allow *

Other Records

Field Value
sitemap https://theguidex.com/sitemap_index.xml