thecaglediaries.com
robots.txt

Robots Exclusion Standard data for thecaglediaries.com

Resource Scan

Scan Details

Site Domain thecaglediaries.com
Base Domain thecaglediaries.com
Scan Status Ok
Last Scan 2025-03-31T11:13:21+00:00
Next Scan 2025-04-07T11:13:21+00:00

Last Scan

Scanned 2025-03-31T11:13:21+00:00
URL https://thecaglediaries.com/robots.txt
Domain IPs 104.21.43.12, 172.67.215.179, 2606:4700:3030::ac43:d7b3, 2606:4700:3036::6815:2b0c
Response IP 172.67.215.179
Found Yes
Hash 7846202ab9b83d9ccb7b807bc7e7053e1095a4e4b3f7ffcf6c5489e79b3a99a7
SimHash 6105d84089b3

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php

gptbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://thecaglediaries.com/sitemap_index.xml
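
Example

The groups and sitemap record above can be read back as a robots.txt file and checked against sample paths. The following is a minimal sketch, not the scanner's own logic. The ROBOTS_TXT text is a reconstruction from the scan data (the real file's ordering, casing, and comments may differ), and parse_groups and is_allowed are hypothetical helper names. It assumes longest-match precedence as described in RFC 9309, exact matching on the lowercased user-agent token, and plain prefix matching with no wildcard support.

```python
# Reconstruction of the scanned file from the report above (an assumption:
# the actual file may differ in ordering, casing, or comments).
ROBOTS_TXT = """\
User-agent: *
Disallow: /wp-admin/
Allow: /wp-admin/admin-ajax.php

User-agent: GPTBot
Disallow: /

Sitemap: https://thecaglediaries.com/sitemap_index.xml
"""


def parse_groups(text):
    """Return ({lowercased user agent: [(rule, path), ...]}, [sitemap URLs])."""
    groups, sitemaps = {}, []
    current_agents = []   # user agents that the next rules apply to
    seen_rule = False     # True once the current group has at least one rule
    for raw in text.splitlines():
        line = raw.split("#", 1)[0].strip()   # drop comments and whitespace
        if ":" not in line:
            continue
        field, value = line.split(":", 1)
        field, value = field.strip().lower(), value.strip()
        if field == "user-agent":
            if seen_rule:                     # a new group starts here
                current_agents = []
                seen_rule = False
            current_agents.append(value.lower())
            groups.setdefault(value.lower(), [])
        elif field in ("allow", "disallow"):
            seen_rule = True
            for agent in current_agents:
                groups[agent].append((field, value))
        elif field == "sitemap":              # sitemap records are group-independent
            sitemaps.append(value)
    return groups, sitemaps


def is_allowed(groups, user_agent, path):
    """Longest matching prefix wins; Allow wins a tie; no match means allowed."""
    # Only one group applies: the agent's own group if present, else "*".
    rules = groups.get(user_agent.lower(), groups.get("*", []))
    best_len, allowed = -1, True
    for rule, prefix in rules:
        if not prefix or not path.startswith(prefix):
            continue
        is_allow = rule == "allow"
        if len(prefix) > best_len or (len(prefix) == best_len and is_allow):
            best_len, allowed = len(prefix), is_allow
    return allowed


groups, sitemaps = parse_groups(ROBOTS_TXT)

print(is_allowed(groups, "Googlebot", "/wp-admin/"))                # False
print(is_allowed(groups, "Googlebot", "/wp-admin/admin-ajax.php"))  # True
print(is_allowed(groups, "GPTBot", "/wp-admin/admin-ajax.php"))     # False
print(sitemaps)
```

Under these assumptions, generic crawlers are kept out of /wp-admin/ except for admin-ajax.php, while gptbot is blocked from the whole site because its own group replaces the * group rather than adding to it. Python's standard urllib.robotparser is not used here because it evaluates rules in file order rather than by longest match, so it would likely report admin-ajax.php as blocked for a file written in this order.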