blogheist.com
robots.txt

Robots Exclusion Standard data for blogheist.com

Resource Scan

Scan Details

Site Domain blogheist.com
Base Domain blogheist.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2025-09-16T05:21:51+00:00
Next Scan 2025-12-15T05:21:51+00:00

Last Successful Scan

Scanned2025-05-19T20:25:33+00:00
URL https://blogheist.com/robots.txt
Domain IPs 104.21.49.92, 172.67.189.114, 2606:4700:3033::6815:315c, 2606:4700:3033::ac43:bd72
Response IP 104.21.49.92
Found Yes
Hash 5c9391915b518a2653c9eedf55a72c3aa900faeab6b9f5703fa3ed5b7264ecf9
SimHash e1194d87c7f6

Groups

*

Rule Path
Disallow /cgi-bin/
Disallow /wp-admin/
Disallow /linkout/
Disallow */feed/
Disallow /recommends/
Disallow /r/
Disallow /trackback/
Disallow /index.php
Disallow /xmlrpc.php
Disallow /tag/
Disallow *?amp
Disallow /amp/
Disallow *?

Other Records

Field Value
sitemap https://blogheist.com/sitemap_index.xml