blogheist.com
robots.txt
Robots Exclusion Standard data for blogheist.com
Resource Scan
Scan Details
Site Domain | blogheist.com |
Base Domain | blogheist.com |
Scan Status | Failed |
Failure Stage | Fetching resource. |
Failure Reason | Server returned a client error. |
Last Scan | 2025-09-16T05:21:51+00:00 |
Next Scan | 2025-12-15T05:21:51+00:00 |
Last Successful Scan
Scanned | 2025-05-19T20:25:33+00:00 |
URL | https://blogheist.com/robots.txt |
Domain IPs | 104.21.49.92, 172.67.189.114, 2606:4700:3033::6815:315c, 2606:4700:3033::ac43:bd72 |
Response IP | 104.21.49.92 |
Found | Yes |
Hash | 5c9391915b518a2653c9eedf55a72c3aa900faeab6b9f5703fa3ed5b7264ecf9 |
SimHash | e1194d87c7f6 |
Groups
*
Rule | Path |
---|---|
Disallow | /cgi-bin/ |
Disallow | /wp-admin/ |
Disallow | /linkout/ |
Disallow | */feed/ |
Disallow | /recommends/ |
Disallow | /r/ |
Disallow | /trackback/ |
Disallow | /index.php |
Disallow | /xmlrpc.php |
Disallow | /tag/ |
Disallow | *?amp |
Disallow | /amp/ |
Disallow | *? |
Other Records
Field | Value |
---|---|
sitemap | https://blogheist.com/sitemap_index.xml |