theenvironmentsite.org
robots.txt
Robots Exclusion Standard data for theenvironmentsite.org
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | theenvironmentsite.org |
Base Domain | theenvironmentsite.org |
Scan Status | Failed |
Failure Stage | Fetching resource. |
Failure Reason | Server returned a client error. |
Last Scan | 2025-08-27T04:20:31+00:00 |
Next Scan | 2025-09-03T04:20:31+00:00 |
Last Successful Scan
Field | Value |
---|---|
Scanned | 2025-08-19T03:34:18+00:00 |
URL | https://theenvironmentsite.org/robots.txt |
Redirect | https://pencethoki.com/robots.txt |
Redirect Domain | pencethoki.com |
Redirect Base | pencethoki.com |
Domain IPs | 184.168.111.100 |
Redirect IPs | 104.21.75.118, 172.67.175.146, 2606:4700:3033::ac43:af92, 2606:4700:3035::6815:4b76 |
Response IP | 104.21.75.118 |
Found | Yes |
Hash | 0d70d1e84c0f079f9b6e81a5d6e7226aeadc847afddf1d951c204a1924ac5482 |
SimHash | 685175925583 |
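The Hash value is 64 hexadecimal characters long, which is the length of a SHA-256 digest. The record does not say which algorithm or input was used, so the sketch below only assumes SHA-256 over the raw response body. The function names `body_digest` and `matches_recorded` are hypothetical and exist only for illustration.

```python
import hashlib

# Hash value copied from the Last Successful Scan record.
RECORDED_HASH = "0d70d1e84c0f079f9b6e81a5d6e7226aeadc847afddf1d951c204a1924ac5482"


def body_digest(body: bytes) -> str:
    """Return the SHA-256 hex digest of a fetched robots.txt body.

    Assumption: the scanner hashes the raw bytes with SHA-256.
    """
    return hashlib.sha256(body).hexdigest()


def matches_recorded(body: bytes) -> bool:
    """Report whether a freshly fetched body has the recorded digest."""
    return body_digest(body) == RECORDED_HASH
```

If the assumption holds, a mismatch between a new fetch and the recorded value would indicate that the file changed after the last successful scan.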
Groups
*
Rule | Path |
---|---|
Disallow | |
Disallow | /cgi-bin/ |
Other Records
Field | Value |
---|---|
sitemap | https://pencethoki.com/sitemap.xml |
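The rules recorded above can be read as follows: the empty Disallow line blocks nothing, and the second line blocks paths under /cgi-bin/ for every crawler. The sketch below shows one way to evaluate them, using a longest-matching-prefix rule. The `ROBOTS_TXT` text is reconstructed from the tables and may differ from the original file in spacing, case, and comments. The helpers `parse_rules` and `is_allowed` are hypothetical, handle only a single `*` group, and do not support wildcards.

```python
# Reconstructed from the Groups and Other Records tables (an assumption,
# not a verbatim copy of the fetched file).
ROBOTS_TXT = """\
User-agent: *
Disallow:
Disallow: /cgi-bin/
Sitemap: https://pencethoki.com/sitemap.xml
"""


def parse_rules(text):
    """Return (rules, sitemaps); rules are (allow, path_prefix) pairs.

    Simplification: every Allow/Disallow line is treated as belonging
    to the single '*' group recorded for this site.
    """
    rules, sitemaps = [], []
    for raw in text.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop comments
        if ":" not in line:
            continue
        field, value = (part.strip() for part in line.split(":", 1))
        field = field.lower()
        if field == "sitemap":
            sitemaps.append(value)
        elif field in ("allow", "disallow") and value:
            # An empty Disallow value matches nothing, so it is skipped.
            rules.append((field == "allow", value))
    return rules, sitemaps


def is_allowed(rules, path):
    """Longest matching prefix wins; a path with no match is allowed."""
    best_len, allowed = -1, True
    for allow, prefix in rules:
        if not path.startswith(prefix):
            continue
        # Prefer the longer prefix; on a tie, prefer Allow.
        if len(prefix) > best_len or (len(prefix) == best_len and allow):
            best_len, allowed = len(prefix), allow
    return allowed


rules, sitemaps = parse_rules(ROBOTS_TXT)
print(is_allowed(rules, "/"))                  # True
print(is_allowed(rules, "/cgi-bin/test.cgi"))  # False
print(sitemaps)
```

A hand-written matcher is used here so the precedence rule is explicit. Library parsers may order or weigh rules differently, so their results for the empty Disallow line should be checked rather than assumed.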