interhelp.org
robots.txt
Robots Exclusion Standard data for interhelp.org
Resource Scan
Scan Details
Site Domain | interhelp.org |
Base Domain | interhelp.org |
Scan Status | Failed |
Failure Stage | Fetching resource. |
Failure Reason | Server returned a client error. |
Last Scan | 2024-08-05T21:37:54+00:00 |
Next Scan | 2024-11-03T21:37:54+00:00 |
Last Successful Scan
Scanned | 2024-04-08T21:35:59+00:00 |
URL | https://interhelp.org/robots.txt |
Domain IPs | 104.21.62.184, 172.67.138.37, 2606:4700:3030::ac43:8a25, 2606:4700:3031::6815:3eb8 |
Response IP | 172.67.138.37 |
Found | Yes |
Hash | 9f59866e5326166456ec8cb9f49601eedc85f646fa3315046d8b0177d1abb06f |
SimHash | c1116f0866c3 |
Groups
*
Rule | Path |
---|---|
Disallow | /images/ |
Disallow | /buscadores/ |
Disallow | /cgi-bin/ |
Disallow | /descarga/ |
Disallow | /dist/ |
Disallow | /fernando/betas/ |
Disallow | /fernando/images/ |
Disallow | /Project/ |
Disallow | /lib/ |
Disallow | /Library/ |
Disallow | /stats/ |
Disallow | /sys/ |
Disallow | /webglimpse-1.6.edu/ |
Disallow | /glimpse-4.1-bin-Linux-2.0.30-i486/ |
Comments