protocol.com
robots.txt
Robots Exclusion Standard data for protocol.com
Resource Scan
Scan Details
Site Domain | protocol.com |
Base Domain | protocol.com |
Scan Status | Failed |
Failure Stage | Fetching resource. |
Failure Reason | Couldn't connect to server. |
Last Scan | 2024-08-15T21:08:10+00:00 |
Next Scan | 2024-11-13T21:08:10+00:00 |
Last Successful Scan
Scanned | 2023-06-27T12:06:39+00:00 |
URL | https://protocol.com/robots.txt |
Redirect | https://www.protocol.com/robots.txt |
Redirect Domain | www.protocol.com |
Redirect Base | protocol.com |
Domain IPs | 54.243.223.181, 54.243.223.182 |
Redirect IPs | 151.101.1.91, 151.101.129.91, 151.101.193.91, 151.101.65.91 |
Response IP | 199.232.45.91 |
Found | Yes |
Hash | dd49f9a35f37a3fa95da434d1b9195e6957b35607af1a6391d90506f87e75af2 |
SimHash | 244db401cf93 |
Groups
*
Rule | Path |
---|---|
Disallow | /core/* |
Disallow | /r/* |
Disallow | /mnt/* |
Disallow | /res/* |
Disallow | /static/* |
Other Records
Field | Value |
---|---|
sitemap | https://www.protocol.com/sitemap.xml |