polahoki.net
robots.txt
Robots Exclusion Standard data for polahoki.net
Resource Scan
Scan Details
Site Domain | polahoki.net |
Base Domain | polahoki.net |
Scan Status | Failed |
Failure Stage | Fetching resource. |
Failure Reason | Couldn't connect to server. |
Last Scan | 2024-10-31T01:26:31+00:00 |
Next Scan | 2025-01-29T01:26:31+00:00 |
Last Successful Scan
Scanned | 2023-12-14T00:52:43+00:00 |
URL | https://polahoki.net/robots.txt |
Domain IPs | 104.21.78.6, 172.67.214.68, 2606:4700:3032::6815:4e06, 2606:4700:3032::ac43:d644 |
Response IP | 172.67.214.68 |
Found | Yes |
Hash | e22d14083f98319b176568b5febac2333fb4513a309a6bc103b935ce267f16c0 |
SimHash | 2919d337d7d1 |
Groups
*
Rule | Path |
---|---|
Disallow | /cgi-bin/ |
Disallow | /tmp/ |
Allow | /*.jpg$ |
Allow | /*.jpeg$ |
Allow | /*.gif$ |
Allow | /*.png$ |
Allow | /*.webp$ |
Other Records
Field | Value |
---|---|
sitemap | https://marthanew.com/sitemap.xml |
Warnings
- 2 invalid lines.
- `user-agen` is not a known field.