marthanew.com
robots.txt
Robots Exclusion Standard data for marthanew.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | marthanew.com |
Base Domain | marthanew.com |
Scan Status | Failed |
Failure Stage | Fetching resource. |
Failure Reason | Couldn't connect to server. |
Last Scan | 2024-11-04T10:58:46+00:00 |
Next Scan | 2025-02-02T10:58:46+00:00 |
Last Successful Scan
Field | Value |
---|---|
Scanned | 2024-01-10T03:35:59+00:00 |
URL | https://marthanew.com/robots.txt |
Domain IPs | 104.21.41.167, 172.67.191.186, 2606:4700:3033::ac43:bfba, 2606:4700:3036::6815:29a7 |
Response IP | 104.21.41.167 |
Found | Yes |
Hash | e22d14083f98319b176568b5febac2333fb4513a309a6bc103b935ce267f16c0 |
SimHash | 2919d337d7d1 |
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | /cgi-bin/ |
Disallow | /tmp/ |
Allow | /*.jpg$ |
Allow | /*.jpeg$ |
Allow | /*.gif$ |
Allow | /*.png$ |
Allow | /*.webp$ |
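
To illustrate how a crawler might evaluate the rules in the `*` group, here is a minimal Python sketch. It assumes the longest-match precedence described in RFC 9309 (the matching rule with the longest pattern wins, and Allow wins a tie), with `*` as a wildcard and a trailing `$` as an end-of-path anchor. The names `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical, and individual crawlers may resolve conflicts differently.

```python
import re

# Rules copied from the "*" group in the table above.
RULES = [
    ("disallow", "/cgi-bin/"),
    ("disallow", "/tmp/"),
    ("allow", "/*.jpg$"),
    ("allow", "/*.jpeg$"),
    ("allow", "/*.gif$"),
    ("allow", "/*.png$"),
    ("allow", "/*.webp$"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the
    pattern to the end of the path. Everything else is literal.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + (r"\Z" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be fetched under the assumed precedence.

    Assumption: the longest matching pattern wins, Allow wins ties,
    and a path that matches no rule is allowed.
    """
    best_len = -1
    best_allow = True
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            allow = kind == "allow"
            longer = len(pattern) > best_len
            tie_for_allow = len(pattern) == best_len and allow and not best_allow
            if longer or tie_for_allow:
                best_len, best_allow = len(pattern), allow
    return best_allow


if __name__ == "__main__":
    for p in ["/about", "/tmp/notes.txt", "/tmp/photo.jpg", "/cgi-bin/logo.png"]:
        print(p, is_allowed(p))
```

Under these assumptions, `/tmp/photo.jpg` is allowed because `/*.jpg$` (7 characters) is longer than `/tmp/` (5 characters), while `/cgi-bin/logo.png` stays blocked because `/cgi-bin/` (9 characters) is longer than `/*.png$`.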
Other Records
Field | Value |
---|---|
sitemap | https://marthanew.com/sitemap.xml |
Warnings
- 2 invalid lines.
- `user-agen` is not a known field (probably a misspelling of `user-agent`).
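
A warning like the one above could be produced by a simple field-name check. The following Python sketch is illustrative only: the `KNOWN_FIELDS` set, the `find_unknown_fields` helper, and the `SAMPLE` text are all hypothetical, and the sample is not the actual content of the scanned file.

```python
# Assumed set of recognised field names; real scanners may accept more or fewer.
KNOWN_FIELDS = {"user-agent", "allow", "disallow", "sitemap", "crawl-delay"}


def find_unknown_fields(text):
    """Return (line_number, detail, reason) tuples for suspicious lines."""
    problems = []
    for number, raw in enumerate(text.splitlines(), start=1):
        # Drop comments and surrounding whitespace; skip blank lines.
        line = raw.split("#", 1)[0].strip()
        if not line:
            continue
        field, sep, _value = line.partition(":")
        if not sep:
            problems.append((number, line, "missing ':' separator"))
            continue
        name = field.strip().lower()
        if name not in KNOWN_FIELDS:
            problems.append((number, name, "unknown field"))
    return problems


# Hypothetical input, used only to exercise the check.
SAMPLE = """\
user-agen: *
Disallow: /tmp/
User-agent: *
Disallow: /cgi-bin/
"""

if __name__ == "__main__":
    for problem in find_unknown_fields(SAMPLE):
        print(problem)
```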