dianalegacy.com
robots.txt
Robots Exclusion Standard data for dianalegacy.com
Resource Scan
Scan Details
Site Domain | dianalegacy.com |
Base Domain | dianalegacy.com |
Scan Status | Ok |
Last Scan | 2024-11-09T11:46:53+00:00 |
Next Scan | 2024-11-16T11:46:53+00:00 |
Last Scan
Scanned | 2024-11-09T11:46:53+00:00 |
URL | https://dianalegacy.com/robots.txt |
Redirect | https://www.gone-hollywood.com/robots.txt |
Redirect Domain | www.gone-hollywood.com |
Redirect Base | gone-hollywood.com |
Domain IPs | 104.21.12.100, 172.67.152.21, 2606:4700:3031::6815:c64, 2606:4700:3035::ac43:9815 |
Redirect IPs | 104.21.49.153, 172.67.164.188, 2606:4700:3034::ac43:a4bc, 2606:4700:3037::6815:3199 |
Response IP | 172.67.164.188 |
Found | Yes |
Hash | b88104b5dad7727bf607dc7f0943e63f443d7f4faaea1e6388e5a6b031ae9bdc |
SimHash | 29135311c7d1 |
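
The Hash value above is 64 hexadecimal characters long, which is consistent with a SHA-256 digest, though the scan does not name the algorithm. As a minimal sketch under that assumption, a fetched robots.txt body could be fingerprinted as follows. The helper name `sha256_hex` and the sample body are hypothetical and are not taken from the scanned file.

```python
import hashlib


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of the raw response body as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


# Hypothetical body for illustration only; this is NOT the scanned file.
sample_body = b"User-agent: *\nDisallow: /tmp/\n"
digest = sha256_hex(sample_body)

# A SHA-256 hex digest always has 64 characters, like the Hash field above.
print(len(digest))
```

Comparing digests between two scans would show whether the file changed byte for byte. Whether the scanner hashes the raw bytes or a normalized form is not stated.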
Groups
*
Rule | Path |
---|---|
Disallow | /cgi-bin/ |
Disallow | /tmp/ |
Allow | /*.jpg$ |
Allow | /*.jpeg$ |
Allow | /*.gif$ |
Allow | /*.png$ |
Allow | /*.webp$ |
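
The rules in this group mix plain path prefixes with wildcard patterns (`*` for any run of characters, `$` for end of path). The sketch below shows one way a crawler might evaluate them, assuming the longest-match precedence described in RFC 9309, where the most specific (longest) matching pattern wins and Allow wins a tie. The function names `pattern_to_regex` and `is_allowed` are hypothetical, and this is not necessarily how any particular crawler or this scanner implements matching.

```python
import re

# Rules copied from the "*" group in the table above.
RULES = [
    ("Disallow", "/cgi-bin/"),
    ("Disallow", "/tmp/"),
    ("Allow", "/*.jpg$"),
    ("Allow", "/*.jpeg$"),
    ("Allow", "/*.gif$"),
    ("Allow", "/*.png$"),
    ("Allow", "/*.webp$"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the
    pattern to the end of the path. Everything else is literal.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if the path may be crawled under the given rules.

    Assumed precedence: the longest matching pattern wins, Allow wins
    a tie, and a path that matches no rule is allowed.
    """
    best_len = -1
    allowed = True
    for kind, pattern in rules:
        # re.match anchors at the start of the path, like a prefix rule.
        if pattern_to_regex(pattern).match(path):
            is_allow = kind == "Allow"
            if len(pattern) > best_len or (len(pattern) == best_len and is_allow):
                best_len = len(pattern)
                allowed = is_allow
    return allowed


for path in ["/index.html", "/tmp/cache.bin", "/tmp/photo.jpg", "/cgi-bin/photo.jpg"]:
    print(path, is_allowed(path))
```

Under these assumptions, `/tmp/photo.jpg` is allowed because `/*.jpg$` (7 characters) is longer than `/tmp/` (5 characters), while `/cgi-bin/photo.jpg` stays disallowed because `/cgi-bin/` (9 characters) is the longer match. Python's standard `urllib.robotparser` is not used here because it does not implement wildcard matching.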
Other Records
Field | Value |
---|---|
sitemap | https://www.gone-hollywood.com/sitemap.xml |
Warnings
- 2 invalid lines.