101greatgoals.com
robots.txt
Robots Exclusion Standard data for 101greatgoals.com
Resource Scan
Scan Details
Site Domain | 101greatgoals.com |
Base Domain | 101greatgoals.com |
Scan Status | Ok |
Last Scan | 2024-05-01T21:32:23+00:00 |
Next Scan | 2024-05-08T21:32:23+00:00 |
Last Scan
Scanned | 2024-05-01T21:32:23+00:00 |
URL | https://101greatgoals.com/robots.txt |
Redirect | https://www.101greatgoals.com:443/robots.txt |
Redirect Domain | www.101greatgoals.com |
Redirect Base | 101greatgoals.com |
Domain IPs | 15.197.163.213, 3.33.162.52 |
Redirect IPs | 173.222.148.34, 173.222.148.35, 2600:1413:b000:13::b857:c18f, 2600:1413:b000:13::b857:c190 |
Response IP | 42.99.140.218 |
Found | Yes |
Hash | 9e8444be0f0806237ad51ffec9fe708d8026f13429aeec8b8044d11077a7356b |
SimHash | 2b008514ed92 |
Other Records
Field | Value |
---|---|
sitemap | https://www.101greatgoals.com/arc/outboundfeeds/sitemap-index-standard/subtype/default/?outputType=xml |
sitemap | https://www.101greatgoals.com/arc/outboundfeeds/sitemap-section/?outputType=xml |