101greatgoals.com
robots.txt

Robots Exclusion Standard data for 101greatgoals.com

Resource Scan

Scan Details

Site Domain 101greatgoals.com
Base Domain 101greatgoals.com
Scan Status Ok
Last Scan 2024-05-01T21:32:23+00:00
Next Scan 2024-05-08T21:32:23+00:00

Last Scan

Scanned 2024-05-01T21:32:23+00:00
URL https://101greatgoals.com/robots.txt
Redirect https://www.101greatgoals.com:443/robots.txt
Redirect Domain www.101greatgoals.com
Redirect Base 101greatgoals.com
Domain IPs 15.197.163.213, 3.33.162.52
Redirect IPs 173.222.148.34, 173.222.148.35, 2600:1413:b000:13::b857:c18f, 2600:1413:b000:13::b857:c190
Response IP 42.99.140.218
Found Yes
Hash 9e8444be0f0806237ad51ffec9fe708d8026f13429aeec8b8044d11077a7356b
SimHash 2b008514ed92

Groups

*

No rules defined. All paths allowed.

Other Records

Field Value
sitemap https://www.101greatgoals.com/arc/outboundfeeds/sitemap-index-standard/subtype/default/?outputType=xml
sitemap https://www.101greatgoals.com/arc/outboundfeeds/sitemap-section/?outputType=xml
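The records above (a single `*` group with no rules, plus two sitemap fields) can be checked with a short Python sketch using the standard library's `urllib.robotparser`. The `ROBOTS_TXT` string is a reconstruction from this report, not the file as served, and the `ExampleBot` user agent and the test path are hypothetical names chosen for illustration. `site_maps()` requires Python 3.8 or later.

```python
from urllib.robotparser import RobotFileParser

# Assumption: reconstructed from the scan report above, not the exact
# bytes served by the site. One "*" group with no rules, two sitemaps.
ROBOTS_TXT = """\
User-agent: *
Sitemap: https://www.101greatgoals.com/arc/outboundfeeds/sitemap-index-standard/subtype/default/?outputType=xml
Sitemap: https://www.101greatgoals.com/arc/outboundfeeds/sitemap-section/?outputType=xml
"""

parser = RobotFileParser()
# parse() takes an iterable of lines, so no network request is made here.
parser.parse(ROBOTS_TXT.splitlines())

# "ExampleBot" and the path are hypothetical; with no Disallow rules,
# any agent should be permitted to fetch any path.
allowed = parser.can_fetch("ExampleBot", "https://www.101greatgoals.com/any/path")

# site_maps() returns the Sitemap field values, or None if there are none.
sitemaps = parser.site_maps()

print("allowed:", allowed)
for url in sitemaps or []:
    print("sitemap:", url)
```

To check the live file rather than this reconstruction, `set_url()` followed by `read()` on the same parser fetches and parses it over the network.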