discoverwildlife.com
robots.txt

Robots Exclusion Standard data for discoverwildlife.com

Resource Scan

Scan Details

Site Domain discoverwildlife.com
Base Domain discoverwildlife.com
Scan Status Ok
Last Scan2024-05-25T23:10:12+00:00
Next Scan 2024-06-01T23:10:12+00:00

Last Scan

Scanned2024-05-25T23:10:12+00:00
URL https://discoverwildlife.com/robots.txt
Redirect https://www.discoverwildlife.com/robots.txt
Redirect Domain www.discoverwildlife.com
Redirect Base discoverwildlife.com
Domain IPs 3.124.189.64
Redirect IPs 2600:9000:24bb:0:e:35bd:e6c0:93a1, 2600:9000:24bb:400:e:35bd:e6c0:93a1, 2600:9000:24bb:5000:e:35bd:e6c0:93a1, 2600:9000:24bb:6800:e:35bd:e6c0:93a1, 2600:9000:24bb:7000:e:35bd:e6c0:93a1, 2600:9000:24bb:9e00:e:35bd:e6c0:93a1, 2600:9000:24bb:a00:e:35bd:e6c0:93a1, 2600:9000:24bb:e00:e:35bd:e6c0:93a1, 3.163.24.101, 3.163.24.16, 3.163.24.19, 3.163.24.57
Response IP 108.157.52.37
Found Yes
Hash 93c9ea2ae3e7130b67d564d094803a73218ff982bd3557cef599b612f78f265a
SimHash e1489860e793

Groups

gptbot

Rule Path
Disallow /

*

Rule Path
Allow /

Other Records

Field Value
sitemap https://www.discoverwildlife.com/sitemap/index.xml.gz
sitemap https://www.discoverwildlife.com/sitemap/news.xml.gz