allthingsnature.org
robots.txt

Robots Exclusion Standard data for allthingsnature.org

Resource Scan

Scan Details

Site Domain allthingsnature.org
Base Domain allthingsnature.org
Scan Status Ok
Last Scan2024-11-14T04:22:10+00:00
Next Scan 2024-11-21T04:22:10+00:00

Last Scan

Scanned2024-11-14T04:22:10+00:00
URL https://allthingsnature.org/robots.txt
Redirect https://www.allthingsnature.org/robots.txt
Redirect Domain www.allthingsnature.org
Redirect Base allthingsnature.org
Domain IPs 52.52.200.44, 54.183.108.250
Redirect IPs 108.157.254.108, 108.157.254.50, 108.157.254.81, 108.157.254.93, 2600:9000:2753:1a00:9:2198:cb00:93a1, 2600:9000:2753:4400:9:2198:cb00:93a1, 2600:9000:2753:6800:9:2198:cb00:93a1, 2600:9000:2753:8000:9:2198:cb00:93a1, 2600:9000:2753:ac00:9:2198:cb00:93a1, 2600:9000:2753:c00:9:2198:cb00:93a1, 2600:9000:2753:e400:9:2198:cb00:93a1, 2600:9000:2753:f800:9:2198:cb00:93a1
Response IP 108.157.254.81
Found Yes
Hash d7a13bc13bf4a900433670afce45d32ce6725bdec50f626b57132fbbdd79ac70
SimHash ab01d72f3193

Groups

*

Rule Path
Disallow /s/
Disallow /templates/
Disallow /d/
Disallow /related/
Disallow /relevant/
Disallow /videos/
Disallow /captcha.php
Disallow /*?expand_article
Disallow /*.js?cb=
Disallow /quizzes*

mediapartners-google

Rule Path
Allow /s/
Allow /related/
Allow /relevant/

Other Records

Field Value
sitemap https://www.allthingsnature.org/sitemap-allthingsnature.org-index.xml