thehealthboard.com
robots.txt

Robots Exclusion Standard data for thehealthboard.com

Resource Scan

Scan Details

Site Domain thehealthboard.com
Base Domain thehealthboard.com
Scan Status Ok
Last Scan 2024-09-28T18:12:02+00:00
Next Scan 2024-10-05T18:12:02+00:00

Last Scan

Scanned 2024-09-28T18:12:02+00:00
URL https://thehealthboard.com/robots.txt
Redirect https://www.thehealthboard.com/robots.txt
Redirect Domain www.thehealthboard.com
Redirect Base thehealthboard.com
Domain IPs 52.52.207.191, 54.215.149.28
Redirect IPs 108.157.254.108, 108.157.254.50, 108.157.254.81, 108.157.254.93, 2600:9000:2055:4000:9:2198:cb00:93a1, 2600:9000:2055:4c00:9:2198:cb00:93a1, 2600:9000:2055:6c00:9:2198:cb00:93a1, 2600:9000:2055:8a00:9:2198:cb00:93a1, 2600:9000:2055:b200:9:2198:cb00:93a1, 2600:9000:2055:c800:9:2198:cb00:93a1, 2600:9000:2055:e400:9:2198:cb00:93a1, 2600:9000:2055:fc00:9:2198:cb00:93a1
Response IP 108.157.254.50
Found Yes
Hash e5e8b1e8e473b1233f66e3e18f5f34c68a3610db6828001ee916905376e309a5
SimHash ab01d33e3793
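
The Hash value is 64 hexadecimal digits long, which is consistent with a SHA-256 digest of the fetched file. The report does not name the algorithm, so that is an assumption. Under it, a change check between two scans could be sketched as below; the function names are hypothetical and are not part of the scanner.

```python
import hashlib


def fingerprint(body: bytes) -> str:
    """Return a hex digest of a robots.txt body.

    Assumption: SHA-256, chosen only because the report's Hash field
    is 64 hex digits long. The scanner's real algorithm is not stated.
    """
    return hashlib.sha256(body).hexdigest()


def has_changed(previous_hash: str, body: bytes) -> bool:
    """Compare a stored hash with the digest of a freshly fetched body."""
    return fingerprint(body) != previous_hash.lower()
```

A crawler that stored the hash from the last scan could call has_changed with the new response body and re-parse the rules only when it returns True.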

Groups

*

Rule Path
Disallow /s/
Disallow /templates/
Disallow /d/
Disallow /related/
Disallow /relevant/
Disallow /videos/
Disallow /captcha.php
Disallow /*?expand_article
Disallow /*.js?cb=
Disallow /quizzes*

mediapartners-google

Rule Path
Allow /s/
Allow /related/
Allow /relevant/
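
The two groups above can be evaluated with the longest-match rule from the Robots Exclusion Protocol, where "*" matches any run of characters and an Allow rule wins a tie. The following is a minimal sketch, not the scanner's own logic. The names GROUPS, pattern_to_regex and is_allowed are hypothetical, and the user-agent lookup is simplified to an exact, case-insensitive match on the group name.

```python
import re

# Rules transcribed from the two groups listed in this report.
GROUPS = {
    "*": [
        ("disallow", "/s/"),
        ("disallow", "/templates/"),
        ("disallow", "/d/"),
        ("disallow", "/related/"),
        ("disallow", "/relevant/"),
        ("disallow", "/videos/"),
        ("disallow", "/captcha.php"),
        ("disallow", "/*?expand_article"),
        ("disallow", "/*.js?cb="),
        ("disallow", "/quizzes*"),
    ],
    "mediapartners-google": [
        ("allow", "/s/"),
        ("allow", "/related/"),
        ("allow", "/relevant/"),
    ],
}


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a regex.

    "*" becomes ".*"; a trailing "$" anchors the end of the path.
    All other characters are matched literally.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(user_agent, path):
    """Return True if the path may be fetched by the given user agent.

    Simplification (assumption): the group is chosen by exact,
    case-insensitive name; anything else falls back to the "*" group.
    """
    rules = GROUPS.get(user_agent.lower(), GROUPS["*"])
    best_len, allowed = -1, True  # no matching rule means allowed
    for kind, pattern in rules:
        # re.match anchors at the start of the path, as the protocol requires.
        if pattern_to_regex(pattern).match(path):
            length = len(pattern)
            if length > best_len or (length == best_len and kind == "allow"):
                best_len, allowed = length, kind == "allow"
    return allowed
```

Under these assumptions, is_allowed("examplebot", "/s/query") returns False, while is_allowed("Mediapartners-Google", "/s/query") returns True. A crawler that matches a named group uses only that group, so mediapartners-google is not bound by the Disallow rules listed under "*".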

Other Records

Field Value
sitemap https://www.thehealthboard.com/sitemap-thehealthboard.com-index.xml