hd.stheadline.com
robots.txt

Robots Exclusion Standard data for hd.stheadline.com

Resource Scan

Scan Details

Site Domain hd.stheadline.com
Base Domain stheadline.com
Scan Status Ok
Last Scan 2024-06-14T22:48:48+00:00
Next Scan 2024-06-21T22:48:48+00:00

Last Scan

Scanned 2024-06-14T22:48:48+00:00
URL https://hd.stheadline.com/robots.txt
Domain IPs 104.22.12.216, 104.22.13.216, 172.67.27.248, 2606:4700:10::6816:cd8, 2606:4700:10::6816:dd8, 2606:4700:10::ac43:1bf8
Response IP 172.67.27.248
Found Yes
Hash aea3d2c8334b15e81e35016648dd02a5d2a3d016b7828be272b3b09e390b64d3
SimHash 5f10d2145353

Groups

googlebot
bingbot
msnbot
slurp
yandex
baiduspider

Rule Path
Allow /
Disallow /member/
Disallow /privacy.php
Disallow /terms.php
Disallow /qrcode_terms.php
Disallow /copyright.php
Disallow /membership/
Disallow /member
Disallow /ajax/
Disallow /m?
Disallow /m/
Disallow /mobile/
Disallow /racing_odds/
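
Because the group opens with `Allow /` and then lists narrower `Disallow` paths, the outcome for a URL depends on how a crawler resolves overlapping rules. The sketch below assumes the longest-match precedence described in RFC 9309 (the most specific matching path wins, and a tie goes to Allow). The `RULES` list is copied from the table above; the function name `is_allowed` is hypothetical and only illustrates the idea. Plain prefix matching is enough here because none of the listed paths use `*` or `$`.

```python
# Rules copied from the report above, in the order listed.
RULES = [
    ("allow", "/"),
    ("disallow", "/member/"),
    ("disallow", "/privacy.php"),
    ("disallow", "/terms.php"),
    ("disallow", "/qrcode_terms.php"),
    ("disallow", "/copyright.php"),
    ("disallow", "/membership/"),
    ("disallow", "/member"),
    ("disallow", "/ajax/"),
    ("disallow", "/m?"),
    ("disallow", "/m/"),
    ("disallow", "/mobile/"),
    ("disallow", "/racing_odds/"),
]


def is_allowed(path, rules=RULES):
    """Return True if `path` may be fetched under longest-match precedence.

    Assumption: simple prefix matching, no wildcard support.
    """
    best_len = -1
    best_allow = True  # no matching rule means the path is allowed
    for kind, prefix in rules:
        if path.startswith(prefix):
            allow = kind == "allow"
            # Longer prefix wins; on equal length, Allow wins.
            if len(prefix) > best_len or (len(prefix) == best_len and allow):
                best_len = len(prefix)
                best_allow = allow
    return best_allow


for candidate in ["/news/article/1", "/member/login", "/m?id=1", "/music"]:
    print(candidate, is_allowed(candidate))
```

Under this reading, `/member/login` and `/m?id=1` are blocked while `/news/article/1` and `/music` are allowed. A parser that instead applies rules in file order would stop at `Allow /` and permit every path, so the result depends on the crawler's matching strategy.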

Other Records

Field Value
crawl-delay 300
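
To put the crawl-delay value in perspective, the short calculation below assumes the common convention that the value is a number of seconds to wait between successive requests. `Crawl-delay` is not part of RFC 9309 and not every crawler honors it, so this is an upper bound for a crawler that does. The helper name `max_fetches_per_day` is hypothetical.

```python
CRAWL_DELAY_SECONDS = 300  # value from the report above
SECONDS_PER_DAY = 24 * 60 * 60


def max_fetches_per_day(delay_seconds):
    """Upper bound on requests per day when waiting `delay_seconds` between them."""
    return SECONDS_PER_DAY // delay_seconds


print(max_fetches_per_day(CRAWL_DELAY_SECONDS))  # 288
```

A delay of 300 seconds is one request every five minutes, or at most 288 requests per day for a single compliant crawler.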

Other Records

Field Value
sitemap http://hd.stheadline.com/sitemap.xml
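
The crawl-delay and sitemap records can also be read programmatically. The sketch below uses Python's standard `urllib.robotparser` on a robots.txt text reconstructed from this report. The reconstruction is an assumption: the live file may differ in ordering, comments, or additional groups, and the rule list is shortened here. `site_maps()` requires Python 3.8 or later.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the report above (an assumption, not the live file);
# only two of the Disallow rules are included to keep the sketch short.
ROBOTS_TXT = """\
User-agent: googlebot
User-agent: bingbot
User-agent: msnbot
User-agent: slurp
User-agent: yandex
User-agent: baiduspider
Allow: /
Disallow: /member/
Disallow: /ajax/
Crawl-delay: 300
Sitemap: http://hd.stheadline.com/sitemap.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# Crawl delay for one of the named user agents.
delay = parser.crawl_delay("bingbot")
# Sitemap URLs declared anywhere in the file.
sitemaps = parser.site_maps()

print(delay)
print(sitemaps)
```

Note that the sitemap is declared with an `http://` URL while the robots.txt itself was fetched over `https://`.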