hd.stheadline.com
robots.txt
Robots Exclusion Standard data for hd.stheadline.com
Resource Scan
Scan Details
Site Domain | hd.stheadline.com |
Base Domain | stheadline.com |
Scan Status | Ok |
Last Scan | 2024-11-16T11:37:27+00:00 |
Next Scan | 2024-11-23T11:37:27+00:00 |
Last Scan
Scanned | 2024-11-16T11:37:27+00:00 |
URL | https://hd.stheadline.com/robots.txt |
Domain IPs | 104.22.12.216, 104.22.13.216, 172.67.27.248, 2606:4700:10::6816:cd8, 2606:4700:10::6816:dd8, 2606:4700:10::ac43:1bf8 |
Response IP | 104.22.13.216 |
Found | Yes |
Hash | aea3d2c8334b15e81e35016648dd02a5d2a3d016b7828be272b3b09e390b64d3 |
SimHash | 5f10d2145353 |
Groups
googlebot
bingbot
msnbot
slurp
yandex
baiduspider
Rule | Path |
---|---|
Allow | / |
Disallow | /member/ |
Disallow | /privacy.php |
Disallow | /terms.php |
Disallow | /qrcode_terms.php |
Disallow | /copyright.php |
Disallow | /membership/ |
Disallow | /member |
Disallow | /ajax/ |
Disallow | /m? |
Disallow | /m/ |
Disallow | /mobile/ |
Disallow | /racing_odds/ |
Other Records
Field | Value |
---|---|
crawl-delay | 300 |
Other Records
Field | Value |
---|---|
sitemap | http://hd.stheadline.com/sitemap.xml |