hotsheet.com
robots.txt
Robots Exclusion Standard data for hotsheet.com
Resource Scan
Scan Details
Site Domain | hotsheet.com |
Base Domain | hotsheet.com |
Scan Status | Ok |
Last Scan | 2024-11-16T11:40:13+00:00 |
Next Scan | 2024-11-23T11:40:13+00:00 |
Last Scan
Scanned | 2024-11-16T11:40:13+00:00 |
URL | https://hotsheet.com/robots.txt |
Redirect | https://www.hotsheet.com/robots.txt |
Redirect Domain | www.hotsheet.com |
Redirect Base | hotsheet.com |
Domain IPs | 104.130.26.115 |
Redirect IPs | 104.130.26.115 |
Response IP | 104.130.26.115 |
Found | Yes |
Hash | 4b4ff7211f6f2582669a27a5525c19e42fd4e3fb63e9ab64aabad6945dc768a2 |
SimHash | fd0d95450590 |
Groups
*
Rule | Path |
---|---|
Disallow | /ads/ |
Disallow | /cache/ |
Disallow | /images/ |
Disallow | /dmoz/ |
Disallow | /label/ |
Disallow | /link/ |
Disallow | /local/ |
Disallow | /wap/ |
Disallow | /magpierss/ |
Disallow | /index_22.php |
Disallow | /index_33.php |
Disallow | /add-spot.php |
Disallow | /add-video.php |
Disallow | /add-post.php |
Disallow | /news202*.php$ |
Disallow | /cat*.php$ |
Disallow | /bld*.php$ |
Other Records
Field | Value |
---|---|
sitemap | https://www.hotsheet.com/sitemap.xml |